notesum.ai

Published at November 29

KV Shifting Attention Enhances Language Modeling

cs.CL

Released Date: November 29, 2024

Authors: Mingyu Xu1, Wei Cheng1, Bingning Wang1, Weipeng Chen1

Aff.: 1Baichuan Inc.

Arxiv: http://arxiv.org/pdf/2411.19574v1