notesum.ai

Published at November 26

An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models

cs.LG

Released Date: November 26, 2024

Authors: Yunzhe Hu1, Difan Zou2, Dong Xu1

Aff.: 1School of Computing and Data Science, The University of Hong Kong; 2School of Computing and Data Science & Institute of Data Science, The University of Hong Kong

Arxiv: http://arxiv.org/abs/2411.17182v1