notesum.ai

Published on November 11

LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models

cs.CL
cs.AI
cs.LG

Release Date: November 11, 2024

Authors: Runming Yang (1), Taiqiang Wu (2), Jiahao Wang (3), Pengfei Hu (4), Ngai Wong (2), Yujiu Yang (1)

Affiliations: (1) Shenzhen International Graduate School, Tsinghua University, Shenzhen, China; (2) Department of EEE, The University of Hong Kong, Hong Kong, China; (3) Department of Computer Science, The University of Hong Kong, Hong Kong, China; (4) PCG, Tencent, Beijing, China

arXiv: http://arxiv.org/abs/2411.06839v1