notesum.ai

Published at November 5

Change Is the Only Constant: Dynamic LLM Slicing based on Layer Redundancy

cs.CL
cs.AI
cs.LG
I.2.7; I.2.0

Released Date: November 5, 2024

Authors: Razvan-Gabriel Dumitru1, Paul-Ioan Clotan2, Vikas Yadav3, Darius Peteleaza4, Mihai Surdeanu1

Aff.: 1University of Arizona; 2Università di Bologna; 3ServiceNow AI; 4Lucian Blaga University of Sibiu

Arxiv: http://arxiv.org/abs/2411.03513v1