notesum.ai

Published at December 9

LLM-BIP: Structured Pruning for Large Language Models with Block-Wise Forward Importance Propagation

cs.CL
cs.AI

Released Date: December 9, 2024

Authors: Haihang Wu1

Aff.: 1The University of Melbourne

Arxiv: http://arxiv.org/pdf/2412.06419v1