notesum.ai

Published at November 21

DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization

cs.CL

Released Date: November 21, 2024

Authors: Hexuan Deng1, Wenxiang Jiao1, Xuebo Liu1, Min Zhang1, Zhaopeng Tu1

Aff.: 1Institute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, China

Arxiv: http://arxiv.org/abs/2411.14055v1