notesum.ai

Published on October 31

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

cs.CL
cs.AI
cs.CV
cs.LG

Release Date: October 31, 2024

Authors: Xinghao Wang¹, Pengyu Wang¹, Bo Wang¹, Dong Zhang¹, Yunhua Zhou², Xipeng Qiu¹

Affiliations: ¹ Fudan University; ² Shanghai Artificial Intelligence Laboratory

arXiv: http://arxiv.org/abs/2410.23918v1