notesum.ai

Published on November 26

Pushing the Limits of Large Language Model Quantization via the Linearity Theorem

cs.LG

Release Date: November 26, 2024

Authors: Vladimir Malinovskii¹, Andrei Panferov², Ivan Ilin³, Han Guo⁴, Peter Richtárik³, Dan Alistarh

Affiliations: ¹Yandex, HSE University; ²ISTA; ³GenAI CoE, KAUST; ⁴MIT

arXiv: http://arxiv.org/abs/2411.17525v1