notesum.ai

Published at December 6

Direct Quantized Training of Language Models with Stochastic Rounding

cs.LG
cs.CL

Released Date: December 6, 2024

Authors: Kaiyan Zhao1, Tsuguchika Tabaru2, Kenichi Kobayashi2, Takumi Honda2, Masafumi Yamazaki2, Yoshimasa Tsuruoka1

Aff.: 1The University of Tokyo; 2Fujitsu Limited

Arxiv: http://arxiv.org/pdf/2412.04787v1