notesum.ai

Published on December 6

Adaptive Optimization for Enhanced Efficiency in Large-Scale Language Model Training

cs.AI

Release Date: December 6, 2024

Authors: Jiajing Chen¹, Bingying Liu², Xiaoxuan Liao¹, Jia Gao³, Hongye Zheng⁴, Yue Li⁵

Affiliations: ¹New York University; ²Independent Researcher; ³Stevens Institute of Technology; ⁴The Chinese University of Hong Kong; ⁵Purdue University

arXiv: http://arxiv.org/pdf/2412.04718v1