notesum.ai

Published at December 6

APOLLO: SGD-like Memory, AdamW-level Performance

cs.LG
cs.AI
cs.PF

Released Date: December 6, 2024

Authors: Hanqing Zhu1, Zhenyu Zhang1, Wenyan Cong1, Xi Liu1, Sem Park1, Vikas Chandra1, Bo Long1, David Z. Pan2, Zhangyang Wang2, Jinwon Lee1

Aff.: 1AI at Meta; 2University of Texas at Austin

Arxiv: http://arxiv.org/pdf/2412.05270v1