Published on December 10

EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models

cs.DC
cs.AI

Release Date: December 10, 2024

Authors: Jialiang Cheng, Ning Gao, Yun Yue, Zhiling Ye, Jiadi Jiang, Jian Sha

Aff.: Ant Group

arXiv: http://arxiv.org/pdf/2412.07210v1