notesum.ai

Published at November 16

HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization

cs.AI

Released Date: November 16, 2024

Authors: Huaqin Zhao1, Jiaxi Li1, Yi Pan1, Shizhe Liang1, Xiaofeng Yang2, Wei Liu3, Xiang Li4, Fei Dou1, Tianming Liu1, Jin Lu1

Aff.: 1University of Georgia; 2Emory University; 3Mayo Clinic; 4Massachusetts General Hospital and Harvard Medical School

Arxiv: http://arxiv.org/abs/2411.10696v1