notesum.ai

Published on November 11

Warmstarting for Scaling Language Models

cs.LG
cs.AI

Release Date: November 11, 2024

Authors: Neeratyoy Mallik (1), Maciej Janowski (2), Johannes Hog (1), Herilalaina Rakotoarison (1), Aaron Klein (3), Josif Grabocka (4), Frank Hutter (5)

Affiliations: (1) University of Freiburg; (2) University of Freiburg, University of Technology Nuremberg; (3) ScaDS.AI Leipzig; (4) University of Technology Nuremberg; (5) ELLIS Institute Tübingen, University of Freiburg

arXiv: http://arxiv.org/abs/2411.07340v1