notesum.ai

Published on November 4

Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language Models

cs.CL
cs.AI
cs.CE
cs.LG

Released Date: November 4, 2024

Authors: Jonas Zausinger¹, Lars Pennig¹, Kacper Chlodny¹, Vincent Limbach¹, Anna Ketteler¹, Thorben Prein¹, Vishwa Mohan Singh², Michael Morris Danziger³, Jannis Born³

Affiliations: ¹ TU Munich, Germany; TUM.AI, Germany. ² TUM.AI, Germany; LMU Munich, Germany. ³ IBM Research Europe, Switzerland.

arXiv: http://arxiv.org/abs/2411.02083v1