notesum.ai

Published at December 5

The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation

cs.CL
cs.AI

Released Date: December 5, 2024

Authors: Fredrik Carlsson1, Fangyu Liu2, Daniel Ward1, Murathan Kurfali1, Joakim Nivre3

Aff.: 1RISE Research Institutes of Sweden; 2Google DeepMind; 3Uppsala University

Arxiv: http://arxiv.org/pdf/2412.04318v1