notesum.ai

Published at October 31

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

cs.CL
cs.AI
cs.LG

Released Date: October 31, 2024

Authors: Ming Li1, Yanhong Li2, Tianyi Zhou1

Aff.: 1University of Maryland; 2University of Chicago

Arxiv: http://arxiv.org/abs/2410.23743v1