notesum.ai

Published at November 1

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling

cs.CL
cs.AI
cs.LG

Released Date: November 1, 2024

Authors: Yiwen Ding1, Zhiheng Xi1, Wei He1, Zhuoyuan Li2, Yitao Zhai3, Xiaowei Shi3, Xunliang Cai3, Tao Gui1, Qi Zhang1, Xuanjing Huang1

Aff.: 1Fudan University; 2Macau University of Science and Technology; 3Meituan

Arxiv: http://arxiv.org/abs/2411.00750v1