notesum.ai

Published at November 6

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

cs.AI
cs.CL
cs.LG
stat.ML
I.2.7

Released Date: November 6, 2024

Authors: Haolin Chen1, Yihao Feng1, Zuxin Liu1, Weiran Yao1, Akshara Prabhakar1, Shelby Heinecke1, Ricky Ho1, Phil Mui1, Silvio Savarese1, Caiming Xiong1, Huan Wang1

Aff.: 1Salesforce AI Research

Arxiv: http://arxiv.org/abs/2411.04282v1