notesum.ai

Published at November 18

Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search

cs.CL
cs.AI

Released Date: November 18, 2024

Authors: Jinhao Jiang1, Zhipeng Chen1, Yingqian Min1, Jie Chen1, Xiaoxue Cheng1, Jiapeng Wang1, Yiru Tang1, Haoxiang Sun2, Jia Deng1, Wayne Xin Zhao1, Zheng Liu3, Dong Yan4, Jian Xie4, Zhongyuan Wang3, Ji-Rong Wen1

Aff.: 1Gaoling School of Artificial Intelligence, Renmin University of China; 2School of Information, Renmin University of China; 3BAAI; 4Baichuan AI

Arxiv: http://arxiv.org/abs/2411.11694v1