notesum.ai

Published at October 31

RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner

cs.AI

cs.LG

Released Date: October 31, 2024

Authors: Fu-Chieh Chang¹, Yu-Ting Lee², Hui-Ying Shih³, Pei-Yuan Wu⁴

Aff.: ¹MediaTek Research, Taipei, Taiwan; ²Department of Mathematical Sciences, National Chengchi University, Taipei, Taiwan; ³Department of Mathematics, National Tsing Hua University, Hsinchu, Taiwan; ⁴Graduate Institute of Communication Engineering, National Taiwan University, Taipei, Taiwan

Arxiv: http://arxiv.org/abs/2410.23912v1