notesum.ai

Published on November 19

Reward Modeling with Ordinal Feedback: Wisdom of the Crowd

cs.AI
cs.CL
stat.ML

Released Date: November 19, 2024

Authors: Shang Liu (1), Yu Pan (2), Guanting Chen (3), Xiaocheng Li (1)

Affiliations: (1) Imperial College Business School, Imperial College London; (2) Department of Intelligent Transportation, HKUST-GZ; (3) Department of Statistics and Operations Research, University of North Carolina

arXiv: http://arxiv.org/abs/2411.12843v1