notesum.ai

Published at October 31

Towards Reliable Alignment: Uncertainty-aware RLHF

cs.AI
cs.LG

Released Date: October 31, 2024

Authors: Debangshu Banerjee1, Aditya Gopalan1

Aff.: 1Department of Electrical and Communication Engineering, Indian Institute of Science, India

Arxiv: http://arxiv.org/abs/2410.23726v1