notesum.ai

Published at November 12

Entropy Controllable Direct Preference Optimization

cs.LG
cs.AI
cs.CL

Released Date: November 12, 2024

Authors: Motoki Omura1, Yasuhiro Fujita2, Toshiki Kataoka2

Aff.: 1The University of Tokyo; 2Preferred Networks, Inc.

Arxiv: http://arxiv.org/abs/2411.07595v1