notesum.ai

Published at November 7

Towards Improved Preference Optimization Pipeline: from Data Generation to Budget-Controlled Regularization

cs.LG
cs.AI
cs.CL

Released Date: November 7, 2024

Authors: Zhuotong Chen1, Fang Liu1, Jennifer Zhu1, Wanyu Du1, Yanjun Qi1

Aff.: 1AWS Bedrock Science

Arxiv: http://arxiv.org/abs/2411.05875v1