notesum.ai

Published at November 6

Self-Consistency Preference Optimization

cs.CL
cs.AI
cs.LG

Released Date: November 6, 2024

Authors: Archiki Prasad1, Weizhe Yuan1, Richard Yuanzhe Pang1, Jing Xu1, Maryam Fazel-Zarandi1, Mohit Bansal2, Sainbayar Sukhbaatar1, Jason Weston1, Jane Yu1

Aff.: 1Meta FAIR; 2UNC Chapel Hill

Arxiv: http://arxiv.org/abs/2411.04109v1