notesum.ai

Published at December 3

Optimizing Latent Goal by Learning from Trajectory Preference

cs.AI
cs.LG

Released Date: December 3, 2024

Authors: Guangyu Zhao1, Kewei Lian1, Haowei Lin1, Haobo Fu2, Qiang Fu2, Shaofei Cai1, Zihao Wang1, Yitao Liang1

Aff.: 1Peking University; 2Tencent AI Lab

Arxiv: http://arxiv.org/pdf/2412.02125v1