notesum.ai

Published at November 22

Continual SFT Matches Multimodal RLHF with Negative Supervision

cs.AI
cs.CL
cs.CV

Released Date: November 22, 2024

Authors: Ke Zhu1, Yu Wang2, Yanpeng Sun3, Qiang Chen2, Jiangjiang Liu2, Gang Zhang2, Jingdong Wang2

Aff.: 1Nanjing University; 2Baidu VIS; 3Nanjing University of Science and Technology

Arxiv: http://arxiv.org/abs/2411.14797v1