notesum.ai

Published on November 27

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

cs.CV
cs.CL

Release Date: November 27, 2024

Authors: Di Zhang (1), Jingdi Lei (2), Junxian Li (3), Xunzhi Wang (4), Yujie Liu (5), Zonglin Yang (6), Jiatong Li (7), Weida Wang (8), Suorong Yang (9), Jianbo Wu (10), Peng Ye (1), Wanli Ouyang (11), Dongzhan Zhou (11)

Affiliations: (1) Fudan University; (2) Beijing Institute of Technology; (3) Shanghai Jiaotong University; (4) Nankai University; (5) Shanghai University; (6) Nanyang Technological University; (7) Hong Kong Polytechnic University; (8) Tongji University; (9) Nanjing University; (10) University of California, Merced; (11) Shanghai Artificial Intelligence Laboratory

arXiv: http://arxiv.org/abs/2411.18203v1