Published on October 21

Improve Vision Language Model Chain-of-thought Reasoning

cs.AI
cs.CV
68T07

Release Date: October 21, 2024

Authors: Ruohong Zhang¹, Bowen Zhang², Yanghao Li², Haotian Zhang², Zhiqing Sun¹, Zhe Gan², Yinfei Yang², Ruoming Pang², Yiming Yang¹

Affiliations: ¹CMU LTI; ²Apple

arXiv: https://arxiv.org/abs/2410.16198v1