notesum.ai

Published at December 6

CompCap: Improving Multimodal Large Language Models with Composite Captions

cs.CV
cs.AI
cs.LG

Released Date: December 6, 2024

Authors: Xiaohui Chen1, Satya Narayan Shukla2, Mahmoud Azab2, Aashu Singh2, Qifan Wang2, David Yang2, ShengYun Peng3, Hanchao Yu2, Shen Yan2, Xuewen Zhang2, Baosheng He2

Aff.: 1Meta, Tufts University; 2Meta; 3Meta, Georgia Tech

Arxiv: http://arxiv.org/pdf/2412.05243v1