notesum.ai

Published at December 4

Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning

cs.CV

Released Date: December 4, 2024

Authors: Wujian Peng1, Lingchen Meng, Yitong Chen, Yiweng Xie, Yang Liu, Tao Gui, Hang Xu, Xipeng Qiu, Zuxuan Wu, Yu-Gang Jiang

Aff.: 1Fudan University

Arxiv: http://arxiv.org/pdf/2412.03565v1