notesum.ai

Published at November 21

Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts

cs.CV

Released Date: November 21, 2024

Authors: Honglin Li1, Yuting Gao2, Chenglu Zhu3, Jingdong Chen, Ming Yang2, Lin Yang3

Aff.: 1Zhejiang University; 2Ant Group; 3Westlake University

Arxiv: http://arxiv.org/abs/2411.13909v1