notesum.ai

Published on November 21

FocusLLaVA: A Coarse-to-Fine Approach for Efficient and Effective Visual Token Compression

cs.CV

Release Date: November 21, 2024

Authors: Yuke Zhu¹, Chi Xie², Shuang Liang², Bo Zheng¹, Sheng Guo¹

Affiliations: ¹Mybank, Ant Group; ²Tongji University

arXiv: http://arxiv.org/abs/2411.14228v1