notesum.ai

Published at December 6

LinVT: Empower Your Image-level Large Language Model to Understand Videos

cs.CV
cs.LG
cs.MM

Released Date: December 6, 2024

Authors: Lishuai Gao, Yujie Zhong, Yingsen Zeng, Haoxian Tan, Dengjie Li, Zheng Zhao

Arxiv: http://arxiv.org/pdf/2412.05185v1