VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
cs.CV
cs.AI
Release Date: November 17, 2024
Authors: Yunlong Tang¹, Junjia Guo¹, Hang Hua¹, Susan Liang¹, Mingqian Feng¹, Xinyang Li¹, Rui Mao¹, Chao Huang¹, Jing Bi¹, Zeliang Zhang¹, Pooyan Fazli², Chenliang Xu¹
Affiliations: ¹University of Rochester; ²Arizona State University
| Compositional QA |