notesum.ai

Published on November 22, 2024

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Categories: cs.CV, cs.AI, cs.CL

Released Date: November 22, 2024

Authors: Songhao Han¹, Wei Huang², Hairong Shi¹, Le Zhuo³, Xiu Su⁴, Shifeng Zhang⁵, Xu Zhou⁵, Xiaojuan Qi², Yue Liao⁶, Si Liu¹

Affiliations: ¹Beihang University; ²The University of Hong Kong; ³Shanghai AI Lab; ⁴Central South University; ⁵Sangfor Technologies Inc.; ⁶CUHK

arXiv: http://arxiv.org/abs/2411.14794v1