notesum.ai

Published at November 20

Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension

cs.CV
cs.AI

Released Date: November 20, 2024

Authors: Yongdong Luo1, Xiawu Zheng1, Xiao Yang1, Guilin Li1, Haojia Lin1, Jinfa Huang2, Jiayi Ji1, Fei Chao1, Jiebo Luo2, Rongrong Ji1

Aff.: 1Xiamen University; 2University of Rochester

Arxiv: http://arxiv.org/abs/2411.13093v1