
Published on November 22

Whats in a Video: Factorized Autoregressive Decoding for Online Dense Video Captioning

cs.CV
cs.CL

Release Date: November 22, 2024

Authors: AJ Piergiovanni¹, Dahun Kim¹, Michael S. Ryoo¹, Isaac Noble¹, Anelia Angelova¹

Aff.: ¹Google DeepMind

Arxiv: http://arxiv.org/abs/2411.14688v1