notesum.ai

Published at December 6

Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model

cs.CV

Released Date: December 6, 2024

Authors: Keunwoo Peter Yu1, Achal Dave2, Rares Ambrus2, Jean Mercat2

Aff.: 1University of Michigan; 2Toyota Research Institute

Arxiv: http://arxiv.org/pdf/2412.04729v1