notesum.ai

Published on November 22

Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction

cs.CV
cs.AI

Release Date: November 22, 2024

Authors: Huiwon Jang¹, Sihyun Yu¹, Jinwoo Shin¹, Pieter Abbeel², Younggyo Seo²

Affiliations: ¹KAIST; ²UC Berkeley

arXiv: http://arxiv.org/abs/2411.14762v1