notesum.ai

Published at December 5

DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models

cs.CV

Released Date: December 5, 2024

Authors: Yizhuo Li1, Yuying Ge2, Yixiao Ge2, Ping Luo1, Ying Shan2

Aff.: 1The University of Hong Kong; 2ARC Lab, Tencent

Arxiv: http://arxiv.org/pdf/2412.04446v1