notesum.ai

Published at December 4

Mimir: Improving Video Diffusion Models for Precise Text Understanding

cs.CV

Released Date: December 4, 2024

Authors: Shuai Tan1, Biao Gong1, Yutong Feng2, Kecheng Zheng1, Dandan Zheng1, Shuwei Shi1, Yujun Shen1, Jingdong Chen1, Ming Yang1

Aff.: 1Ant Group; 2Tsinghua University

Arxiv: http://arxiv.org/pdf/2412.03085v1