notesum.ai

Published on December 10

STIV: Scalable Text and Image Conditioned Video Generation

cs.CV
cs.AI
cs.LG
cs.MM

Release Date: December 10, 2024

Authors: Zongyu Lin¹, Wei Liu¹, Chen Chen¹, Jiasen Lu¹, Wenze Hu¹, Tsu-Jui Fu¹, Jesse Allardice¹, Zhengfeng Lai¹, Liangchen Song¹, Bowen Zhang¹, Cha Chen¹, Yiran Fei¹, Yifan Jiang¹, Lezhi Li¹, Yizhou Sun², Kai-Wei Chang², Yinfei Yang¹

Affiliations: ¹Apple; ²University of California, Los Angeles

arXiv: http://arxiv.org/pdf/2412.07730v1