notesum.ai

Published on November 11

ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis

cs.CV
cs.AI

Release Date: November 11, 2024

Authors: Zanlin Ni¹, Yulin Wang¹, Renping Zhou¹, Yizeng Han¹, Jiayi Guo¹, Zhiyuan Liu¹, Yuan Yao², Gao Huang¹

Affiliations: ¹Tsinghua University; ²National University of Singapore

arXiv: http://arxiv.org/abs/2411.06959v1