notesum.ai

Published on December 4

SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model

cs.CV
cs.LG
cs.SD

Release Date: December 4, 2024

Authors: Yan Li¹, Ziya Zhou¹, Zhiqiang Wang¹, Wei Xue¹, Wenhan Luo¹, Yike Guo¹

Aff.: ¹University

arXiv: http://arxiv.org/pdf/2412.03430v1