notesum.ai

Published at November 29

V2SFlow: Video-to-Speech Generation with Speech Decomposition and Rectified Flow

cs.CV
cs.SD
eess.AS

Released Date: November 29, 2024

Authors: Jeongsoo Choi1, Ji-Hoon Kim1, Jinyu Li2, Joon Son Chung1, Shujie Liu2

Aff.: 1Korea Advanced Institute of Science and Technology; 2Microsoft

Arxiv: http://arxiv.org/pdf/2411.19486v1