notesum.ai

Published at November 4

CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching

cs.SD
cs.AI
eess.AS

Released Date: November 4, 2024

Authors: Yu Pan1, Yuguang Yang2, Jixun Yao2, Jianhao Ye2, Hongbin Zhou2, Lei Ma3, Jianjun Zhao1

Aff.: 1Kyushu University, Japan; 2Ximalaya Inc., China; 3University of Tokyo, Japan

Arxiv: http://arxiv.org/abs/2411.02026v1