notesum.ai

Published on December 9

Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey

cs.CL
cs.AI
cs.LG
cs.MM
cs.SD
eess.AS

Release Date: December 9, 2024

Authors: Tianxin Xie¹, Yan Rong¹, Pengfei Zhang¹, Li Liu¹

Affiliation: ¹Hong Kong University of Science and Technology (Guangzhou)

arXiv: http://arxiv.org/pdf/2412.06602v1