notesum.ai

Published at December 10

Preserving Speaker Information in Direct Speech-to-Speech Translation with Non-Autoregressive Generation and Pretraining

cs.SD
cs.MM
eess.AS

Released Date: December 10, 2024

Authors: Rui Zhoua, Akinori Itoa, Takashi Nosea

Arxiv: http://arxiv.org/pdf/2412.07316v1