notesum.ai

Published at November 7

Audiobox TTA-RAG: Improving Zero-Shot and Few-Shot Text-To-Audio with Retrieval-Augmented Generation

eess.AS
cs.SD

Released Date: November 7, 2024

Authors: Mu Yang1, Bowen Shi2, Matthew Le2, Wei-Ning Hsu2, Andros Tjandra2

Aff.: 1Center for Robust Speech Systems (CRSS), University of Texas at Dallas, USA; 2Meta AI, USA

Arxiv: http://arxiv.org/abs/2411.05141v1