Audiobox TTA-RAG: Improving Zero-Shot and Few-Shot Text-To-Audio with Retrieval-Augmented Generation
eess.AS
cs.SD
Released Date: November 7, 2024
Authors: Mu Yang¹, Bowen Shi², Matthew Le², Wei-Ning Hsu², Andros Tjandra²
Affiliations: ¹Center for Robust Speech Systems (CRSS), University of Texas at Dallas, USA; ²Meta AI, USA
| Evaluation |
| Dataset |