notesum.ai
Published at November 22VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space
eess.AS
Released Date: November 22, 2024
Authors: Armani Rodriguez1, Silvija Kokalj-Filipovic1
Aff.: 1Rowan University

| R case 1 | R case 2 | |||
|---|---|---|---|---|
| Accuracy | 0.966 | 0.961 | 0.912 | 0.909 |