notesum.ai
Published at December 5CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing
eess.AS
cs.CL
cs.LG
cs.SD
Released Date: December 5, 2024
Authors: Yen-Ju Lu1, Jing Liu, Thomas Thebaud, Laureano Moro-Velazquez, Ariya Rastrow, Najim Dehak, Jesus Villalba
Aff.: 1Johns Hopkins University

| ASR Adapted. | Bottleneck Dims. | ASR CER | SV | ||
| Normal | Few-shots | EER | DCF | ||
| XLSR | - | 29.0 | 39.0 | 1.29 | 0.093 |
| + ASR-FT | - | 17.1 | 32.2 | 1.29 | 0.095 |
| + ASR-Houlsby | 256 | 20.3 | 34.6 | 1.37 | 0.097 |
| + ASR-CA-XLSRL (ours) | 256 | 18.6 | 31.6 | 1.15 | 0.088 |