notesum.ai
Published at November 29Scaling Transformers for Low-Bitrate High-Quality Speech Coding
eess.AS
cs.AI
cs.LG
cs.SD
eess.SP
Released Date: November 29, 2024
Authors: Julian D Parker, Anton Smirnov1, Jordi Pons1, CJ Carr1, Zack Zukowski1, Zach Evans1, Xubo Liu1
Aff.: 1Stability AI

| Model | BPS | TPF | TPS | SISDR | Mel | STFT | PESQ | STOI | MOSNet |
| DAC | |||||||||
| Encodec | |||||||||
| SpeechTokenizer | |||||||||
| SemantiCodec | – | ||||||||
| – | |||||||||
| Mimi | |||||||||
| TAAE | |||||||||
| + no quant. |