Dynamic Vocabulary Pruning in Early-Exit LLMs
cs.RO
cs.AI
Released Date: October 24, 2024
Authors: Jort Vincenti, Karim Abdel Sadek, Joan Velja, Matteo Nulli, Metod Jazbec
Affiliation: University of Amsterdam

| Dataset | Conf. | Method | Score | FLOPs/Token | Avg. Exit | Conf. Time (s) |
|---|---|---|---|---|---|---|
| SQuAD [13] | | CALM | 87.5 | 2.21 | 2.4 | 44.5 |
| | | + DVP (ours) | 87.4 | 1.97 | 2.4 | 40.8 |
| | | CALM | 90.6 | 13.91 | 20.9 | 499.9 |
| | | + DVP (ours) | 90.6 | 1.99 | 20.8 | 413.1 |
| SamSum [6] | | CALM | 33.8 | 4.21 | 5.5 | 90.0 |
| | | + DVP (ours) | 33.7 | 2.01 | 5.4 | 81.0 |
| | | CALM | 43.1 | 11.13 | 16.5 | 162.0 |
| | | + DVP (ours) | 43.1 | 2.12 | 16.4 | 136.0 |
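
The relative savings implied by the table can be checked with a little arithmetic. The following is a minimal Python sketch, not code from the paper: it pairs each CALM row with the DVP row directly below it and computes the percentage reduction in FLOPs/Token and confidence time. The `ROWS` list, the function name `relative_savings`, and the group labels `"A"` and `"B"` are assumptions introduced here for illustration; the labels stand in for the two row groups per dataset because the Conf. column values are not available in this copy of the table.

```python
# Rows copied from the table above:
# (dataset, group, method, score, flops_per_token, avg_exit, conf_time_s)
# "group" is a hypothetical label (A = first CALM/DVP pair, B = second pair);
# it replaces the Conf. column, whose values are missing in this copy.
ROWS = [
    ("SQuAD", "A", "CALM", 87.5, 2.21, 2.4, 44.5),
    ("SQuAD", "A", "DVP", 87.4, 1.97, 2.4, 40.8),
    ("SQuAD", "B", "CALM", 90.6, 13.91, 20.9, 499.9),
    ("SQuAD", "B", "DVP", 90.6, 1.99, 20.8, 413.1),
    ("SamSum", "A", "CALM", 33.8, 4.21, 5.5, 90.0),
    ("SamSum", "A", "DVP", 33.7, 2.01, 5.4, 81.0),
    ("SamSum", "B", "CALM", 43.1, 11.13, 16.5, 162.0),
    ("SamSum", "B", "DVP", 43.1, 2.12, 16.4, 136.0),
]


def relative_savings(rows):
    """Pair each CALM row with its DVP row and return the relative changes."""
    # Index the CALM baselines by (dataset, group).
    baseline = {(d, g): vals for (d, g, m, *vals) in rows if m == "CALM"}
    savings = {}
    for d, g, m, *vals in rows:
        if m != "DVP":
            continue
        score, flops, _avg_exit, conf_time = vals
        b_score, b_flops, _b_avg_exit, b_conf_time = baseline[(d, g)]
        savings[(d, g)] = {
            # Absolute change in task score (negative means DVP is lower).
            "score_delta": round(score - b_score, 2),
            # Percentage reduction relative to the CALM baseline.
            "flops_reduction_pct": round(100 * (1 - flops / b_flops), 1),
            "time_reduction_pct": round(100 * (1 - conf_time / b_conf_time), 1),
        }
    return savings


if __name__ == "__main__":
    for key, vals in relative_savings(ROWS).items():
        print(key, vals)
```

For the second SQuAD row group, for example, this gives a FLOPs/Token reduction of roughly 86% with an unchanged score, while the score difference in every pair stays within 0.1.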