SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding
cs.CL
cs.AI
Released Date: November 8, 2024
Authors: Ryan Sun¹, Tianyi Zhou², Xun Chen³, Lichao Sun¹
Affiliations: ¹Lehigh University; ²University of Maryland, College Park; ³Samsung Research America

| Temperature (T) | RRS | RRSw | SpecHub |
|---|---|---|---|
| 0.3 | 0.0426 | 0.1114 | 0.1184 |
| 0.6 | 0.0740 | 0.1089 | 0.1379 |
| 1.0 | 0.1021 | 0.1140 | 0.1660 |