notesum.ai

Published at November 8

SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding

cs.CL
cs.AI

Released Date: November 8, 2024

Authors: Ryan Sun1, Tianyi Zhou2, Xun Chen3, Lichao Sun1

Aff.: 1Lehigh University; 2University of Maryland, College Park; 3Samsung Research America

Arxiv: http://arxiv.org/abs/2411.05289v1