notesum.ai

Published on December 9

SparseAccelerate: Efficient Long-Context Inference for Mid-Range GPUs

cs.CL

Release Date: December 9, 2024

Authors: James Vo¹

Aff.: ¹AGILESODA INC., South Korea

arXiv: http://arxiv.org/pdf/2412.06198v1