notesum.ai

Published at November 18

BitMoD: Bit-serial Mixture-of-Datatype LLM Acceleration

cs.AR

Released Date: November 18, 2024

Authors: Yuzong Chen1, Ahmed F. AbouElhamayed1, Xilai Dai1, Yang Wang2, Marta Andronic3, George A. Constantinides3, Mohamed S. Abdelfattah1

Aff.: 1Computer Systems Lab, Cornell University; 2Systems and Networking Research Group, Microsoft Research; 3Department of Electrical and Electronic Engineering, Imperial College London

Arxiv: http://arxiv.org/abs/2411.11745v1