notesum.ai

Published at November 8

MicroScopiQ: Accelerating Foundational Models through Outlier-Aware Microscaling Quantization

cs.AR
cs.AI
cs.LG

Released Date: November 8, 2024

Authors: Akshat Ramachandran1, Souvik Kundu2, Tushar Krishna1

Aff.: 1Georgia Institute of Technology; 2Intel Labs

Arxiv: http://arxiv.org/abs/2411.05282v1