notesum.ai

Published at December 6

BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits

cs.CL
cs.AI
cs.NE

Released Date: December 6, 2024

Authors: Wazib Ansar1, Saptarsi Goswami2, Amlan Chakrabarti1

Aff.: 1University of Calcutta; 2Bangabasi Morning College

Arxiv: http://arxiv.org/pdf/2412.05225v1