notesum.ai

Published at October 21

Enabling Energy-Efficient Deployment of Large Language Models on Memristor Crossbar: A Synergy of Large and Small

cs.AR
cs.DC
q-bio.QM

Released Date: October 21, 2024

Authors: Zhehui Wang, Tao Luo, Cheng Liu, Weichen Liu, Rick Siow Mong Goh, Weng-Fai Wong

Arxiv: https://arxiv.org/abs/2410.15977v1