notesum.ai

Published at November 9

Optimizing Large Language Models through Quantization: A Comparative Analysis of PTQ and QAT Techniques

cs.LG
cs.AI
cs.CL

Released Date: November 9, 2024

Authors: Jahid Hasan1

Aff.: 1Unknown

Arxiv: http://arxiv.org/abs/2411.06084v1