notesum.ai

Published at December 10

TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation

cs.CL

Released Date: December 10, 2024

Authors: Alfredo Garrachón Ruiz1, Tomás de la Rosa1, Daniel Borrajo1

Aff.: 1AI Research, JPMorganChase

Arxiv: http://arxiv.org/pdf/2412.07682v1