notesum.ai

Published at December 6

GUIDE: A Global Unified Inference Engine for Deploying Large Language Models in Heterogeneous Environments

cs.AI

Released Date: December 6, 2024

Authors: Yanyu Chen1, Ganhong Huang1

Aff.: 1Sun Yat-sen University

Arxiv: http://arxiv.org/pdf/2412.04788v1