notesum.ai

Published on November 11

Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data

cs.LG
cs.AI
cs.CL
stat.ML

Release Date: November 11, 2024

Authors: Alex Havrilla¹, Wenjing Liao¹

Aff.: ¹ Department of Mathematics, Georgia Institute of Technology

arXiv: http://arxiv.org/abs/2411.06646v1