notesum.ai
Published at November 11Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data
cs.LG
cs.AI
cs.CL
stat.ML
Released Date: November 11, 2024
Authors: Alex Havrilla1, Wenjing Liao1
Aff.: 1Department of Mathematics, Georgia Institute of Technology

| GPT-2 | Chinchilla | ||
|---|---|---|---|