notesum.ai
Published at November 8Aioli: A Unified Optimization Framework for Language Model Data Mixing
cs.LG
cs.AI
cs.CL
stat.ML
Released Date: November 8, 2024
Authors: Mayee F. Chen1, Michael Y. Hu2, Nicholas Lourie3, Kyunghyun Cho4, Christopher Ré1
Aff.: 1Department of Computer Science, Stanford University; 2Center for Data Science, New York University; 3Computer Science Department, New York University; 4Computer Science Department, New York University, Prescient Design, Genentech

| Method | A/SE | GH/C4 | B/SE | A/B/SE | CC/GH/W | SlimPajama | # stratified | # extra runs |
|---|---|---|---|---|---|---|---|---|
| Stratified | - | 0 | ||||||
| GS | 4 | 10 | ||||||
| DML | 4 | 10 | ||||||
| Skill-It | 5 | |||||||
| DoReMi | 3 | 2 | ||||||
| DoGE | 1 | 1 | ||||||
| Aioli | 6 | 0 |