notesum.ai
Published at November 20BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
cs.AI
Released Date: November 20, 2024
Authors: Anka Reuel1, Amelia Hardy1, Chandler Smith2, Max Lamparth1, Malcolm Hardy1, Mykel J. Kochenderfer1
Aff.: 1Stanford University; 2Northeastern University

| Stage | FM | Non-FM | All |
|---|---|---|---|
| Design | 10.6 | 11.2 | 10.8 |
| Implementation | 5.5 | 7.5 | 6.2 |
| Documentation | 10.3 | 9.9 | 10.1 |
| Maintenance | 9.1 | 10.6 | 9.6 |