notesum.ai

Published on November 20

BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices

cs.AI

Release Date: November 20, 2024

Authors: Anka Reuel¹, Amelia Hardy¹, Chandler Smith², Max Lamparth¹, Malcolm Hardy¹, Mykel J. Kochenderfer¹

Affiliations: ¹Stanford University; ²Northeastern University

arXiv: http://arxiv.org/abs/2411.12990v1