BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices

Anka Reuel-Lamparth, Amelia F. Hardy, Chandler Smith, Max Lamparth, Malcolm Hardy, Mykel J. Kochenderfer

Published: 2024, Last Modified: 26 May 2026NeurIPS 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading