OpenReview.net
  • Login
back arrowGo to DBLP homepage

Establishing Best Practices for Building Rigorous Agentic Benchmarks

Open Webpage

Yuxuan Zhu, Tengjun Jin, Yada Pruksachatkun, Andy Zhang, Shu Liu, Sasha Cui, Sayash Kapoor, Shayne Longpre, Kevin Meng, Rebecca Weiss, Fazl Barez, Rahul Gupta, Jwala Dhamala, Jacob Merizian, Mario Giulianelli, Harry Coppock, Cozmin Ududec, Jasjeet S. Sekhon, Jacob Steinhardt, Antony Kellerman et al. (5 additional authors not shown)

Published: 2025, Last Modified: 09 Jan 2026CoRR 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
External IDs:dblp:journals/corr/abs-2507-02825
Loading
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Contact
  • Sponsors
  • Donate
  • FAQ
  • Terms of Use / Privacy Policy
  • News
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Sponsors
  • News
  • FAQ
  • Contact
  • Donate
  • Terms of Use
  • Privacy Policy

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2026 OpenReview