AgentBench: Evaluating LLMs as Agents

Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, Hanyu Lai, Yu Gu, Hangliang Ding, Kaiwen Men, Kejuan Yang, Shudan Zhang, Xiang Deng, Aohan Zeng, Zhengxiao Du, Chenhui Zhang, Sheng Shen, Tianjun Zhang, Yu Su, Huan Sun, Minlie Huang et al. (2 additional authors not shown)

Published: 01 Jan 2024, Last Modified: 17 May 2025. Venue: ICLR 2024. License: CC BY-SA 4.0
