AgentBench: Evaluating LLMs as Agents
Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, Hanyu Lai, Yu Gu, Hangliang Ding, Kaiwen Men, Kejuan Yang, Shudan Zhang, Xiang Deng, Aohan Zeng, Zhengxiao Du, Chenhui Zhang, Sheng Shen, Tianjun Zhang, Yu Su, Huan Sun, Minlie Huang, et al. (2 additional authors not shown)
Published: 01 Jan 2024, Last Modified: 17 May 2025
ICLR 2024
CC BY-SA 4.0