AgentBench: Evaluating LLMs as Agents

Xiao Liu, Jie Tang (Tsinghua University), Yu‐Cheng Gu, Hanyu Lai, Zhengxiao Du, Minlie Huang, Yuxiao Dong, Hao Yu, Yifan Xu, Hangliang Ding (Wenzhou Medical University), Chenhui Zhang (King Abdullah University of Science and Technology), Sheng Shen, Huan Sun, Tianjun Zhang, Xuanyu Lei, Kejuan Yang, Yu Su, Xiang Deng, Shudan Zhang, Hanchen Zhang (Chinese Academy of Sciences), Kaiwen Men, Aohan Zeng
arXiv (Cornell University)
August 7, 2023
Cited by 51

