AgentBench: Evaluating LLMs as AgentsXiao Liu, Jie Tang, Yu‐Cheng Gu et al.|arXiv (Cornell University)|2023Cited by 51