Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonTHUDM/AgentBench
AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)