Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonsierra-research/tau2-bench
tau2-bench
τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains