Tau2Bench
About Tau2Bench
Mock description: Tau2Bench evaluates multi-turn instruction-following robustness and conversation coherence.