Tau2Bench

About Tau2Bench

Mock description: evaluates multi-turn, instruction-following robustness and conversation coherence.