Mock description: measures instruction-following fidelity, adherence to constraints, and response robustness.