Knowledge check · 15 questions
Tests understanding of trajectory-level agent evaluation, task completion measurement, and quality tracking systems
Agent Quality Evaluation: Beyond Single-Step Accuracy — Quick Check
15 questions