01Comprehensive 100-point scoring system across five critical validation categories
02YAML-based test spec generation for standardized and repeatable agent testing
0332 GitHub stars
04Interactive simulated and live preview modes for debugging agent behavior
05Automated test execution with real-time coverage metrics for topics and actions
06Agentic Fix Loop for autonomous resolution of failing test cases