01Schema-validated YAML generation for structured agent test cases
02Configuration of LLM judges for qualitative response assessment
03Support for multi-role conversation threading including system, user, assistant, and tool roles
04Sequential evaluator chaining for multi-stage testing workflows
05Integration of custom code-based evaluators for programmatic validation
064 GitHub stars