Dynamic definition of quality dimensions and scoring thresholds
1 GitHub star
Integration with Langfuse for dataset and prompt management
Generation of structured YAML evaluation configurations
Built-in smoke testing to verify evaluation pipeline integrity
Automated discovery of agent entry points and execution flows
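
As a rough illustration of how quality dimensions, scoring thresholds, a YAML configuration, and a smoke test could fit together, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the `QualityDimension` class, the `to_yaml` and `smoke_test` helpers, the example dimension names, and the YAML layout are hypothetical and do not reflect the project's actual schema or the Langfuse API.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class QualityDimension:
    """Hypothetical quality dimension with a minimum passing score in [0, 1]."""
    name: str
    description: str
    threshold: float


def to_yaml(dimensions):
    """Render dimensions as a small YAML document (hand-rolled, stdlib only)."""
    lines = ["dimensions:"]
    for d in dimensions:
        lines.append(f"  - name: {d.name}")
        # json.dumps yields a double-quoted string, which YAML also accepts.
        lines.append(f"    description: {json.dumps(d.description)}")
        lines.append(f"    threshold: {d.threshold}")
    return "\n".join(lines) + "\n"


def smoke_test(dimensions, scores):
    """Return the names of dimensions whose score is missing or below threshold."""
    failures = []
    for d in dimensions:
        score = scores.get(d.name)
        if score is None or score < d.threshold:
            failures.append(d.name)
    return failures


# Illustrative dimensions only; real ones would be defined per project.
DIMENSIONS = [
    QualityDimension("accuracy", "Answer matches the reference", 0.8),
    QualityDimension("tone", "Response is polite and on-brand", 0.6),
]

if __name__ == "__main__":
    print(to_yaml(DIMENSIONS))
    print(smoke_test(DIMENSIONS, {"accuracy": 0.9, "tone": 0.5}))
```

Keeping the dimensions as plain data means thresholds can be changed without touching the scoring logic, and the same list drives both the generated configuration and the smoke test.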