01Support for LLM-based qualitative judging for content and prompts
02Flexible storage options for project-specific or global configurations
032 GitHub stars
04Interactive configuration wizard for multi-parameter experiment setup
05Automated baseline metric verification and experiment branching
06Built-in evaluators for benchmarking speed, size, and memory usage