01 Statistical analysis with p-values, effect sizes, and confidence intervals
02 Comprehensive HTML and Markdown report generation with data visualizations
03 Deep metrics collection covering context usage, tool calls, and backtracking
04 Automated side-by-side trial execution in synchronized terminal windows
05 Isolated task execution using Git worktrees to ensure clean environments
06 GitHub stars: 5
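
As a rough illustration of the statistical analysis item above, the sketch below compares one metric across two sets of trials and reports an effect size, a p-value, and a confidence interval. It is not the project's implementation: the function names, the permutation test, the normal-approximation interval, and the sample numbers are all assumptions chosen to keep the example within the Python standard library.

```python
import random
import statistics
from statistics import NormalDist


def cohens_d(a, b):
    """Effect size: difference in means divided by the pooled standard deviation."""
    na, nb = len(a), len(b)
    pooled_var = (
        (na - 1) * statistics.variance(a) + (nb - 1) * statistics.variance(b)
    ) / (na + nb - 2)
    return (statistics.mean(a) - statistics.mean(b)) / pooled_var ** 0.5


def permutation_p_value(a, b, n_resamples=2000, seed=0):
    """Two-sided p-value for the difference in means, estimated by shuffling labels."""
    rng = random.Random(seed)
    observed = abs(statistics.mean(a) - statistics.mean(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_resamples):
        rng.shuffle(pooled)
        diff = abs(
            statistics.mean(pooled[: len(a)]) - statistics.mean(pooled[len(a):])
        )
        if diff >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_resamples + 1)


def mean_diff_ci(a, b, level=0.95):
    """Normal-approximation confidence interval for the difference in means."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    se = (statistics.variance(a) / len(a) + statistics.variance(b) / len(b)) ** 0.5
    diff = statistics.mean(a) - statistics.mean(b)
    return diff - z * se, diff + z * se


if __name__ == "__main__":
    # Made-up tool-call counts per trial for two hypothetical configurations.
    config_a = [14, 17, 15, 19, 16, 18]
    config_b = [11, 13, 12, 15, 12, 14]
    print("Cohen's d:", cohens_d(config_a, config_b))
    print("p-value:", permutation_p_value(config_a, config_b))
    print("95% CI for mean difference:", mean_diff_ci(config_a, config_b))
```

A permutation test is used here because it makes few distributional assumptions and needs no third-party packages. With only a handful of trials per side, the normal-approximation interval is optimistic, and a t-based or bootstrap interval would be a more careful choice.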