01Evolution tracking with performance summaries and judge scores
025 GitHub stars
03Automated self-critique based on judge feedback and reflection memory
04Configurable iteration depth for both refinement and arena rounds
05Multi-model consensus gathering using rotating LLM battles
06Recursive refinement loop (decompose, critique, reflect, refine)