01Multi-phase verification covering initialization, refinement, outlining, planning, and execution.
02Automated structural checks for configuration files, task lists, and project deliverables.
031 GitHub stars
04Hybrid assessment engine combining deterministic Python scripts with qualitative LLM evaluations.
05Detailed scoring rubric (0-100) and failure taxonomy for root cause analysis.
06Interactive test case management including creation, listing, and automated execution against golden samples.