Orchestrates AI-driven development through eval-first loops, granular task decomposition, and cost-optimized model routing.
Agentic Engineering is a specialized skill designed to transform Claude Code into a high-performance autonomous developer by establishing a rigorous methodology for AI-human collaboration. It moves beyond simple prompting by implementing an 'eval-first' execution loop, where completion criteria and tests are defined before coding begins. The skill provides a framework for breaking complex features into 15-minute verifiable units, intelligently routing tasks between model tiers (Haiku, Sonnet, Opus) based on complexity, and focusing human oversight on high-risk architectural invariants rather than stylistic minutiae.
Key Features
01172,650 GitHub stars
02Intelligent Model Routing: Optimize costs by assigning tasks to Haiku, Sonnet, or Opus based on reasoning requirements.
0315-Minute Task Decomposition: Break complex workflows into small, verifiable units with single dominant risks.
04Eval-First Execution: Define capability and regression tests before starting any implementation work.
05Session Strategy Management: Guidelines on when to continue or refresh sessions to maintain optimal context and performance.
06Risk-Centric Review Focus: Prioritize edge cases, security assumptions, and error boundaries during AI code reviews.
Use Cases
01Standardizing software quality through rigorous automated evaluation and human-in-the-loop risk controls.
02Managing complex feature implementations where AI agents perform the majority of the coding work.
03Optimizing API costs and performance by matching model tiers to task difficulty.