소개
The Incident Runbook Templates skill equips Claude with specialized frameworks for managing high-pressure production incidents and establishing site reliability engineering (SRE) best practices. It provides comprehensive templates for detection, triage, mitigation, and recovery, specifically tailored for common infrastructure issues like Kubernetes service outages or database performance degradation. By standardizing severity levels, escalation paths, and communication templates, this skill helps engineering teams reduce cognitive load during downtime, ensuring faster Mean Time to Resolution (MTTR) and consistent stakeholder updates.