This skill empowers Claude to guide teams through the high-pressure lifecycle of a production incident, from initial declaration to final resolution. It implements industry-standard frameworks like the Incident Command System (ICS) and Google SRE best practices to ensure clear role definition (Commander, Comms, Tech Lead), rapid diagnosis, and consistent stakeholder communication. By following these structured procedures, teams can minimize downtime, avoid common anti-patterns like 'blame-storming' or 'committee decision-making,' and ensure a smooth path toward post-mortem learning and system hardening.
주요 기능
01Incident Command System (ICS) role assignment and structure
02Real-time status page update and stakeholder guidance
035 GitHub stars
04Structured war room management and communication protocols
05SEV 1-3 severity classification based on user impact
06Blameless post-mortem and root cause analysis procedures