Streamlines incident response with rapid severity classification, automated coordination, and structured post-mortem documentation.
This skill acts as an AI-powered incident commander within Claude Code, guiding teams through the critical phases of system outages and performance degradations. It automates the heavy lifting of incident management by defining severity levels (P0-P3), establishing communication protocols, and generating standardized incident records. By enforcing best practices in SRE and DevOps workflows, it ensures that responders stay focused on mitigation while maintaining clear stakeholder communication and thorough post-mortem documentation.
주요 기능
01Stakeholder communication templates for internal and external updates
02Automated severity assessment based on customer and business impact
03102 GitHub stars
04Structured incident record generation with real-time timeline tracking
05Standardized response coordination and escalation workflows
06Post-Incident Review (PIR) framework and root cause analysis templates
사용 사례
01Managing critical production outages with real-time severity classification
02Generating comprehensive post-mortem reports to identify root causes and action items
03Coordinating cross-functional response teams during system degradations