Conducts blameless incident reviews to identify root causes and establish actionable prevention strategies.
This skill empowers Claude to guide engineering teams through the incident postmortem process using industry-standard frameworks like Google SRE guidelines and the '5 Whys' methodology. It focuses on constructing precise timelines, analyzing system states without attributing blame to individuals, and generating measurable action items to prevent recurrence. By grounding the analysis in SWEBOK and SRE principles, it ensures that every production failure becomes a structured learning opportunity that improves long-term system reliability and team culture.
主な機能
01Detailed incident timeline reconstruction with 5-minute interval precision
02Generation of specific, measurable, and assignable prevention action items
039 GitHub stars
04Blameless Root Cause Analysis (RCA) using 5 Whys and Fishbone diagrams
05Systemic failure identification focusing on processes rather than individuals
06Contextual analysis of system load, recent changes, and environmental state
ユースケース
01Analyzing recurring software bugs to identify underlying architectural flaws
02Standardizing incident reporting formats across diverse engineering squads
03Conducting a formal review after a major production service outage