Introduction
The Incident Response Runbooks skill provides a framework for managing system outages and service degradations. It offers production-ready templates that cover the entire incident lifecycle: detection, triage, mitigation, resolution, and post-mortem communication. By standardizing severity levels and providing ready-to-use CLI commands for Kubernetes and database environments, the skill helps engineering teams reduce mean time to resolution (MTTR) and communicate clearly during high-pressure production incidents.