Introduction
This skill gives developers and SREs incident response frameworks aimed at minimizing downtime and reducing Mean Time to Resolution (MTTR). It provides templates for defining severity levels, escalation paths, and mitigation steps for common infrastructure issues such as service outages, database bottlenecks, and traffic spikes. Following these standardized procedures helps teams communicate consistently, triage faster, and recover more reliably during critical system failures and high-pressure on-call rotations.