What incident severities does this skill support?

It provides specific workflows and response requirements for SEV1 (complete outage), SEV2 (major impact), and SEV3 (minor impact or workaround available) incidents.
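
As a rough illustration of the three levels, the sketch below encodes them as an enum with a small classifier. The `classify` function and its two yes/no inputs are hypothetical and are not taken from the skill itself; real severity assessment would likely weigh more signals.

```python
from enum import Enum


class Severity(Enum):
    """Severity levels with the descriptions given in the answer above."""
    SEV1 = "complete outage"
    SEV2 = "major impact"
    SEV3 = "minor impact or workaround available"


def classify(service_down: bool, major_impact: bool) -> Severity:
    """Hypothetical mapping from two observations to a severity level."""
    if service_down:
        return Severity.SEV1
    if major_impact:
        return Severity.SEV2
    # Assumption: anything else is treated as minor or has a workaround.
    return Severity.SEV3
```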

Can this skill help with post-incident documentation?

Yes, it includes mandatory RCA (Root Cause Analysis) templates and requirements based on incident severity to ensure blameless retrospectives are conducted and documented.

When should I skip this workflow?

This workflow should be skipped for development environment issues, non-production alerts, and planned maintenance, which follow standard debugging or change management processes.
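
The skip rule can be expressed as a simple guard. This is only a sketch: the function name, the `environment` string values, and the `planned_maintenance` flag are assumptions made for illustration, not part of the skill.

```python
def should_use_incident_workflow(environment: str, planned_maintenance: bool) -> bool:
    """Return True only for unplanned issues in production.

    Hypothetical helper: development and other non-production
    environments, as well as planned maintenance, are routed to
    standard debugging or change management instead.
    """
    if planned_maintenance:
        return False
    return environment == "production"
```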

How does it help during the mitigation phase?

It provides a prioritized list of mitigation strategies—such as rollbacks, scaling, and failovers—along with verification checklists to ensure the fix is stable before resolution.
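
One way to picture a prioritized list with verification checklists is sketched below. The `Mitigation` class, the priority numbers, and the checklist items are all hypothetical; the skill's actual ordering and checks may differ.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Mitigation:
    name: str
    priority: int  # lower number = try first (assumed convention)
    checks: Dict[str, bool] = field(default_factory=dict)

    def verified(self) -> bool:
        """A mitigation counts as stable only if every check has passed."""
        return bool(self.checks) and all(self.checks.values())


def next_mitigation(options: List[Mitigation]) -> Mitigation:
    """Pick the highest-priority option from the list."""
    return min(options, key=lambda m: m.priority)


# Illustrative ordering only; the source names these strategies
# but does not state how they are ranked.
options = [
    Mitigation("failover", 3),
    Mitigation("rollback", 1, {"error rate normal": True, "latency normal": False}),
    Mitigation("scaling", 2),
]
```

In this sketch the incident would stay open until `verified()` returns `True` for the applied mitigation, which mirrors the idea of confirming stability before resolution.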

Does it provide templates for status updates?

Yes, it includes structured templates for incident declaration, mitigation progress, and final resolution to ensure consistent communication across the organization.
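
To show what a structured declaration template might look like, here is a minimal sketch using Python's `string.Template`. The wording of the template and its field names (`severity`, `summary`, `next_update_minutes`) are invented for illustration and do not reproduce the skill's own templates.

```python
from string import Template

# Hypothetical declaration template; the field names are assumptions.
DECLARATION = Template(
    "[$severity] Incident declared: $summary. "
    "Next update in $next_update_minutes minutes."
)


def render_declaration(severity: str, summary: str, next_update_minutes: int) -> str:
    """Fill in the declaration template; raises KeyError if a field is missing."""
    return DECLARATION.substitute(
        severity=severity,
        summary=summary,
        next_update_minutes=next_update_minutes,
    )
```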

Operations Incident Response

Name: Operations Incident Response
Author: LerianStudio

Category: Deployment and DevOps

Standardizes production incident management through structured SRE workflows from detection to post-mortem analysis.

This skill implements a rigorous Site Reliability Engineering (SRE) framework for managing production outages and service degradations. It guides teams through critical phases including incident declaration with severity mapping, coordinated triage using the 5 Whys approach, and systematic mitigation strategies. By enforcing mandatory documentation and verification periods, it ensures that SEV1-SEV3 incidents are handled with consistency, reducing recovery time and preventing premature resolutions while facilitating blameless post-mortems to improve long-term system reliability.
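
The 5 Whys approach mentioned above asks "why" repeatedly, with each answer becoming the subject of the next question. The sketch below records such a chain; the function name and the example answers are hypothetical and serve only to illustrate the technique.

```python
from typing import List, Tuple


def five_whys(problem: str, answers: List[str]) -> List[Tuple[str, str]]:
    """Build a chain of (question, answer) pairs, at most five deep.

    Each answer becomes the subject of the next "why" question.
    The last answer in the chain is the candidate root cause.
    """
    chain = []
    current = problem
    for answer in answers[:5]:
        chain.append((f"Why: {current}?", answer))
        current = answer
    return chain


# Invented example data, not from a real incident.
chain = five_whys(
    "checkout requests failed",
    [
        "the payment service timed out",
        "its connection pool was exhausted",
        "a deploy lowered the pool size",
    ],
)
root_cause = chain[-1][1]
```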

Key Features

01. Structured triage coordination using the 5 Whys methodology
02. Mitigation decision support and rollback planning
03. 17 GitHub stars
04. Standardized incident severity assessment (SEV1-SEV3)
05. Post-mortem (RCA) generation and action item tracking
06. Automated incident declaration and resolution templates

Use Cases

01. Ensuring SLA compliance through structured response timelines and updates
02. Managing high-pressure production outages with multi-team coordination
03. Conducting blameless post-incident reviews to identify systemic improvements

Install with 🐟 Skill.Fish

npx skillfish add lerianstudio/ring ops-incident-response
