Introduction
This skill provides a framework for Site Reliability Engineering (SRE) practice, organized around the hierarchy of Service Level Agreements (SLAs), Service Level Objectives (SLOs), and Service Level Indicators (SLIs). It enables Claude to help developers define meaningful reliability metrics, calculate error budgets, and configure automated alerting based on burn rates. By incorporating Prometheus recording rules and Grafana dashboard structures, it connects business requirements to technical observability, so teams can balance innovation velocity against system stability.
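The error-budget and burn-rate arithmetic mentioned above can be sketched in a few lines of Python. This is a minimal illustration, not part of the skill itself: the function names, the 30-day window, and the 14.4 paging threshold (a burn rate that would consume 2% of a 30-day budget in one hour, as commonly cited from the Google SRE Workbook) are all assumptions chosen for the example.

```python
def error_budget(slo_target: float) -> float:
    """Return the fraction of events allowed to fail under the SLO.

    slo_target is a ratio such as 0.999 for a 99.9% objective.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    return 1.0 - slo_target


def allowed_downtime_minutes(slo_target: float, window_days: int = 30) -> float:
    """Convert the error budget into minutes of full outage per window.

    The 30-day default window is an assumption for this sketch.
    """
    return error_budget(slo_target) * window_days * 24 * 60


def burn_rate(error_ratio: float, slo_target: float) -> float:
    """Return how fast the budget is being spent.

    A burn rate of 1.0 spends exactly the whole budget over the SLO window;
    higher values exhaust it proportionally sooner.
    """
    return error_ratio / error_budget(slo_target)


def should_page(error_ratio: float, slo_target: float,
                threshold: float = 14.4) -> bool:
    """Decide whether the observed error ratio warrants a page.

    The 14.4 default is an assumed threshold, not a value from this document.
    """
    return burn_rate(error_ratio, slo_target) >= threshold


if __name__ == "__main__":
    slo = 0.999  # hypothetical 99.9% availability objective
    print(f"Error budget: {error_budget(slo):.4%}")
    print(f"Allowed downtime: {allowed_downtime_minutes(slo):.1f} min per 30 days")
    print(f"Burn rate at 2% errors: {burn_rate(0.02, slo):.1f}")
    print(f"Page on-call: {should_page(0.02, slo)}")
```

In a real deployment the `error_ratio` input would typically come from an SLI computed by a Prometheus recording rule over a fixed lookback window, and the threshold would be paired with that window rather than applied on its own.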