关于
This skill provides a comprehensive Site Reliability Engineering (SRE) framework for Claude Code, enabling developers to establish precise reliability targets. It guides users through defining SLIs, setting SLOs, and managing error budgets to effectively balance innovation velocity with system stability. By providing ready-to-use Prometheus recording rules, alerting configurations, and Grafana dashboard patterns, it helps teams implement proactive observability that measures actual user-perceived performance rather than just raw infrastructure metrics.