概要
This skill empowers Site Reliability Engineers and DevOps teams to streamline their observability workflows by automating the generation of comprehensive alerting rules. It guides users through defining critical thresholds for latency, error rates, and resource utilization while establishing robust routing and escalation policies. By providing automated runbook generation and testing procedures, it ensures that monitoring setups are not only proactive but also actionable, significantly reducing alert fatigue and manual configuration time.