Acerca de
This skill provides a comprehensive framework for Site Reliability Engineering (SRE), helping teams establish measurable reliability targets through standardized SLIs and SLOs. It facilitates the creation of Prometheus recording and alerting rules, precise calculation of error budgets, and the design of observability dashboards, allowing organizations to balance rapid feature innovation with consistent service stability and performance.