Acerca de
This skill provides a comprehensive framework for Site Reliability Engineering (SRE) practices, enabling teams to quantify service health through Service Level Indicators (SLIs) and Service Level Objectives (SLOs). It guides users through creating Prometheus recording rules, setting up multi-window burn rate alerts, and establishing error budget policies to balance feature delivery with operational stability. By integrating these patterns, developers can move away from reactive monitoring toward a proactive, data-driven approach to reliability and incident management within the FlashFusion ecosystem or any cloud-native environment.