关于
This skill provides a comprehensive framework for setting up and managing Prometheus monitoring systems. It guides users through the entire observability lifecycle, including installation via Helm or Docker, complex scrape configurations using Kubernetes service discovery, the creation of efficient recording rules for pre-computing metrics, and the definition of critical alerting rules. It is essential for engineering teams looking to implement robust infrastructure and application monitoring using industry-standard best practices for performance and reliability.