소개
This skill provides a comprehensive framework for setting up Prometheus observability, covering everything from initial installation via Helm or Docker to advanced scrape configurations and alert management. It simplifies the implementation of metric collection across Kubernetes clusters and standalone services, offering pre-defined recording rules for performance optimization and critical alert definitions to ensure high availability and proactive system maintenance. Ideal for SREs and developers needing to establish reliable monitoring with industry-standard best practices.