关于
This skill provides enterprise-grade expertise for monitoring Kubernetes environments, enabling developers and SREs to deploy and manage Prometheus, Grafana, Loki, and OpenTelemetry. It simplifies the creation of ServiceMonitors, complex PromQL queries, and SLO-based alerting rules to ensure cluster health and application performance. Whether you are troubleshooting resource bottlenecks or setting up multi-signal observability, this skill offers the implementation patterns and diagnostic workflows needed for production-ready cluster management.