About
This skill lets Claude act as a senior Site Reliability Engineer by giving it a systematic approach to cluster maintenance and incident response. It uses a Popeye-inspired scoring framework to flag critical 'BOOM' issues, such as resource exhaustion and security vulnerabilities, in both standard Kubernetes and OpenShift environments. Whether you are debugging pod failures like CrashLoopBackOff, validating RBAC permissions, or tuning resource limits, the skill supplies the diagnostic patterns needed to keep high-availability infrastructure running.