소개
The Kubernetes Troubleshooting skill empowers Claude to act as a senior SRE, providing systematic diagnosis and resolution patterns for containerized environments. It features comprehensive decision trees for common failure modes like CrashLoopBackOff and ImagePullBackOff, deep-dive network connectivity analysis, and node-level health checks. Whether dealing with pending pods, DNS resolution failures, or resource-related performance bottlenecks, this skill provides the specific kubectl commands and investigative logic required to reduce Mean Time to Recovery (MTTR) and ensure high availability in production clusters.