概要
The Kubernetes SRE skill empowers Claude to act as a Site Reliability Engineer, providing a systematic approach to investigating pod failures, service degradations, and deployment issues. It enforces rigorous root cause analysis using the 5 Whys principle and ensures safe, read-only data collection across multiple environments including dev, integration, and live clusters. From debugging CrashLoopBackOff states and OOMKills to reconciling Flux GitOps resources, this skill provides the context-aware commands and documentation-lookup strategies needed to move beyond symptoms and fix the underlying infrastructure problems.