Diagnoses and manages Kubernetes clusters by inspecting resources, logs, and events to resolve workload failures and optimize operations.
This skill empowers Claude to navigate and debug Kubernetes environments with precision by leveraging either MCP servers or kubectl commands. It provides a structured, stepwise approach to resolving common container issues—such as CrashLoopBackOff, ImagePullBackOff, or pending pods—by systematically analyzing resource conditions, events, and logs. Designed with safety and multi-cluster awareness in mind, it ensures that operations are scoped correctly to namespaces and contexts while preferring non-destructive read operations for faster, safer troubleshooting.
主要功能
01Systematic debugging for common pod failure states
02Intelligent tool selection between MCP and kubectl
030 GitHub stars
04Resource inspection for deployments, nodes, and pods
05Real-time log retrieval and event analysis
06Multi-cluster context and namespace scoping
使用场景
01Performing health checks and resource audits across multiple namespaces
02Monitoring deployment rollouts and analyzing pod logs for errors
03Investigating why a specific service is stuck in a CrashLoopBackOff state