关于
Empowers developers and SREs to build more robust systems by facilitating the design and implementation of chaos engineering experiments. It provides domain-specific guidance on failure injection strategies, tool selection for environments like Kubernetes and AWS, and the validation of recovery mechanisms such as circuit breakers and retries. By simulating real-world stressors like network latency and resource exhaustion, the skill helps teams proactively identify weaknesses and ensure their systems can gracefully handle unexpected failures before they impact production.