Acerca de
This skill provides a comprehensive framework for transforming random fault injection into disciplined reliability engineering. It guides developers and SREs through a standard methodology of hypothesis formation, defining measurable Service Level Indicators (SLIs), and implementing strict blast radius controls to ensure experiments are safe and informative. By focusing on validation patterns and success criteria, it helps teams proactively identify system weaknesses and build more resilient distributed architectures without risking uncontrolled production outages.