01AI-agent coordination for running complex, multi-service resilience experiments
02Gradual blast radius control from development to production environments
03Steady-state metric definition and real-time monitoring
04Automated failure injection for network, infrastructure, and application layers
05Automatic rollback triggers based on configurable error rate thresholds
060 GitHub stars