Overview
The Resilience Patterns skill provides standardized implementations of patterns for building fault-tolerant distributed systems and reliable LLM-based workflows: circuit breakers to prevent cascading failures, bulkhead isolation to partition resources, and retry strategies with exponential backoff. For AI workloads, it also includes patterns for LLM API fallback chains and token budget management. The goal is to keep production services stable and responsive when external dependencies or AI models hit transient errors or outages.
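As a rough illustration of three of the patterns named above (circuit breaker, retry with exponential backoff, and a fallback chain), here is a minimal Python sketch. It is not the skill's actual implementation. The names `CircuitBreaker`, `CircuitOpenError`, `retry_with_backoff`, and `first_successful`, and all default thresholds and delays, are hypothetical choices made for this example. Bulkhead isolation and token budget management are left out to keep it short.

```python
import time
from typing import Callable, Optional, Sequence, TypeVar

T = TypeVar("T")


class CircuitOpenError(RuntimeError):
    """Raised when a call is rejected because the circuit is open."""


class CircuitBreaker:
    """Hypothetical breaker: opens after N consecutive failures and
    allows a single trial call once `reset_timeout` seconds have passed."""

    def __init__(self, failure_threshold: int = 3, reset_timeout: float = 30.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout
        self._clock = clock
        self._failures = 0
        self._opened_at: Optional[float] = None

    def call(self, fn: Callable[[], T]) -> T:
        if self._opened_at is not None:
            if self._clock() - self._opened_at < self.reset_timeout:
                # Fail fast instead of hitting a dependency known to be down.
                raise CircuitOpenError("circuit is open; call rejected")
            # Timeout elapsed: fall through and let one trial call run.
        try:
            result = fn()
        except Exception:
            self._failures += 1
            if self._failures >= self.failure_threshold:
                self._opened_at = self._clock()  # open (or re-open) the circuit
            raise
        # Success closes the circuit and clears the failure count.
        self._failures = 0
        self._opened_at = None
        return result


def retry_with_backoff(fn: Callable[[], T], max_attempts: int = 3,
                       base_delay: float = 0.5, max_delay: float = 8.0,
                       sleep: Callable[[float], None] = time.sleep) -> T:
    """Retry `fn`, doubling the delay after each failure, capped at `max_delay`."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(min(max_delay, base_delay * 2 ** attempt))
    raise AssertionError("unreachable")


def first_successful(providers: Sequence[Callable[[], T]]) -> T:
    """Fallback chain: try each provider in order, return the first result."""
    last_error: Optional[Exception] = None
    for provider in providers:
        try:
            return provider()
        except Exception as exc:
            last_error = exc
    raise RuntimeError("all providers failed") from last_error
```

The pieces compose. For example, each entry passed to `first_successful` could be a lambda that wraps one LLM provider call in `retry_with_backoff` and a per-provider `CircuitBreaker`. A production version would usually also add jitter to the backoff delay and retry only on errors known to be transient.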