Automates and guides incident response for Clay integrations through standardized triage, mitigation, and postmortem procedures.
The Clay Incident Runbook skill provides a comprehensive framework for managing outages and errors within Clay-based workflows. It streamlines the incident response lifecycle by helping developers categorize severity levels, perform rapid triages using kubectl and curl, and execute immediate remediation actions for common API errors like 401s, 429s, and 503s. By integrating status page checks with internal metrics, this skill ensures that teams can differentiate between local configuration issues and global provider outages while maintaining consistent communication through built-in Slack and PagerDuty templates.
主な機能
01Pre-formatted communication templates for internal and external stakeholders
02Standardized P1-P4 severity level classification for Clay incidents
03Step-by-step remediation for authentication, rate-limiting, and server errors
04Automated triage workflows using status checks and system metrics
05Postmortem evidence collection and documentation generation
060 GitHub stars
ユースケース
01Managing team-wide communication and status updates during critical production downtime
02Automating secret rotation and pod restarts during 401/403 authentication failures
03Identifying if a service failure is a local credential issue or a global Clay API outage