Monitors and analyzes application error rates across HTTP endpoints, databases, and APIs to ensure system reliability and uptime.
The Error Rate Monitor skill automates the detection and analysis of application failures by tracking HTTP status codes, database exceptions, and background job timeouts. It enables developers to proactively identify reliability issues through threshold-based alerting and error budget tracking, helping teams maintain high service-level objectives (SLOs). Whether setting up monitoring for a new service or diagnosing recurring patterns in legacy logs, this skill provides actionable insights and visualization strategies to minimize downtime and improve overall system health.
主要功能
01Granular log analysis and failure pattern detection
021,909 GitHub stars
03Automated error source identification for HTTP, DB, and API layers
04Error budget tracking and SLO/SLI alignment
05Dashboard and visualization configuration for error metrics
06Threshold-based alerting and notification channel configuration
使用场景
01Setting up real-time error monitoring for a new production web application
02Diagnosing the root cause of frequent background job failures or timeouts
03Establishing error budgets and reliability targets for legacy systems