EventOrOutage
CreatedRootly-AI-Labs
Identifies potential external events like holidays or major happenings that may be causing traffic drops, helping SREs distinguish them from actual outages.
About
EventOrOutage leverages Large Language Models (LLMs) to assist Site Reliability Engineers (SREs) in determining whether a decrease in traffic is due to external events, such as holidays, elections, or sporting events, rather than an actual system outage. The tool provides a probability score for each event, indicating its potential impact on traffic, along with the affected geographies and the estimated number of people involved. This standalone prototype demonstrates how this functionality could be integrated into an AI SRE solution or monitoring tool.
Key Features
- Estimates the probability of an event impacting traffic.
- Supports multiple data sources via API integration for holidays and events.
- Leverages LLMs (GPT-4, Claude, Gemini, Deepseek) to identify external events.
- Can analyze traffic logs from a file.
- Identifies affected geographies and the number of people involved.
- 16 GitHub stars
Use Cases
- Diagnosing traffic drops during holidays or major events.
- Integrating with monitoring tools for proactive alerting.
- Analyzing the impact of specific events on traffic patterns.