Analytics & Monitoring Agent Skills

Discover Agent Skills for analytics & monitoring. Browse 47skills for Claude, ChatGPT & Codex.

Observability Alert Manager

Configures and manages Grafana alerts for Claude Code to monitor session anomalies, error rates, and resource utilization.

Claude Code Telemetry Enabler

Configures and enables OpenTelemetry logging, metrics, and tracing for Claude Code to monitor session performance and costs.

Observability Railway Deploy

Deploys a comprehensive LGTM observability stack to Railway cloud for centralized monitoring and team access.

D3 Visualization Specialist

Builds bespoke, interactive data visualizations and complex codebase maps using the D3.js library for high-level data storytelling.

Claude Cost Optimization

Monitors and reduces Anthropic API expenses through advanced token tracking and implementation of optimization patterns like prompt caching and effort selection.

Multi-AI Debugging Council

Orchestrates a multi-agent debugging workflow using Claude, Gemini, and Codex to perform advanced root cause analysis and automated fix generation.

xAI Crypto Sentiment Analysis

Analyzes real-time cryptocurrency market sentiment and whale activity using Grok's native X integration.

Multi-AI Debugging

Orchestrates multiple AI agents to perform systematic root cause analysis, semantic log classification, and automated fix generation for complex system failures.

Observability Analyzer

Analyzes Claude Code telemetry data to provide deep insights into performance, costs, and tool usage patterns.

Observability Dashboard Creator

Automates the setup and management of comprehensive Grafana dashboards for monitoring Claude Code performance, costs, and errors.

Maestro MOT Health Check

Performs comprehensive system audits and diagnostic health checks for Maestro skills, agents, hooks, and memory systems.

Performance Analysis & Optimization

Analyzes Claude Flow swarms to detect performance bottlenecks, profile operations, and provide actionable AI-powered optimization recommendations.

Autonomous Coding Insight Extractor

Extracts actionable patterns and learnings from autonomous coding sessions to optimize future AI performance.

Claude Code Observability Dashboards

Deploys and manages comprehensive Grafana dashboards for monitoring Claude Code performance, costs, and session health.

Railway Log Management

Accesses and analyzes Railway build, deployment, and runtime logs for debugging and monitoring applications.

Autonomous Coding Insight Extractor

Extracts actionable insights and performance patterns from autonomous coding sessions to optimize future AI interactions.

Observability Analyzer

Analyzes Claude Code telemetry to generate actionable insights into performance, costs, and tool usage patterns.

Autonomous Cost Optimizer

Monitors token usage and optimizes API expenditure for autonomous coding agents.

Global Error Handling

Implements robust error-handling patterns across API routes, client-side components, and data fetching logic to ensure application stability and graceful failure.

OpenEvent Detection Triage

Debugs and resolves intent classification, routing errors, and detection misfires within the OpenEvent-AI workflow.

OpenEvent Trace & Fallback Triage

Debugs and eliminates generic fallback responses by pinpointing failure triggers and automating reproduction steps.

Decision Log Auditor

Validates Decision API JSONL logs against schema requirements and platform invariants to ensure data integrity.

Postmortem Writing & Incident Analysis

Guides teams through creating blameless post-incident reviews, identifying root causes, and implementing actionable follow-up items.

Python Performance Profiling

Identifies performance bottlenecks in Python code through systematic profiling and applies targeted, measurable optimizations.

On-Call Handoff Patterns

Standardizes on-call shift transitions using structured context transfer, incident documentation, and escalation procedures to ensure service reliability.

Incident Response Runbooks

Streamlines production incident management by providing structured runbook templates and standardized response procedures.

Service Mesh Observability

Implements comprehensive monitoring, distributed tracing, and visualization for Istio and Linkerd service mesh environments.

Distributed Tracing & Observability

Implements distributed tracing using Jaeger and Tempo to monitor request flows and optimize performance in microservices architectures.

Prometheus Configuration

Configures and optimizes Prometheus for robust metric collection, alerting, and observability across infrastructure and applications.

Observe Before Editing

Enforces an observation-first debugging workflow by verifying system outputs and logs before making code changes.

30 results loaded • More available

Scroll for more results...