Discover Agent Skills for analytics & monitoring. Browse 51skills for Claude, ChatGPT & Codex.
Implements production-grade structured logging, OpenTelemetry tracing, and LLM-specific metrics tracking for AI applications.
Captures comprehensive execution metrics and performance data for Betty Framework components to provide deep system observability.
Monitors usage, tracks API costs, and measures the ROI of Claude Code using OpenTelemetry metrics and observability tools.
Orchestrates a specialized multi-agent workflow to perform systematic root-cause analysis and debugging for software issues.
Enhances Claude Code sessions by providing actionable patterns for real-time status line monitoring, cost tracking, and workflow awareness.
Analyzes frontend applications to identify performance bottlenecks and provides actionable optimizations for bundle size, rendering, and Core Web Vitals.
Defines and implements service reliability targets using SLIs, SLOs, and error budgets to balance innovation with system stability.
Implements end-to-end request tracking across microservices using Jaeger, Tempo, and OpenTelemetry to identify performance bottlenecks and resolve latency issues.
Configures Prometheus for comprehensive metric collection, alerting, and observability across infrastructure and applications.
Master systematic debugging techniques and profiling tools to efficiently resolve bugs and performance issues across any technology stack.
Builds and manages production-ready Grafana dashboards for real-time observability and metric visualization.
Profiles and optimizes Python codebases to eliminate execution bottlenecks and reduce resource consumption.
Implements end-to-end request tracking across microservices using Jaeger and Tempo to identify performance bottlenecks and system dependencies.
Create and manage production-ready Grafana dashboards for real-time visualization of system, infrastructure, and application metrics.
Defines and implements Service Level Indicators (SLIs) and Service Level Objectives (SLOs) with error budgets and automated alerting.
Configures comprehensive Prometheus monitoring environments including metric collection, alerting rules, and service discovery.
Builds and manages production-grade Grafana dashboards for real-time observability and comprehensive system monitoring.
Implements end-to-end request tracking using Jaeger and Tempo to monitor performance and debug latency across distributed systems.
Establishes measurable reliability targets using SLIs, SLOs, and error budgets to balance service stability with innovation velocity.
Profiles, analyzes, and optimizes Python code to eliminate bottlenecks and improve application efficiency.
Configures Prometheus for robust infrastructure and application metric collection, storage, and alerting.
Implements robust error management systems using structured logging, custom exceptions, and graceful degradation strategies to ensure application resilience.
Generates multi-dimensional stock analysis reports by synthesizing technical indicators, fundamental financials, and real-time sentiment data.
Configures comprehensive error tracking and performance monitoring for Next.js applications using Sentry and OpenTelemetry.
Implements comprehensive distributed tracing using Jaeger and Tempo to monitor request flows and optimize microservice performance.
Configures Prometheus for end-to-end infrastructure monitoring, metric scraping, and automated alerting systems.
Implements comprehensive observability, health checks, and alerting systems for Redis database environments.
Configures and manages Flower for real-time Celery task monitoring, secure authentication, and Prometheus metrics integration.
Creates and manages production-ready Grafana dashboards for real-time visualization of system, infrastructure, and application metrics.
Profiles and optimizes Python code to eliminate bottlenecks, reduce memory usage, and improve execution speed.
Scroll for more results...