Discover Agent Skills for analytics & monitoring. Browse 47 skills for Claude, ChatGPT & Codex.
Identifies and resolves software bottlenecks through systematic measurement and empirical optimization workflows.
Diagnoses and resolves application performance issues across CPU, memory, I/O, and database layers to optimize resource utilization.
Monitors PostgreSQL and MySQL health using real-time metrics and predictive alerts to ensure database performance and uptime.
Monitors and optimizes application resource consumption including CPU, memory, and network I/O to improve performance and reduce costs.
Establishes measurable reliability targets using SLIs, SLOs, and error budgets to balance service stability with innovation velocity.
Configures comprehensive Prometheus monitoring environments including metric collection, alerting rules, and service discovery.
Creates and manages production-ready Grafana dashboards for real-time visualization of system, infrastructure, and application metrics.
Monitors and optimizes PostgreSQL and MySQL performance through real-time metrics, predictive alerts, and automated remediation.
Implements end-to-end request tracking across microservices using Jaeger and Tempo to identify performance bottlenecks and system dependencies.
Monitors real-time database health, detects long-running transactions, and identifies lock contention issues using proactive alerting.
Builds and manages production-ready Grafana dashboards for real-time observability and metric visualization.
Identifies and resolves memory leaks in code to improve application performance and stability.
Automates the configuration of uptime, transaction, and API monitoring to ensure application performance and availability.
Automates the deployment and configuration of production-ready monitoring stacks including Prometheus, Grafana, and Datadog.
Analyzes and optimizes network request patterns to reduce latency and improve application performance.
Automates the analysis and integration of logging, metrics, and tracing into existing software applications.
Centralizes performance metrics from diverse applications and infrastructure into a unified monitoring and alerting system.
Automates the deployment and configuration of centralized logging solutions like ELK, Loki, and Splunk for production environments.
Analyzes network request patterns and diagnoses latency bottlenecks to optimize application performance and communication efficiency.
Analyzes infrastructure utilization and forecasts growth trends to provide proactive scaling recommendations and cost estimates.
Automates the deployment and configuration of centralized logging infrastructure using ELK, Loki, or Splunk.
Monitors and analyzes application error rates across HTTP endpoints, databases, and background jobs to improve system reliability.
Monitors active development in real time to detect and prevent architectural drift and scope creep.
Optimizes Python application speed and memory efficiency through advanced profiling, benchmarking, and implementation strategies.
Configures and deploys OpenTelemetry pipelines to manage traces, metrics, and logs in Kubernetes environments.
Analyzes frontend applications to identify performance bottlenecks and provides actionable optimizations for bundle size, rendering, and Core Web Vitals.
Performs comprehensive analysis of code quality, development workflows, and skill effectiveness to generate actionable insights.
Implements comprehensive monitoring, logging, and observability solutions for data infrastructure and production pipelines.
Implements production-grade structured logging, distributed tracing, and metrics collection patterns for robust system monitoring.
Implements a rigorous diagnostic framework to identify and resolve software bugs through structured hypothesis testing and data validation.