概要
This skill empowers developers to architect and deploy comprehensive observability stacks using industry-standard tools like Prometheus, Grafana, and OpenTelemetry. It provides structured guidance for implementing Service Level Objectives (SLOs), configuring distributed tracing across microservices, and setting up proactive alerting to prevent incidents before they impact users. Whether you are bootstrapping a new production environment or optimizing existing monitoring infrastructure, this skill ensures best practices are followed for metrics collection, dashboard visualization, and performance tracking.