Acerca de
This skill streamlines the process of observability by identifying critical KPIs across diverse infrastructure layers and configuring industry-standard agents such as Prometheus, Datadog, or AWS CloudWatch. It assists developers and SREs in setting up automated metrics collection, centralizing data aggregation, and generating real-time health dashboards to proactively identify system bottlenecks and optimize performance. Whether you are scaling a web server or troubleshooting database latency, this skill provides the configuration scripts and alerting rules needed for a robust monitoring setup.