This skill provides production-ready guidance and implementation patterns for monitoring Temporal workflows and cluster health. It covers metrics collection, visualization dashboards, and critical alerting rules for the frontend, history, matching, and worker services. Aimed at platform engineers and developers who need to keep their Temporal infrastructure highly available and performant, it includes specific PromQL queries for service health, persistence latency, and task queue backlogs, as well as worker-side SDK metrics configuration.
Key Features
- Official Grafana dashboard integration (IDs 10270 and 10271)
- Essential PromQL queries for service health and workflow performance
- Worker-side SDK metrics implementation for Go environments
- Pre-configured Prometheus scrape configurations for Temporal K8s services
- Critical and warning alerting rules for persistence and service latency
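
As a rough illustration of the kind of PromQL query the skill refers to, the Go sketch below builds a latency-quantile expression for each Temporal service. It is not taken from the skill itself: the helper `LatencyQuantileQuery` is hypothetical, and the metric name `service_latency_bucket` and label name `service_name` are assumptions that may differ between Temporal server versions, so check them against the metrics your cluster actually exports.

```go
package main

import (
	"fmt"
	"strconv"
)

// LatencyQuantileQuery is a hypothetical helper that builds a PromQL
// expression for a latency quantile over a histogram metric.
// bucketMetric, labelName, and labelValue are supplied by the caller
// because the exact names depend on the Temporal server version.
func LatencyQuantileQuery(quantile float64, bucketMetric, labelName, labelValue, window string) string {
	return fmt.Sprintf(
		`histogram_quantile(%s, sum(rate(%s{%s=%q}[%s])) by (le))`,
		strconv.FormatFloat(quantile, 'g', -1, 64),
		bucketMetric, labelName, labelValue, window,
	)
}

func main() {
	// Assumed metric and label names; verify them on your own cluster.
	const metric = "service_latency_bucket"
	const label = "service_name"

	// One p95 query per service named in the description above.
	for _, svc := range []string{"frontend", "history", "matching", "worker"} {
		fmt.Println(LatencyQuantileQuery(0.95, metric, label, svc, "5m"))
	}
}
```

Keeping the metric and label names as parameters makes it easy to adapt the same query shape to persistence latency or to a differently labelled deployment.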