01116 GitHub stars
02Standardized alerting rules based on severity levels and dynamic percentile thresholds
03LLM observability with Langfuse for tracing, cost tracking, and evaluation scoring
04Statistical and quality drift detection to identify performance regressions in AI models
05Silent failure detection for LLM agents including tool skipping and token spike alerting
06Infrastructure monitoring using Prometheus metrics and Grafana dashboard templates