Discover Agent Skills for analytics & monitoring. Browse 47 skills for Claude, ChatGPT & Codex.
Diagnoses and resolves stalled or failing Inspect AI evaluations running on Hawk infrastructure.
Diagnoses and resolves stalled or failing AI model evaluations running on METR's Inspect framework.
Troubleshoots and resolves stalled or hanging AI evaluations within the METR Hawk and Inspect AI ecosystem.
Diagnoses and resolves hanging, stalled, or failing AI evaluations within the Hawk and Inspect AI framework.
Diagnoses and resolves stalled or failing Inspect AI evaluations by analyzing logs, pod states, and API connectivity within the Hawk cloud environment.
Diagnoses and resolves hanging or failing UK AISI Inspect AI evaluations running in the Hawk cloud environment.
Diagnoses and resolves stalled, hanging, or failing UK AISI Inspect AI evaluations in cloud environments.
Diagnoses and resolves hanging or failing Inspect AI evaluations within Hawk cloud environments.
Diagnoses and resolves hanging or failed evaluations in METR Hawk and UK AISI Inspect AI environments.
Diagnoses and resolves hanging or failing LLM evaluations within the Hawk and Inspect AI ecosystem.
Implements production-grade JSON logging patterns and distributed tracing to enhance system observability and incident debugging.
Troubleshoots and resolves hung or failing UK AISI Inspect AI evaluations running in the Hawk cloud environment.
Monitors and diagnoses the operational status of all Gmail Commander subsystems and automation processes.
Monitors the health and operational status of asciinema recording daemons and terminal session files.
Diagnoses and resolves hanging or failing AI model evaluations using Hawk and the UK AISI Inspect framework.
Monitors and calculates real-time API token usage and USD costs for specific Claude Code session operations.
Diagnoses and resolves stalled, hanging, or failing AI evaluations within the Hawk and Inspect AI frameworks.
Diagnoses and resolves stalled, hanging, or failing AI evaluations within the Hawk and Inspect AI framework.
Troubleshoots and resolves hanging or failing AI model evaluations running on the Hawk platform.
Diagnoses and resolves stalled, hanging, or failing AI evaluations within the UK AISI Inspect framework.
Diagnoses and resolves hanging or failing AI model evaluations running on the Hawk and Inspect frameworks.
Diagnoses and resolves stalled or failing Inspect AI evaluations running on the METR Hawk platform.
Performs semantic analysis and keyword extraction on terminal recordings to identify patterns and usage metrics.
Monitors and inspects real-time logs from the asciinema chunker daemon to troubleshoot background recording processes.
Monitors and diagnoses the operational status of the Cal.com Commander automation suite and its underlying subsystems.
Troubleshoots and resolves stuck or failing UK AISI Inspect AI evaluations on the Hawk platform.
Diagnoses and resolves stalled or failing AI evaluations running on the Hawk and Inspect AI cloud platform.
Monitors and analyzes dbt execution metadata to track model performance, test reliability, and data quality metrics over time.
Diagnoses and validates the health of the Specweave reflection system to ensure continuous AI learning and system stability.
Optimizes full-stack application performance through profiling, database tuning, and frontend efficiency strategies.