This tool is a lightweight, single-file MCP server for managing jobs on SLURM-based High-Performance Computing (HPC) clusters. It provides TTY-aware job monitoring with desktop notifications, command validation that guards against common SSH escape traps, and resource usage checks that warn against over-allocation. With multi-cluster support, SSH auto-reconnect, and a suite of command-line utilities, it lets users submit, track, and manage computational tasks directly from their development environment.
Key Features
01 TTY-Aware Job Watching with Desktop Notifications
02 Command Guard for preventing SSH escape traps and command corruption
03 Resource Check to warn on over-allocation based on historical usage
04 Multi-Cluster Support and SSH Auto-Reconnect
05 Job Templates and Direct Script Submission for SLURM jobs
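The Resource Check feature listed above compares a request against historical usage. The listing does not say how the tool does this, so the following is only a minimal sketch of the general idea: the `Usage` record, the `check_request` function, and the `factor` threshold are all hypothetical names chosen for illustration, not the tool's API.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Usage:
    """Peak usage of one past job (hypothetical record; field names are assumptions)."""
    peak_mem_gb: float
    cpus_used: float


def check_request(req_mem_gb: float, req_cpus: float,
                  history: List[Usage], factor: float = 2.0) -> List[str]:
    """Return warnings when a request exceeds `factor` times the historical peak.

    The 2x default is an arbitrary illustrative threshold, not a value
    taken from the tool.
    """
    warnings: List[str] = []
    if not history:
        return warnings  # no past jobs, nothing to compare against

    max_mem = max(u.peak_mem_gb for u in history)
    max_cpus = max(u.cpus_used for u in history)

    if req_mem_gb > factor * max_mem:
        warnings.append(
            f"memory: requested {req_mem_gb} GB, historical peak {max_mem} GB")
    if req_cpus > factor * max_cpus:
        warnings.append(
            f"cpus: requested {req_cpus}, historical peak {max_cpus}")
    return warnings
```

In practice the history would likely come from SLURM accounting data; how the tool actually gathers it is not stated here.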
Use Cases
01 Monitoring SLURM jobs across multiple terminal windows with real-time alerts.
02 Securely submitting and managing batch and interactive jobs on HPC clusters.
03 Automating HPC workflows, resource allocation checks, and usage reporting through AI clients.
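One common source of command corruption over SSH is that the remote shell re-parses the command string, so spaces, semicolons, and quotes in an argument can change its meaning. The listing does not describe how the Command Guard works; the sketch below only illustrates one standard defence, quoting each remote argument with Python's `shlex` module. The `build_ssh_argv` helper and the host name `cluster` are hypothetical.

```python
import shlex
from typing import List


def build_ssh_argv(host: str, remote_argv: List[str]) -> List[str]:
    """Build a local argv for `ssh <host> <command>`.

    Each remote argument is shell-quoted so the remote shell sees it as a
    single word, even if it contains spaces or shell metacharacters.
    """
    remote_cmd = shlex.join(remote_argv)  # quotes every argument as needed
    return ["ssh", host, remote_cmd]


# "cluster" is a placeholder host; the job name contains characters that
# would otherwise be interpreted by the remote shell.
argv = build_ssh_argv("cluster", ["squeue", "--name", "my job; rm -rf ~"])
```

The resulting list can be passed to `subprocess.run` without `shell=True`, which keeps the local shell out of the picture as well.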