About
This skill helps developers work with SRP's Slurm clusters, with a focus on H100 and H200 GPU workloads. It covers submitting jobs with sbatch or the ssubmit wrapper, managing Apptainer container environments, and configuring multi-node distributed training. It also covers cluster monitoring, unified JuiceFS data access patterns, and automated Feishu notifications, so users can troubleshoot job failures, track resource utilization, and keep their cluster workloads running efficiently from within Claude Code.