Intelligent routing to 12 specialized Deep RL sub-skills
Advanced multi-agent (MARL) and model-based RL strategy selection
Support for both online-interaction and offline RL strategies
Structured debugging protocols for non-converging training loops
Diagnostic framework for classifying action spaces and data regimes