013,983 GitHub stars
02First-class support for GLM-4, Qwen3, DeepSeek V3, and Llama 3 architectures
03Pre-configured workflows for GRPO, PPO, and Reinforce++ algorithms
04High-throughput rollout generation using SGLang with integrated routing
05Flexible data buffer system for custom prompt management and sample storage
06Native Megatron-LM integration supporting TP, PP, DP, and Sequence Parallelism