01Configuration of mixed precision and CPU offloading strategies
02Handling uneven training inputs with the Join context manager
03Implementation of parameter and optimizer state sharding
04Advanced debugging for NCCL and Gloo distributed backends
05Guidance on transitioning models to FSDP and FSDP2 patterns
065 GitHub stars