Introduction
This skill gives Claude expert-level guidance for implementing deep learning architectures and training pipelines in PyTorch. It ranges from basic tensor operations and custom nn.Module construction to performance optimizations such as torch.compile, mixed-precision training, and distributed training with DistributedDataParallel (DDP). Whether you are building a simple MLP or a complex transformer-based model, the skill applies best practices for memory management, GPU acceleration, and production-ready model deployment.