概要
The GPU Architecture Advisor skill empowers developers to maximize performance on NVIDIA hardware by providing deep technical guidance tailored to specific GPU architectures. From leveraging 4th-gen Tensor Cores and the Transformer Engine in Hopper to implementing Shader Execution Reordering in Ada Lovelace, this skill helps identify the optimal compute capabilities and hardware features for any high-performance computing project. It assists with writing performance-portable CUDA code, determining hardware-specific throughput bottlenecks, and implementing efficient memory management strategies across different GPU generations.