Analyzes Mojo source code to identify and implement SIMD optimization opportunities for high-performance tensor and array operations.
The analyze-simd-usage skill is designed for Mojo developers looking to squeeze every bit of performance out of their machine learning models and data-intensive applications. By scanning codebase patterns, it identifies loops and element-wise operations that are candidates for Single Instruction Multiple Data (SIMD) vectorization. It provides detailed guidance on utilizing decorators like @vectorize and @unroll, selecting hardware-appropriate SIMD widths, and implementing manual load/store operations to achieve 4x to 16x speed improvements in critical code paths.
Key Features
01Identification of performance-critical code hotspots
02Automated loop vectorizability assessment
03Pattern matching for element-wise tensor operations
0412 GitHub stars
05Hardware-specific SIMD width recommendations
06Implementation guidance for @vectorize and @unroll
Use Cases
01Identifying non-vectorizable dependencies that hinder performance
02Converting scalar array operations to vectorized SIMD implementations
03Optimizing machine learning training loops for faster execution