01One-shot pruning with Wanda and SparseGPT for rapid compression
02Support for N:M semi-structured sparsity (2:4, 4:8) for hardware acceleration
03Customizable layer-wise and iterative pruning strategies
04Accuracy-aware pruning achieving <1% loss at 50% sparsity
05384 GitHub stars
06Implementation of both structured and unstructured pruning methods