01Adaptive GMM-guided pivot filtering based on actual score distributions
02Validated workflows for datasets with 1.7M+ protein cells
03Sqrt-scaled matching ratio calculation to prevent noisy over-sampling
04Automated diagnostic checks for dataset modal imbalance
050 GitHub stars
06Conservative propagation filtering logic to minimize cumulative data loss