01Machine learning pipeline integration with scikit-learn and XGBoost
02Memory-efficient virtual columns and lazy evaluation strategies
03Fast statistical aggregations and grouping on massive data
04Interactive big data visualization including 2D heatmaps
05Out-of-core DataFrame operations for billion-row datasets
062,066 GitHub stars