01Support for CSV and large-scale dataset management
023 GitHub stars
03Customizable data partitioning ratios for training, validation, and testing
04Randomized shuffling to ensure model objectivity and prevent bias
05Stratified splitting capabilities for handling imbalanced class distributions
06Automated Python code generation and execution using standard ML libraries