01883 GitHub stars
02Automated partitioning into training, validation, and testing sets
03Customizable split ratios (e.g., 70/15/15 or 80/20 distributions)
04Maintains data integrity across CSV and other structured formats
05Generation of executable Python code using standard ML libraries
06Randomization logic to prevent selection bias in subsets