概要
The Dataset Splitter skill automates the critical data preparation phase of machine learning by dividing raw datasets into optimized subsets. It intelligently analyzes user-defined proportions, generates robust Python code using standard libraries like scikit-learn, and executes the partitioning while ensuring data integrity. By handling complex requirements such as stratification for imbalanced data and randomization to prevent bias, this skill allows developers to move rapidly from raw data collection to model training with confidence in their evaluation sets.