About
The Dataset Splitter skill streamlines the data preparation phase of machine learning projects by automatically dividing raw datasets into optimized subsets. It analyzes user requirements for specific ratios, generates robust Python code using standard libraries, and executes the partitioning to produce clean training, validation, and test files. This skill ensures data integrity and supports advanced techniques like randomization and stratification, making it an essential tool for developers and data scientists building and evaluating predictive models within the Claude Code environment.