概要
This skill equips Claude with specialized knowledge for building robust machine learning workflows using Python's industry-standard scikit-learn library. It provides domain-specific patterns for the entire ML lifecycle, including data preprocessing with ColumnTransformers, implementing supervised and unsupervised algorithms, and performing rigorous model evaluation through cross-validation. By emphasizing the use of production-grade Pipelines, the skill helps developers prevent data leakage and ensure that preprocessing steps are consistently applied during both training and inference. It is an essential companion for tasks ranging from exploratory data analysis and dimensionality reduction to complex hyperparameter tuning and model selection.