About
This skill empowers Claude to perform rigorous assessments of machine learning models by leveraging the model-evaluation-suite plugin. It automates the generation of critical performance insights, allowing users to analyze model accuracy, recall, and precision through standardized commands like /eval-model. Whether you are validating a model before deployment, comparing multiple architectures, or identifying specific areas for optimization, this skill provides the structured data needed to make informed decisions in the machine learning lifecycle.