About
This skill automates the creation and optimization of FastAPI-based inference services for machine learning models within the ML Deployment domain. It provides comprehensive support for model serving patterns, MLOps pipelines, and production optimization, helping developers transition from local notebooks to robust, scalable, and well-documented API endpoints that follow industry best practices.