소개
This skill automates the evaluation of Nixtla forecasting experiments by transforming raw metrics into comprehensive, production-ready benchmark reports. It calculates key summary statistics, identifies best-performing models, detects performance regressions against historical baselines, and generates actionable recommendations. Designed for data scientists and ML engineers, it streamlines the evaluation process from hours to minutes, ensuring consistent reporting and rigorous quality control across time-series forecasting workflows.