概要
The add-golden skill streamlines the creation of high-quality evaluation datasets for LLMs and RAG systems by automating the curation process. Using a sophisticated multi-agent architecture, it fetches web content, classifies document types, and performs parallel analysis to score quality across dimensions like accuracy and relevance. It ensures dataset integrity through automated duplicate detection, schema validation, and difficulty classification, providing developers with a robust, production-ready pipeline for managing ground-truth data within Claude Code.