소개
This skill provides a structured workflow for implementing data versioning within projects, focusing on reproducibility without the overhead of complex external tools. It guides Claude to define dataset sources, generate checksums, maintain metadata manifests, and separate raw data from derived artifacts. By recording dataset versions alongside experiment outputs, it ensures that every result is linked to the specific data state that produced it, making it ideal for data science projects and ML pipelines where tracking data evolution is critical.