概要
This skill provides a comprehensive framework for breaking large documents into semantically meaningful segments for vector databases and RAG pipelines. It guides developers through five levels of strategy complexity—ranging from basic fixed-size splitting to advanced embedding-based semantic boundary detection. By optimizing how data is ingested into vector stores, this skill ensures that Claude and other LLMs receive the most relevant, coherent context, directly improving the precision of AI-generated responses and reducing hallucination in document-heavy applications.