01. Built-in evaluation framework for measuring retrieval precision and recall
02. Configurable overlap and token-based sizing for optimal embedding model alignment
03. Multi-level chunking strategies from fixed-size to advanced semantic boundary detection
04. Structure-aware splitting for codebases, Markdown, and hierarchical documents
05. 126 GitHub stars
06. Advanced methods including Late Chunking and Contextual Retrieval
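
To make the overlap and token-based sizing item concrete, here is a minimal sketch of fixed-size chunking with a sliding window. It is not the project's actual API: the function names `chunk_tokens` and `chunk_text`, their parameters, and the default sizes are hypothetical, and whitespace-separated words stand in for the tokens a real embedding model's tokenizer would produce.

```python
from typing import List


def chunk_tokens(tokens: List[str], chunk_size: int, overlap: int) -> List[List[str]]:
    """Split tokens into windows of at most chunk_size tokens.

    Consecutive windows share `overlap` tokens, except that the final
    window may be shorter when the input does not divide evenly.
    (Hypothetical helper, not taken from the project.)
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks: List[List[str]] = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        # Stop once a window reaches the end, so no chunk consists
        # only of tokens already covered by the previous chunk.
        if start + chunk_size >= len(tokens):
            break
    return chunks


def chunk_text(text: str, chunk_size: int = 200, overlap: int = 20) -> List[str]:
    """Chunk text by token count.

    Assumption: whitespace splitting approximates tokenization; a real
    setup would use the embedding model's own tokenizer instead.
    """
    tokens = text.split()
    return [" ".join(chunk) for chunk in chunk_tokens(tokens, chunk_size, overlap)]
```

For example, `chunk_text("a b c d e f g h i j", chunk_size=4, overlap=1)` yields three chunks, `"a b c d"`, `"d e f g"`, and `"g h i j"`, where each chunk repeats the last token of the one before it. Measuring size in tokens rather than characters keeps each chunk within the embedding model's input limit, and the overlap reduces the chance that a sentence split across a boundary is lost to retrieval.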