0144 GitHub stars
02Structure-aware splitting for code, Markdown, tables, and multi-modal PDFs
03Performance evaluation framework using retrieval precision and recall metrics
04Five-tier strategy implementation from simple fixed-size to advanced semantic chunking
05Recursive character splitting with hierarchical separators for structural preservation
06Support for advanced methods like Late Chunking and Contextual Retrieval