01Layout-aware hierarchical chunking to preserve semantic structure
020 GitHub stars
03Abstract generation for progressive reading workflows
04Automated token counting and section mapping for easy navigation
05Dual-format output including machine-readable JSON and human-readable Markdown
06Extraction of tables, code blocks, and domain-specific benchmarks