01Advanced text chunking including recursive character and semantic splitting
020 GitHub stars
03Domain-specific pipelines for codebases and specialized technical documents
04Multi-model support for OpenAI, Voyage, and open-source BGE/E5 models
05Retrieval quality evaluation metrics like Precision@K and Recall@K
06Matryoshka representation learning for optimized dimension reduction