01Advanced chunking (Recursive, Semantic, and Token-based splitting)
02Multi-model comparison and selection (OpenAI, Voyage, BGE, E5)
03Local embedding implementation with Sentence Transformers
04Matryoshka dimension reduction for cost-effective storage
051 GitHub stars
06Retrieval quality evaluation metrics (Precision/Recall at K)