Retrieval quality evaluation metrics (Precision@K, Recall@K)
Multi-model comparison and selection (OpenAI, Voyage, BGE, E5)
Matryoshka representation implementation for dimension reduction
Advanced text chunking (token, sentence, and recursive strategies)
Domain-specific embedding pipelines for code and markdown
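
To make two of the listed items concrete, here is a minimal sketch in Python of Precision@K and Recall@K, plus Matryoshka-style dimension reduction. It is an illustration under stated assumptions, not the implementation behind this listing: the function names (`precision_at_k`, `recall_at_k`, `truncate_embedding`) and the document IDs are hypothetical, retrieved IDs are assumed to be unique, and truncation is only meaningful for embedding models trained with Matryoshka representation learning.

```python
import math
from typing import Hashable, List, Sequence, Set


def precision_at_k(retrieved: Sequence[Hashable], relevant: Set[Hashable], k: int) -> float:
    """Fraction of the top-k retrieved items that are relevant (hypothetical helper)."""
    if k <= 0:
        raise ValueError("k must be a positive integer")
    hits = len(set(retrieved[:k]) & relevant)
    # Divide by k even when fewer than k items were retrieved (the common convention).
    return hits / k


def recall_at_k(retrieved: Sequence[Hashable], relevant: Set[Hashable], k: int) -> float:
    """Fraction of all relevant items that appear in the top-k results (hypothetical helper)."""
    if k <= 0:
        raise ValueError("k must be a positive integer")
    if not relevant:
        return 0.0  # Assumption: recall is reported as 0 when nothing is relevant.
    hits = len(set(retrieved[:k]) & relevant)
    return hits / len(relevant)


def truncate_embedding(vector: Sequence[float], dims: int) -> List[float]:
    """Keep the first `dims` components and L2-renormalize (Matryoshka-style reduction)."""
    if dims <= 0 or dims > len(vector):
        raise ValueError("dims must be between 1 and len(vector)")
    head = list(vector[:dims])
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head  # An all-zero prefix cannot be normalized; return it unchanged.
    return [x / norm for x in head]


if __name__ == "__main__":
    # Hypothetical ranking and relevance judgments for a single query.
    retrieved = ["d3", "d1", "d7", "d2", "d9"]
    relevant = {"d1", "d2", "d4"}
    print(precision_at_k(retrieved, relevant, 5))
    print(recall_at_k(retrieved, relevant, 5))
    print(truncate_embedding([3.0, 4.0, 12.0], 2))
```

In practice these per-query scores would be averaged over a labeled query set, and the same evaluation could be rerun on truncated vectors to see how much retrieval quality is lost at each reduced dimension.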