About
This skill provides a comprehensive framework for selecting and implementing embedding models across various use cases, from RAG systems to specialized code search. It guides developers through the entire embedding pipeline, including document preprocessing, sophisticated chunking methods (token-based, semantic, and recursive), and the use of both proprietary APIs like OpenAI and local open-source models via Sentence Transformers. Additionally, it offers implementation templates for reducing embedding dimensions and evaluating retrieval quality using metrics like MRR and nDCG, ensuring high-performance semantic search implementations.