What embedding models are supported by this skill?

The skill includes templates and comparisons for OpenAI (text-embedding-3), Voyage-2, BGE-large, E5, and lightweight models like MiniLM.

Is it suitable for code-specific search applications?

Absolutely, the skill includes specialized pipelines for code embedding and semantic markdown sectioning designed for technical repositories.

Can this help with document chunking for better RAG accuracy?

Yes, it provides several production-ready strategies including token-based, sentence-based, and recursive character splitting to optimize context window usage.

Does it support dimension reduction for faster vector search?

Yes, it includes implementation details for Matryoshka embeddings (like OpenAI's dimensions parameter) to reduce vector sizes while maintaining high retrieval performance.

Embedding Strategies & RAG Optimization

Name: Embedding Strategies & RAG Optimization
Author: as4584

byas4584

0•

데이터 과학 및 ML

Implements and optimizes text embedding models and chunking strategies for high-performance semantic search and RAG applications.

This skill provides a comprehensive framework for managing the embedding lifecycle in LLM applications. It guides developers through selecting the right model—ranging from high-accuracy proprietary APIs like OpenAI and Voyage to lightweight, open-source local options like BGE. It includes production-ready templates for advanced chunking techniques such as recursive character splitting and semantic sectioning, ensuring context is preserved for better retrieval. Whether you are building a document search engine or fine-tuning for specialized domains like legal or code, this skill provides the implementation patterns needed for accurate, cost-effective, and scalable vector search.

주요 기능

01Retrieval quality evaluation metrics (Precision@K, Recall@K)

02Multi-model comparison and selection (OpenAI, Voyage, BGE, E5)

03Matryoshka representation implementation for dimension reduction

04Advanced text chunking (token, sentence, and recursive strategies)

050 GitHub stars

06Domain-specific embedding pipelines for code and markdown

사용 사례

01Optimizing vector search latency and cost by reducing embedding dimensions

02Building a Retrieval-Augmented Generation (RAG) system for technical documentation

03Implementing multilingual semantic search for global enterprise applications

What are Skills?·How to Install

Install with 🐟 Skill.Fish

npx skillfish add as4584/antigravity-skills embedding-strategies

For use in Claude.ai and ChatGPT

Download Skill