The RAG Architect skill empowers developers to build sophisticated, scalable, and accurate AI-powered retrieval systems within Claude Code. It provides deep technical guidance on document processing, embedding model selection, vector database integration (including Pinecone, Weaviate, and pgvector), and hybrid retrieval methods. Whether you are optimizing for latency, cost, or precision, this skill covers advanced implementation patterns like HyDE query transformation, reciprocal rank fusion, and automated evaluation frameworks to ensure your RAG application is enterprise-ready.
Key Features
015,225 GitHub stars
02Hybrid retrieval implementation combining dense semantic and sparse keyword search
03Comprehensive evaluation frameworks for faithfulness and context relevance
04Query transformation techniques like HyDE and Multi-query generation
05Vector database optimization for Pinecone, Weaviate, Qdrant, and pgvector
06Advanced document chunking strategies including semantic and recursive splitting