This project implements a LangChain-powered Retrieval-Augmented Generation (RAG) pipeline hosted as a FastMCP server, specifically designed for seamless integration with Claude Desktop. It leverages `langchain_chroma` for persistent, domain-based vector stores, `langchain_huggingface` for local or Hugging Face embedding models, and `HuggingFaceCrossEncoder` for advanced reranking, enhancing document relevance. Each Chroma collection represents a distinct knowledge domain, allowing Claude queries to be routed to the appropriate collection to retrieve top-k results, providing relevant context and citations for domain-aware and citation-based responses.