The Document Processor enables AI assistants to access and search your private documents, codebases, and latest tech information, overcoming the limitations of LLM knowledge cutoffs. By processing Markdown, text, PDFs, and other document types into a searchable database, this tool extends AI knowledge beyond its initial training data, ensuring that AI assistants have access to the most up-to-date and relevant information. Built with Docker and supporting both free and paid embeddings, it seamlessly integrates into your workflow, keeping your AI assistants informed with your specific data.
主な機能
01Dockerized for easy setup and deployment.
025 GitHub stars
03Processes Markdown, text, and PDF files into a searchable database.
04Supports free local embeddings and paid OpenAI embeddings.
05Extends AI knowledge beyond initial training data.
06Configurable chunking and embedding parameters.