Extracts and stores documentation from Microsoft Learn and GitHub URLs into PocketBase, enabling powerful search and retrieval for AI agents.
The Document Extractor is an MCP (Model Context Protocol) server designed to centralize and make accessible external documentation. It efficiently extracts content from popular platforms like Microsoft Learn and GitHub, parsing articles and files to capture rich metadata. This extracted information is then stored in a PocketBase instance, providing robust full-text search capabilities and persistent storage. By leveraging the latest MCP SDK features, it offers dynamic tool management, session handling, and multiple transport options, making it an ideal backend for AI assistants requiring up-to-date and searchable external knowledge.