Extracts text from various document formats, including PDF, Excel, and Word, via a universal server.
The Document Reader is a universal server designed for efficiently extracting text from a wide array of document formats. It supports common types such as PDF, Excel (xlsx, xlsm), Word (docx), CSV, Plain Text, JSON, and Markdown. Engineered with a streaming API, it enables memory-efficient processing of even large files. Key capabilities include smart encoding detection, process-wide rate limiting, and Docker support for isolated deployment, making it ideal for integration with MCP-compatible clients like Cursor for enhanced AI interactions with local files.