PDF Reader
Extracts and processes content, metadata, and specific pages from PDF documents following the Model Context Protocol (MCP) specification.
Acerca de
This Model Context Protocol (MCP) server offers a comprehensive solution for securely and efficiently interacting with PDF documents. It provides specialized tools for extracting plain text, retrieving document metadata, processing content from specific pages or ranges, and validating PDF file integrity. Designed with a focus on enterprise-grade security, it features directory traversal protection, configurable file size and memory limits, and processing timeouts. Built with TypeScript, it ensures type-safety, robust error handling, and structured logging, making it a production-ready choice for integrating advanced PDF capabilities into AI applications and various development workflows.
Características Principales
- Extract plain text from PDFs with formatting preservation
- 0 GitHub stars
- Retrieve comprehensive document metadata (title, author, dates, page count)
- Enhanced security features including input validation, resource limits, and no temporary file persistence
- Validate PDF file integrity and readability
- Process content from specific pages or page ranges with flexible syntax
Casos de Uso
- Automate document processing workflows for text and metadata retrieval
- Integrate PDF content extraction and processing into AI applications (e.g., Claude, ChatGPT)
- Validate incoming PDF files for integrity before further handling