About
This tool functions as an MCP server designed to extract content from PDF files. It offers the 'extract-pdf-contents' tool, which requires a PDF file path as input and allows for optional page selection using comma-separated values, supporting negative indexing for page numbers. The server incorporates both PDF file reading and OCR capabilities to ensure comprehensive content extraction.
Key Features
- Extracts text from PDF files
- Supports specifying page ranges for extraction
- Uses OCR to extract text from scanned PDFs
- Accepts local file paths as input
Use Cases
- Automating data extraction from PDF documents
- Processing large volumes of PDF reports for analysis
- Integrating PDF content extraction into automated workflows