Extracts text from PDF documents using the robust `pdftotext` utility, designed for reliable integration with Model Context Protocol servers.
PDFtotext provides a reliable and lightweight Model Context Protocol (MCP) server for extracting text from PDF documents. Unlike other PDF MCP servers, it focuses on clean JSON-RPC communication without stdout pollution, ensuring consistent and predictable results. Built upon the mature `pdftotext` utility from poppler-utils, it offers features such as extracting text from entire documents or specific pages, preserving original layout formatting, supporting multiple text encodings, and providing comprehensive metadata. It includes robust file validation, security checks, and detailed error reporting, making it production-tested and feature-complete for various MCP client environments.