概要
The Awels PDF Processor is a Model Context Protocol (MCP) server specializing in robust PDF processing. It leverages docling to convert PDF documents into clean Markdown format, with the added capability of extracting images such as page images, tables, and figures. Engineered to run in isolated environments, this server proactively avoids common permission issues, ensuring reliable operation. It provides structured JSON output detailing processing results, file metadata, and statistics, making it ideal for integrating advanced document conversion and data extraction into various systems.
主な機能
- Structured JSON output with detailed processing results
- Batch Processing of multiple PDF files
- 0 GitHub stars
- Comprehensive Image Extraction from PDFs
- Isolated execution for enhanced security and permission handling
- PDF to Markdown Conversion using docling
ユースケース
- Process PDFs with precise control over recursion and output paths for specific document management workflows.
- Convert entire directories of PDF documents into Markdown for content migration or analysis.
- Extract and save all images from PDF files alongside their Markdown conversions.