01Convert diverse document types (PDF, Word, PPT, images) to Markdown format.
02Utilize OCR functionality for text extraction from image-based content.
034 GitHub stars
04Retrieve a list of supported OCR languages via a dedicated tool.
05Configurable API keys and base URLs for MinerU services via environment variables.
06Process files from local paths or URLs, with support for batch operations.