Unified API for multiple parsing drivers, including LlamaParse, PyMuPDF, and Unstructured.
Advanced PDF manipulation, including merging, splitting, and file optimization.
Granular text extraction at the page, block, line, span, or character level.
Batch processing with support for multi-worker streaming and per-file configuration.
Embedded attachment management for adding, extracting, or removing files attached to a PDF.
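
To make the unified-driver and batch-processing features above more concrete, here is a minimal sketch of how a single entry point might dispatch to different drivers and stream batch results with per-file settings. It does not reflect this project's actual API: `DRIVERS`, `register`, `parse`, and `parse_batch` are hypothetical names, and the drivers are placeholders that return strings rather than calling LlamaParse, PyMuPDF, or Unstructured.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from typing import Callable, Dict, Iterable, Iterator, Optional, Tuple

# Hypothetical driver registry: maps a driver name to a parse function.
# Real drivers would be wrapped here; these stand-ins do no real parsing.
Driver = Callable[[str], str]
DRIVERS: Dict[str, Driver] = {}


def register(name: str) -> Callable[[Driver], Driver]:
    """Decorator that adds a parse function to the registry under `name`."""
    def wrap(fn: Driver) -> Driver:
        DRIVERS[name] = fn
        return fn
    return wrap


@register("pymupdf")
def _parse_pymupdf(path: str) -> str:
    return f"[pymupdf] {path}"  # placeholder output


@register("unstructured")
def _parse_unstructured(path: str) -> str:
    return f"[unstructured] {path}"  # placeholder output


def parse(path: str, driver: str = "pymupdf") -> str:
    """Single entry point: dispatch to whichever driver is requested."""
    if driver not in DRIVERS:
        raise ValueError(f"unknown driver: {driver!r}")
    return DRIVERS[driver](path)


def parse_batch(
    paths: Iterable[str],
    per_file: Optional[Dict[str, str]] = None,
    workers: int = 4,
) -> Iterator[Tuple[str, str]]:
    """Yield (path, result) pairs as each file finishes.

    `per_file` maps a path to the driver name to use for that file;
    files without an entry fall back to the default driver.
    """
    per_file = per_file or {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {
            pool.submit(parse, p, per_file.get(p, "pymupdf")): p for p in paths
        }
        # as_completed yields futures in completion order, not input order.
        for fut in as_completed(futures):
            yield futures[fut], fut.result()


results = dict(parse_batch(["a.pdf", "b.pdf"], per_file={"b.pdf": "unstructured"}))
```

Because results are yielded in completion order, a caller that needs input order would have to collect them by path, as the `dict(...)` call does here.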