Granular controls for DPI, image quality, and specific page ranges
Automatic extraction and processing of documents from ZIP archives
Standardized WebP output optimized for Vision Language Models (VLM)
Local document caching with searchable metadata and content hashing
Multi-format support for PDF, DOCX, PPTX, XLSX, and images
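
As a rough illustration of how page-range selection and content-hash caching could fit together, here is a minimal Python sketch. It is not taken from the project: the function names `parse_page_ranges` and `cache_key`, the `"1-3,5"` range syntax, and the choice of SHA-256 are all assumptions made for the example, and only the standard library is used.

```python
import hashlib
import json


def parse_page_ranges(spec: str) -> list[int]:
    """Expand a spec such as "1-3,5" into sorted, de-duplicated 1-based pages.

    The range syntax is an assumption for this sketch, not the tool's format.
    """
    pages: set[int] = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
            if start < 1 or end < start:
                raise ValueError(f"invalid page range: {part!r}")
            pages.update(range(start, end + 1))
        else:
            page = int(part)
            if page < 1:
                raise ValueError(f"invalid page number: {part!r}")
            pages.add(page)
    return sorted(pages)


def cache_key(content: bytes, dpi: int, quality: int, pages: list[int]) -> str:
    """Build a hypothetical cache key from the file bytes and render settings.

    Hashing the content (rather than the file name) means a renamed copy of
    the same document maps to the same key, while any change in DPI, quality,
    or selected pages yields a different key.
    """
    content_hash = hashlib.sha256(content).hexdigest()
    settings = json.dumps(
        {"dpi": dpi, "quality": quality, "pages": pages}, sort_keys=True
    )
    settings_hash = hashlib.sha256(settings.encode("utf-8")).hexdigest()[:16]
    return f"{content_hash}-{settings_hash}"


if __name__ == "__main__":
    selected = parse_page_ranges("1-3,5")
    print(selected)
    print(cache_key(b"example document bytes", dpi=150, quality=80, pages=selected))
```

Keying the cache on both the content hash and the render settings is one plausible design: the same document rendered at a different DPI or page range would be stored as a separate entry instead of overwriting the earlier one.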