01High-accuracy VLM-based parsing for complex academic and technical layouts
02Multi-column support and automated OCR error cleanup
03Precision extraction of tables, mathematical formulas, and figures
042 GitHub stars
05Intelligent tool selection based on document complexity and API availability
06Batch processing support for managing multiple document queues