010 GitHub stars
02Convert PDFs, images, and Office files into clean, metadata-rich Markdown.
03Enable agentic parsing for high-accuracy processing of complex tables and figures.
04Modify documents using natural language instructions for form-filling and editing.
05Extract structured JSON data using custom schemas for invoices, contracts, and forms.
06Track document changes, highlights, and comments during the conversion process.