01 1 GitHub star
02 Comprehensive format conversion (PDF, Office, HTML, images)
03 Structured data and table extraction to Excel or text
04 High-accuracy OCR supporting 100+ languages
05 Programmatic PDF form filling and digital signatures
06 Automated PII redaction with regex and preset strategies
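
To illustrate what regex-plus-preset PII redaction can look like, here is a minimal sketch in plain Python that operates on already-extracted text. It is not this tool's API: the `PRESETS` table, the `redact` function, its parameters, and the `[REDACTED]` mask are all hypothetical names chosen for the example, and the two patterns are deliberately simple.

```python
import re

# Hypothetical preset patterns for illustration only; a real tool's
# presets are likely broader and more carefully validated.
PRESETS = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def redact(text, presets=("email", "us_ssn"), extra_patterns=(), mask="[REDACTED]"):
    """Replace every match of the chosen presets and custom regexes with `mask`."""
    patterns = [PRESETS[name] for name in presets]
    # Custom regexes are compiled on the fly and applied after the presets.
    patterns += [re.compile(p) for p in extra_patterns]
    for pattern in patterns:
        text = pattern.sub(mask, text)
    return text


if __name__ == "__main__":
    print(redact("Mail jane@example.com, SSN 123-45-6789."))
    print(redact("Order ID-4421", presets=(), extra_patterns=[r"ID-\d+"]))
```

Keeping presets in a named table and accepting extra patterns separately lets callers combine built-in strategies with their own regexes. Redacting a PDF itself additionally requires removing the underlying text objects, not just masking a string, which this sketch does not attempt.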