01 0 GitHub stars
02 Automated PII redaction using presets or custom regex patterns
03 Structured data extraction for text and tables (export to Excel)
04 Universal document conversion (PDF, Office, HTML, and images)
05 Comprehensive PDF manipulation including digital signatures and form filling
06 High-accuracy OCR for scanned documents in over 100 languages
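
To make the redaction item above concrete, here is a minimal sketch of how preset and custom regex patterns could be combined to mask PII in plain text. It uses only Python's standard `re` module and is not this tool's API: the `PRESETS` table, the preset names, the `redact` function, and the `[REDACTED]` mask are all assumptions made for illustration.

```python
import re

# Hypothetical presets for illustration only; these are not the tool's
# actual preset names or patterns.
PRESETS = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def redact(text, presets=(), custom_patterns=(), mask="[REDACTED]"):
    """Replace every match of the chosen presets and custom regexes with mask."""
    patterns = [PRESETS[name] for name in presets]
    patterns += [re.compile(p) for p in custom_patterns]
    for pattern in patterns:
        text = pattern.sub(mask, text)
    return text


sample = "Mail jo@example.com, SSN 123-45-6789, ref AB-1234"
redacted = redact(
    sample,
    presets=("email", "us_ssn"),
    custom_patterns=[r"\bAB-\d{4}\b"],  # hypothetical internal reference format
)
print(redacted)
```

Simple patterns like these will miss some PII formats and can over-match, so a real redaction pipeline would likely pair them with validation or review.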