01 0 GitHub stars
02 High-accuracy OCR for 100+ languages, including CJK and Arabic
03 Automated PII redaction with preset strategies for SSNs and emails
04 Digital signing and programmatic PDF form filling
05 Multi-format conversion between PDF, Office, HTML, and images
06 Structured data and table extraction to Excel or plain text