01Comprehensive DOCX to HTML/Markdown conversion with preserved formatting
02Detailed plain text extraction with word count
03Robust handling of rich formatting elements, lists, and tables
04In-depth document structure and formatting analysis
051 GitHub stars
06Flexible image extraction (as base64 or saved to files)