01. Audio transcription support for converting MP3 and WAV files to text
02. Clean Markdown output that preserves table structures and document hierarchy
03. 7 GitHub stars
04. Multi-format support for PDF, Word, Excel, and PowerPoint files
05. Built-in OCR capabilities for extracting text from images (JPG, PNG)
06. Unified Bash-based CLI integration for automated document parsing