01 Multi-format support for PDF, Word, Excel, PowerPoint, and ZIP archives
02 High-fidelity structure preservation for tables, headings, and lists
03 Integrated audio transcription and HTML-to-Markdown conversion
04 32 GitHub stars
05 Optimized output for LLM context injection and document analysis
06 Advanced image analysis with OCR and LLM-generated visual descriptions