01 Speech-to-text transcription for audio files and YouTube URLs
02 Built-in OCR for extracting text from scanned documents and images
03 AI-enhanced image descriptions for technical and scientific figures
04 3,718 GitHub stars
05 Supports 15+ formats including PDF, DOCX, XLSX, PPTX, and EPUB
06 Token-efficient Markdown output that preserves tables and document structure