01 Built-in OCR for text extraction from images and scanned documents
02 AI-powered image descriptions for rich visual context in documents
03 114 GitHub stars
04 Supports 15+ formats including PDF, DOCX, PPTX, and XLSX
05 Speech-to-text transcription for audio files and YouTube URLs
06 Automated generation of scientific schematics and diagrams