Automated YouTube transcript and web content extraction to Markdown
Built-in OCR for image text extraction and speech-to-text for audio transcription
Preserves document structure, including headings, lists, tables, and hyperlinks
Supports 20+ formats, including Office (DOCX, PPTX, XLSX) and PDF
Integrates with Azure Document Intelligence and GPT-4o for enhanced layout analysis