01. Supports high-quality OCR through Azure Document Intelligence integration
02. Extracts content from URLs, Wikipedia pages, and YouTube transcripts
03. Handles media files via metadata extraction and optional transcription
04. Enables complex piping and batch processing workflows via CLI
05. Converts PDF, DOCX, PPTX, XLSX, and various other document formats