01Multi-format Document Parsing (PDF, Word, Excel, PowerPoint)
02OCR Image Recognition with multi-language support
03Batch Processing for parallel file extraction and search
04Comprehensive PDF Manipulation (merge, split, extract pages, watermarking, encryption support)
052 GitHub stars
06AI-powered Semantic Search using vector embeddings