01 Document Discovery for specified directories
02 Document Processing to convert various formats to markdown
03 OCR Support for text extraction from scanned PDFs using Tesseract
04 Automatic Token Management and content truncation
05 Multi-format Support for Word, PDF, PowerPoint, Excel, and more
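
The discovery and truncation features listed above could be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the tool's actual implementation: the function names, the extension set, and the whitespace-based token count are all assumptions made for the example. A real implementation would likely use a model-specific tokenizer.

```python
from pathlib import Path

# Hypothetical extension set; the tool's real list of supported formats is not given.
SUPPORTED_EXTENSIONS = {".docx", ".pdf", ".pptx", ".xlsx"}


def discover_documents(root, extensions=SUPPORTED_EXTENSIONS):
    """Return sorted paths of files under `root` whose suffix is in `extensions`."""
    root_path = Path(root)
    return sorted(
        p for p in root_path.rglob("*")
        if p.is_file() and p.suffix.lower() in extensions
    )


def truncate_to_token_budget(text, max_tokens):
    """Keep at most `max_tokens` whitespace-separated words.

    Whitespace splitting is only a rough stand-in for a real tokenizer.
    """
    if max_tokens < 0:
        raise ValueError("max_tokens must be non-negative")
    words = text.split()
    if len(words) <= max_tokens:
        return text  # already within budget, return unchanged
    return " ".join(words[:max_tokens])
```

In this sketch, discover_documents("docs") would walk a directory named docs recursively, and truncate_to_token_budget(markdown_text, 4000) would cap the converted content before it is passed on. Both the directory name and the budget are placeholder values.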