01Multi-format document ingestion with automatic format detection
02Concurrent document processing and batch database writes for efficiency
03Dual embedding support: OpenAI (recommended) or Ollama (free, local)
04Support for URLs, single files, and entire directories as ingestion sources
05Persistent vector storage using ChromaDB
060 GitHub stars