01Memory Quantization: Includes Int8, Int4, and Binary options to reduce memory footprint by up to 32x.
02Agentic Integration: Optimized for agentic-flow with ONNX support for 75x faster execution.
03HNSW Indexing: Accelerates semantic search by 150x to 12,500x compared to linear scans.
04Persistent Storage: Utilizes sql.js (WASM) for cross-platform SQLite-based embedding caching.
050 GitHub stars
06Hyperbolic Support: Implements Poincare ball models for representing hierarchical data relationships.