01Hybrid search capabilities combining vector similarity with metadata filtering
02Multiple quantization modes (Binary, Scalar, Product) for up to 32x memory reduction
030 GitHub stars
04High-speed HNSW indexing with sub-100µs search latency
05Built-in MCP server for direct tool access within Claude Code
06Support for various embedding models including OpenAI, sentence-transformers, and MiniLM