关于
This skill provides expert guidance for fine-tuning vector search indexes to achieve the ideal balance between search latency, memory consumption, and recall accuracy. It offers specialized implementation patterns for HNSW parameter optimization and data compression techniques like product quantization and binary quantization, making it essential for developers scaling RAG pipelines or large-scale similarity search systems to millions or billions of vectors. Whether you are using Qdrant, Milvus, or HNSWlib, this skill helps automate benchmarking and configuration for production-grade performance.