About
This skill provides comprehensive guidance and implementation patterns for optimizing vector indexes in production environments. It helps developers balance the critical trade-offs between search latency, memory consumption, and recall accuracy. By providing ready-to-use templates for HNSW parameter benchmarking, scalar and product quantization, and specific database configurations like Qdrant, it ensures that AI-powered search and RAG systems can scale effectively to handle millions or billions of vectors while maintaining high performance and cost-efficiency.