Introduction
This skill provides guidance for tuning vector search infrastructure in RAG and AI applications. It helps developers navigate the trade-offs of vector indexing among recall, latency, and memory, with templates for HNSW parameter tuning, quantization strategies such as Scalar Quantization (SQ) and Product Quantization (PQ), and memory estimation. Whether you are scaling to billions of vectors or reducing latency for a real-time application, it offers benchmarks and implementation patterns for keeping search fast and accurate.
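To make the memory-estimation idea concrete, here is a minimal back-of-the-envelope sketch in Python. It is not taken from the skill's own templates: the function names, the default `m_links=16`, and the cost model are illustrative assumptions. The model assumes 4-byte neighbor ids, up to `2 * M` links per node on the bottom HNSW layer, and that upper layers and allocator overhead are small enough to ignore. Real libraries will differ.

```python
# Rough memory estimates for a vector index. All formulas are approximations
# under the assumptions stated in each docstring, not measurements of any library.

def hnsw_graph_bytes(num_vectors: int, m_links: int = 16, link_bytes: int = 4) -> int:
    """Approximate HNSW link storage.

    Assumption: layer 0 holds up to 2 * M neighbor ids per node and dominates;
    upper layers and per-node bookkeeping are ignored.
    """
    return num_vectors * 2 * m_links * link_bytes


def flat_vector_bytes(num_vectors: int, dim: int, bytes_per_component: int = 4) -> int:
    """Vector storage: 4 bytes per component for float32, 1 for int8 scalar quantization."""
    return num_vectors * dim * bytes_per_component


def pq_bytes(num_vectors: int, dim: int, num_subvectors: int, bits_per_code: int = 8) -> int:
    """Product Quantization storage: per-vector codes plus float32 codebooks.

    Assumption: dim is divisible by num_subvectors, and each sub-quantizer
    has 2 ** bits_per_code centroids of dim / num_subvectors components.
    """
    codes = num_vectors * num_subvectors * bits_per_code // 8
    codebooks = (2 ** bits_per_code) * dim * 4
    return codes + codebooks


def estimate_hnsw_bytes(num_vectors: int, dim: int, m_links: int = 16,
                        bytes_per_component: int = 4) -> int:
    """Total estimate: vector storage plus graph links."""
    return (flat_vector_bytes(num_vectors, dim, bytes_per_component)
            + hnsw_graph_bytes(num_vectors, m_links))


if __name__ == "__main__":
    n, d = 1_000_000, 768  # hypothetical corpus size and embedding dimension
    print(f"float32 + HNSW: {estimate_hnsw_bytes(n, d) / 1e9:.2f} GB")
    print(f"int8 SQ + HNSW: {estimate_hnsw_bytes(n, d, bytes_per_component=1) / 1e9:.2f} GB")
    print(f"PQ (96x8 bit) + HNSW: {(pq_bytes(n, d, 96) + hnsw_graph_bytes(n)) / 1e9:.2f} GB")
```

Under these assumptions, one million 768-dimensional float32 vectors with `M = 16` come to roughly 3.2 GB, almost all of it raw vector storage. That is why quantization tends to reduce memory far more than lowering `M`. Treat the numbers as a starting point for capacity planning and verify them against the actual index.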