01Semantic similarity matching for flexible cache hits
02Full LLM response caching for identical or similar queries
03Cache Augmented Generation (CAG) patterns for document pre-caching
04Advanced cache invalidation and TTL management strategies
051 GitHub stars
06Native Anthropic prompt prefix caching optimization