概要
This skill provides a comprehensive framework for ensuring Perplexity-powered applications remain resilient and performant under heavy traffic. It automates the generation of k6 load testing scripts, configures Kubernetes Horizontal Pod Autoscalers (HPA) using custom metrics, and implements capacity planning logic to prevent rate-limiting or resource exhaustion. By integrating connection pooling patterns and standardized benchmarking templates, it helps developers maintain high availability and optimize infrastructure costs for production-grade AI services.