About
This skill provides production-ready orchestration patterns for developers building language learning applications with the Speak API. It targets common sources of latency in three ways: automated audio preprocessing (normalization and silence trimming), multi-level caching with LRU and Redis, and request batching with DataLoader. It is intended for engineering teams that want to reduce P95 latency and improve the responsiveness of real-time speech recognition and pronunciation scoring features.
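
As a rough illustration of how preprocessing and multi-level caching can fit together, here is a minimal Python sketch. It is not the skill's actual implementation: every name in it (`normalize_peak`, `trim_silence`, `TwoLevelCache`, `cached_score`, `score_fn`) is hypothetical, a plain dict stands in for Redis, and `score_fn` is a placeholder for whatever call reaches the Speak API. Audio is assumed to be a list of float samples in the range -1.0 to 1.0.

```python
import hashlib
import struct
from collections import OrderedDict


def normalize_peak(samples, target_peak=0.9):
    """Scale samples so the loudest one has magnitude target_peak."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)  # all-silent clip: nothing to scale
    gain = target_peak / peak
    return [s * gain for s in samples]


def trim_silence(samples, threshold=0.02):
    """Drop leading and trailing samples quieter than threshold."""
    start, end = 0, len(samples)
    while start < end and abs(samples[start]) < threshold:
        start += 1
    while end > start and abs(samples[end - 1]) < threshold:
        end -= 1
    return samples[start:end]


def audio_key(samples):
    """Hash 16-bit quantized samples so identical clips share one cache entry."""
    quantized = [max(-32767, min(32767, int(round(s * 32767)))) for s in samples]
    payload = struct.pack(f"<{len(quantized)}h", *quantized)
    return hashlib.sha256(payload).hexdigest()


class TwoLevelCache:
    """Small in-process LRU in front of a shared store.

    Assumption: shared_store is any dict-like object; in a real deployment
    it would be replaced by a Redis client wrapper.
    """

    def __init__(self, capacity, shared_store=None):
        self.capacity = capacity
        self.local = OrderedDict()
        self.shared = shared_store if shared_store is not None else {}

    def get(self, key):
        if key in self.local:
            self.local.move_to_end(key)  # mark as most recently used
            return self.local[key]
        if key in self.shared:
            value = self.shared[key]
            self._put_local(key, value)  # promote to the fast level
            return value
        return None

    def put(self, key, value):
        self._put_local(key, value)
        self.shared[key] = value

    def _put_local(self, key, value):
        self.local[key] = value
        self.local.move_to_end(key)
        while len(self.local) > self.capacity:
            self.local.popitem(last=False)  # evict least recently used


def cached_score(samples, cache, score_fn):
    """Preprocess a clip, then call score_fn only on a cache miss."""
    cleaned = trim_silence(normalize_peak(samples))
    key = audio_key(cleaned)
    result = cache.get(key)
    if result is None:
        result = score_fn(cleaned)
        cache.put(key, result)
    return result
```

Normalizing before trimming means the silence threshold is applied relative to the clip's own peak rather than to its absolute recording level. Hashing the cleaned audio instead of the raw upload lets clips that differ only in leading or trailing silence share a cache entry, which is one plausible way such a pipeline avoids repeated scoring requests.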