关于
This skill provides a powerful interface for local text-to-speech synthesis, eliminating the need for cloud APIs or internet connectivity. Leveraging the Kokoro-82M model and ONNX runtime, it offers high-performance audio generation that is specifically optimized for Apple Silicon, achieving speeds up to 50x real-time. Whether you are converting short snippets of text or rendering entire books into seamless audio files, this tool handles over 60 voices across eight languages with intelligent chunking to prevent artifacts, making it an essential utility for private and efficient voice synthesis.