Overview
Enable Claude to build voice-enabled applications with the Google Gemini Live API. This skill provides technical guidance on managing low-latency WebSocket connections, handling natural speech interruptions, and configuring bidirectional audio streaming. It also covers tool integration through function calling, session persistence with resumption tokens, and voice customization for specific tones and cadences. Use it when debugging audio sample rate conversions or implementing secure ephemeral token authentication for production.