소개
The Gemini Audio skill empowers Claude with advanced audio capabilities, enabling it to transcribe, summarize, and analyze audio files up to 9.5 hours in length. It supports a wide range of formats and can distinguish between speech, music, and ambient sounds, while also providing high-quality text-to-speech generation with controllable voice styles. This tool is essential for developers building applications that require deep audio understanding, meeting transcription, or natural-sounding voice responses via Google AI Studio or Vertex AI.