This tool serves as an enhanced Model Context Protocol (MCP) server, bridging AI clients like Claude Desktop, Cursor, and OpenAI Agents with ElevenLabs' powerful audio APIs. It extends the original ElevenLabs MCP server by introducing advanced conversational AI functionalities, including comprehensive conversation history, real-time monitoring, and detailed transcript retrieval. Furthermore, it offers robust support for the latest ElevenLabs v3 model with features like enhanced expressiveness, audio tags, and multi-speaker dialogue, alongside AI-friendly improvements such as smart voice defaults and dynamic timeouts, making it a comprehensive solution for developing sophisticated AI voice interactions.
주요 기능
01Automatic splitting of long dialogues and intelligent tag simplification for better quality.
02Enhanced ElevenLabs v3 model support with audio tags and multi-speaker dialogue.
03AI-friendly improvements including smart voice defaults and educational error messages.
04Built-in v3 proxy for accessing the ElevenLabs v3 model without direct API access.
050 GitHub stars
06Comprehensive conversation history and transcript retrieval in multiple formats.
사용 사례
01Developing sophisticated AI agents capable of generating realistic speech, cloning voices, and transcribing audio with speaker identification.
02Integrating advanced text-to-speech and conversational AI capabilities into MCP-compatible applications like Claude Desktop.
03Creating dynamic multi-speaker dialogues with emotion tags and automated formatting for rich, natural conversations.