Records audio and transcribes it using OpenAI's Whisper model, functioning as a Goose custom extension or standalone server.
Plays a sound when Cursor AI completes a task to provide audio feedback.
Uses FFmpeg to resize videos and extract their audio in various formats.
Provides audio feedback for code generation in Cursor by playing sound effects.
Provides audio feedback for code completion, errors, and notifications in MCP-compatible environments.
Generates voice audio from text using the Resemble AI API, integrating with Claude and Cursor via the Model Context Protocol.
Enables AI assistants to generate images, text, and audio through the Pollinations APIs using the Model Context Protocol.
Converts text into spoken audio using OpenAI's Text-to-Speech API.
Provides sound playback on macOS, letting AI assistants and other MCP clients play system sounds, text-to-speech output, and custom audio files.
Provides an MCP server for RedPanal.org's audio platform that can list audio files, show their details, and download or upload them.
Enables AI assistants to search, analyze, and retrieve audio samples and information from Freesound.org via an MCP server.
Provides a server for video and audio editing operations through a standardized Model Context Protocol interface.
Downloads videos and audio from various internet sources, with support for agentic server operations.
Enables interaction with ElevenLabs' powerful Text to Speech and audio processing APIs through the Model Context Protocol.
Integrates ElevenLabs' AI speech capabilities, including text-to-speech, voice management, and audio transformation, into MCP clients.
Enables AI assistants to generate and control real-time audio synthesis through natural language descriptions using SuperCollider.
Transcribes long audio and video files asynchronously using Deepgram's Speech-to-Text API, overcoming common timeout limitations.
Performs AI-powered audio transcription using the optimized Whisper model, supporting multiple languages, batch processing, and various output formats.
Enables AI assistants to control Pro Tools for advanced audio production workflows and comprehensive session management.
Provides over 40 advanced FFmpeg tools for comprehensive video and audio processing, analysis, and streaming.
Exposes Auphonic audio processing capabilities via an HTTP-based Model Context Protocol server for integration with AI agents.
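
Several entries above wrap FFmpeg to resize videos and extract audio. As a rough illustration of what such a wrapper might assemble, here is a minimal Python sketch that builds FFmpeg argument lists. The function names and file paths are hypothetical and are not taken from any listed server; only the `-i`, `-vf scale=W:H`, and `-vn` options are standard FFmpeg flags.

```python
import subprocess
from typing import List


def resize_command(src: str, dst: str, width: int, height: int) -> List[str]:
    """Build an ffmpeg argument list that scales the video to width x height.

    Hypothetical helper for illustration; not part of any listed server.
    """
    return ["ffmpeg", "-i", src, "-vf", f"scale={width}:{height}", dst]


def extract_audio_command(src: str, dst: str) -> List[str]:
    """Build an ffmpeg argument list that drops the video stream (-vn).

    ffmpeg chooses the audio encoder from the output file extension.
    """
    return ["ffmpeg", "-i", src, "-vn", dst]


def run(cmd: List[str]) -> None:
    """Run a prepared command; requires ffmpeg to be installed and on PATH."""
    subprocess.run(cmd, check=True)
```

A caller could pass the result of `extract_audio_command("talk.mp4", "talk.mp3")` to `run`, assuming FFmpeg is installed. Building the arguments as a list rather than a shell string avoids quoting problems with unusual file names.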