Search Search results for "audio" - Page 2 of

Voice Recorder

Records audio and transcribes it using OpenAI's Whisper model, functioning as a Goose custom extension or standalone server.

Developer Tools

Sound

Plays a sound when Cursor AI completes a task to provide audio feedback.

Developer Tools

FFmpeg Video Processor

Manipulates video files using FFmpeg to resize videos and extract audio in various formats.

Developer Tools

Cursor Sound

Provides audio feedback for code generation in Cursor by playing sound effects.

Developer Tools

Py-Sound

Provides audio feedback for code completion, errors, and notifications in MCP-compatible environments.

Developer Tools

Resemble AI

Generates voice audio from text using the Resemble AI API, integrating with Claude and Cursor via the Model Context Protocol.

API Development

Pollinations

Enables AI assistants to generate images, text, and audio through the Pollinations APIs using the Model Context Protocol.

API Development

Blabber

Converts text into spoken audio using OpenAI's Text-to-Speech API.

API Development

Make Sound

Provides comprehensive sound playback capabilities for macOS, enabling AI assistants and other MCP clients to play system sounds, text-to-speech, and custom audio files.

Developer Tools

RedPanal

Provides an MCP server to manage and interact with RedPanal.org's audio platform, enabling listing, detailing, downloading, and uploading of audio files.

Data Science & ML

Freesound

Enables AI assistants to search, analyze, and retrieve audio samples and information from Freesound.org via an MCP server.

API Development

Video Edit

Provides a powerful server for advanced video and audio editing operations via a standardized Model Context Protocol interface.

Developer Tools

Media Downloader

Download videos and audio from various internet sources, with support for agentic server operations.

Developer Tools

ElevenLabs

Enables interaction with ElevenLabs' powerful Text to Speech and audio processing APIs through the Model Context Protocol.

API Development

ElevenLabs

Integrate ElevenLabs' advanced AI speech capabilities, including text-to-speech, voice management, and audio transformation, into MCP clients.

API Development

Wave

Enables AI assistants to generate and control real-time audio synthesis through natural language descriptions using SuperCollider.

API Development

Deepgram

Transcribes long audio and video files asynchronously using Deepgram's Speech-to-Text API, overcoming common timeout limitations.

API Development

Fast Whisper

Performs AI-powered audio transcription using the optimized Whisper model, supporting multiple languages, batch processing, and various output formats.

Developer Tools

Pro Tools

Enables AI assistants to control Pro Tools for advanced audio production workflows and comprehensive session management.

API Development

FFmpeg

Provides over 40 advanced FFmpeg tools for comprehensive video and audio processing, analysis, and streaming.

Developer Tools

Auphonic

Exposes Auphonic audio processing capabilities via an HTTP-based Model Context Protocol server for integration with AI agents.

API Development