Downloads video and audio content from various online platforms for use with Large Language Models.
Enables AI assistants like Claude to interact with your computer's audio system for recording and playback.
Enables Claude to control audio playback on your computer.
Transcribes audio files using OpenAI's Whisper API within an MCP server environment.
Enables voice interaction with modern audio visualization for the Goose platform.
Analyzes music audio from local files, YouTube links, or audio URLs using librosa, Whisper, and LLMs.
Enables interaction with Text to Speech and audio processing APIs for MCP clients.
Generates images, text, and audio from text prompts using a free and open-source API.
Processes audio files through OpenAI's transcription and speech services via the Model Context Protocol.
Transcribes audio files to text using a speech recognition API.
Provides audio playback functionality for AI agents, enabling notification sounds for task completion.
Transcribes, summarizes, and analyzes audio content using intelligent processing.
Converts audio to text using various recognition engines and supports multiple formats and languages.
Performs professional AI-powered audio processing and stem manipulation for music production and engineering workflows.
Transcribes audio files and live microphone input into text using advanced AI models.
Downloads audio from popular streaming services, converts audio formats, and edits metadata tags.
Enables natural language control of the Carla audio plugin host for professional audio production workflows.
Enables AI assistants to search and download public domain and Creative Commons licensed audio from the Internet Archive.
Generates AI-powered podcasts with customizable voices and transforms text into natural-sounding conversational audio.
Provides local audio transcription capabilities using the Whisper speech-to-text model through a lightweight Model Context Protocol (MCP) server.
Processes video and audio files by converting, compressing, trimming, and extracting media with FFmpeg through natural language commands.