Synthesizes speech from text using the VOICEVOX engine within an MCP server environment.
Integrates ElevenLabs text-to-speech API to generate audio from text, manage voices, and track generation history.
Enables AI agents to compose, mix, and master music tracks within the REAPER digital audio workstation.
Enables AI assistants to initiate and manage voice calls using Twilio and OpenAI.
Enables text-to-speech capabilities using the Rime API, playing audio through the system's native audio player.
Enables interaction with ElevenLabs' Text to Speech and audio processing APIs through the Model Context Protocol.
Extracts watermark-free video links, video captions, and audio transcriptions from Douyin (the Chinese counterpart of TikTok) share links.
Provides speech processing services, including audio validation, speech transcription, and voice activity detection, using Alibaba's FunASR library.
Provides text-to-speech generation with automatic audio playback using the Chatterbox TTS model.
Provides headless, zero-runtime video and audio editing capabilities using FFmpeg and MCP.
Provides access to AI Xiaozhi's voice and smart-assistant features through a versatile Python-based client, with no dedicated hardware required.
Provides an MCP server to expose REAPER Digital Audio Workstation functionality via a clean API.
Provides a comprehensive system for training and running inference with singing voice models, complete with development and testing environments.
Empower AI agents and desktop clients to generate music through natural language commands using an advanced AI music platform.
Transcribe video and audio content using multiple automatic speech recognition (ASR) providers, including local Whisper models and online services like JianYing (CapCut) and Bcut (Bilibili).
Generate infinite-length, high-quality talking head videos from a single image and audio input.
Integrates OpenAI's Text-to-Speech API into Claude Code, providing developers with audio feedback directly within their coding environment.
Synthesize realistic audio using the Qwen3-TTS 1.7B model with advanced voice design and cloning capabilities.
Provides comprehensive control over ProPresenter presentations by exposing a wide range of its API functionality through a Model Context Protocol (MCP) server.
Generate natural-sounding audio from text with multi-voice synthesis, emotional speech, and real-time streaming capabilities.
Generates natural-sounding audio from text for AI assistants and developers, offering multi-voice synthesis, real-time streaming, and SSML support.
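All of the servers above expose their functionality as MCP tools, which clients invoke with JSON-RPC 2.0 `tools/call` requests. A minimal sketch of such a request for a hypothetical text-to-speech tool (the tool name and argument keys are illustrative, not taken from any specific server listed here):

```python
import json

def make_tool_call(request_id: int, tool_name: str, arguments: dict) -> str:
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message.

    MCP clients send messages of this shape to a connected server;
    the tool name and arguments passed in are server-specific.
    """
    return json.dumps({
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    })

# Hypothetical TTS invocation -- actual tool names and schemas vary per server.
request = make_tool_call(1, "synthesize_speech", {"text": "Hello", "voice": "default"})
```

In practice an MCP client library (such as the official SDKs) handles framing, transport, and response parsing; the snippet only shows the request payload each of these servers ultimately receives.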