검색 검색 결과: "audio" - 페이지 중 10페이지

Sonic Pi

Enables AI assistants to control Sonic Pi programmatically through OSC messages.

Developer Tools

Markdownify

Converts various file types and web content into Markdown format.

Developer Tools

2,542

File Finder & Whisper STT

Provides Model Context Protocol (MCP) servers for file searching and speech-to-text conversion.

Developer Tools

FFmpeg Helper

Provides FFmpeg capabilities to AI assistants via the Model Context Protocol (MCP) for video processing tasks.

Developer Tools

TTS Say

Synthesizes text into speech and plays it locally using the OpenAI TTS SDK.

API Development

Video Digest

Transcribes and summarizes video content from various online platforms using multiple transcription service providers.

Developer Tools

Zonos TTS

Enables Claude to generate speech directly on Linux systems with GPU optimizations.

API Development

Whissle

Provides access to Whissle API endpoints for speech-to-text, diarization, translation, and text summarization.

API Development

Voice Status Report

Provides voice status updates using OpenAI's text-to-speech API.

API Development

LLM Jukebox

Enables large language models to seamlessly search, download, and extract information from YouTube music videos.

API Development

Piper

Integrates Piper TTS into an MCP server for high-quality text-to-speech functionality.

API Development

Guosheng Toolbox

Provides a comprehensive multimodal AI toolkit, integrating powerful capabilities from Zhipu GLM and Pollinations.AI for advanced media analysis and generation.

API Development

JianYing

Automates JianYing video production, enabling AI assistants to create professional video content through natural language.

API Development

218

AI Agents

Orchestrates a collection of AI agents for web scraping, real-time information retrieval, and dynamic podcast generation.

Data Science & ML

Spotify

Generates Spotify playlists on the fly using natural language and advanced track similarity algorithms.

Developer Tools

Podcast

Parses Podcasting 2.0 RSS feeds to provide structured access to episode metadata and transcripts via an MCP server.

API Development

Sveriges Radio

Access Swedish Radio's open data to retrieve programs, podcasts, live streams, playlists, and news.

API Development

YouTube Search & Download

Provides an MCP server to search, retrieve information about, and download YouTube videos and audio content without requiring a YouTube API key.

API Development

RAG Sandbox

Transform static documents into intelligent, queryable knowledge bases, AI-powered podcasts, and immersive stories through a multi-tenant Agentic Retrieval-Augmented Generation platform.

Developer Tools

Transcriptor

Fetches video transcripts and subtitles from platforms like YouTube, Twitter/X, and TikTok, offering cleaned text, raw subtitle formats, and comprehensive video metadata.

API Development

RAGStack

Provides a serverless architecture for processing diverse documents and media, offering AI chat capabilities with cost-effective, scale-to-zero operations.

Learning & Documentation