关于
YouTube is a robust Model Context Protocol (MCP) server engineered to provide AI agents with advanced capabilities for processing YouTube videos. It allows for the comprehensive extraction of video metadata, including title, description, views, and duration, without requiring video download. Furthermore, it delivers smart transcription services featuring an in-memory processing pipeline, precise Voice Activity Detection (VAD) using Silero, and extensive multilingual support across 99 languages, alongside translation capabilities. Optimized for performance with `yt-dlp` and hardware acceleration for Whisper inference, the server also incorporates intelligent file-based caching to streamline operations and prevent redundant processing.