Transcribes YouTube video content using available transcripts or audio transcription.
This tool offers APIs to fetch and transcribe YouTube video content, supporting both REST API (Flask) and MCP server implementations. It automatically detects available transcripts in multiple languages (English and Vietnamese), and falls back to audio transcription using Whisper when transcripts are unavailable. It provides features for language detection, temporary file cleanup, and progress reporting, ensuring efficient and accurate transcript generation.