Transcribes YouTube videos and playlists into structured Markdown documents using Gemini Flash and yt-dlp.
YouTube Ingest is a Claude Code skill that converts video content into searchable text. It uses yt-dlp to extract audio and the low-cost Gemini Flash model to transcribe it, so a single video or an entire playlist can be processed into Markdown files with metadata. It is aimed at developers, researchers, and content creators who want to build local knowledge bases, summarize lectures, or archive technical insights from YouTube within their development workflow.
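The audio-extraction step can be sketched as follows. This is a minimal illustration, not the skill's actual implementation: the function name `build_audio_command`, the `audio` output directory, and the `limit` parameter are hypothetical, and the choice of MP3 is an assumption. The yt-dlp options used (`--extract-audio`, `--audio-format`, `--output`, `--playlist-items`) are standard yt-dlp flags. The transcription call to Gemini Flash is deliberately left out.

```python
from typing import List, Optional


def build_audio_command(
    url: str, out_dir: str = "audio", limit: Optional[int] = None
) -> List[str]:
    """Build a yt-dlp argument list that downloads audio only.

    `limit` (hypothetical) caps how many playlist entries are fetched.
    """
    cmd = [
        "yt-dlp",
        "--extract-audio",            # keep only the audio stream
        "--audio-format", "mp3",      # assumed format; others are possible
        "--output", f"{out_dir}/%(id)s.%(ext)s",  # one file per video ID
    ]
    if limit is not None:
        if limit < 1:
            raise ValueError("limit must be a positive integer")
        # Select playlist items 1 through `limit` (inclusive).
        cmd += ["--playlist-items", f"1:{limit}"]
    cmd.append(url)
    return cmd
```

The returned list could be passed to `subprocess.run(cmd, check=True)` on a machine where yt-dlp and ffmpeg are installed. Building the command separately from running it keeps the step easy to inspect and test without network access.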
Key Features
01 Configurable ingestion limits for large video series
02 High-speed transcription using Gemini Flash
03 Bulk processing support for full YouTube playlists
04 8 GitHub stars
05 Structured Markdown output with YAML frontmatter
06 Automated audio extraction via yt-dlp integration
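To make "structured Markdown output with YAML frontmatter" concrete, here is a hedged sketch of how a transcript might be written out. The function name `render_markdown` and the metadata keys (`title`, `url`) are assumptions for illustration; the skill's real field names are not documented here. Values are quoted with `json.dumps`, which produces strings that are also valid double-quoted YAML scalars.

```python
import json
from typing import Dict


def render_markdown(metadata: Dict[str, str], transcript: str) -> str:
    """Return a Markdown document with a YAML frontmatter block on top."""
    lines = ["---"]
    for key, value in metadata.items():
        # JSON string quoting is safe for YAML, even with colons in the value.
        lines.append(f"{key}: {json.dumps(value, ensure_ascii=False)}")
    lines.append("---")
    lines.append("")
    lines.append(f"# {metadata.get('title', 'Untitled')}")
    lines.append("")
    lines.append(transcript.strip())
    return "\n".join(lines) + "\n"
```

Calling `render_markdown({"title": "Intro: Part 1", "url": "https://youtu.be/x"}, text)` yields a file whose frontmatter can be read by most static-site and note-taking tools that understand YAML headers.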
Use Cases
01 Converting video-based documentation into accessible text formats
02 Extracting insights and summaries from long-form video interviews
03 Building a searchable technical knowledge base from conference talks and tutorials