关于
This skill provides a robust automated workflow for extracting text content from YouTube videos using yt-dlp and OpenAI's Whisper. It intelligently navigates the retrieval process by first checking for high-quality manual subtitles, falling back to auto-generated captions, and offering a sophisticated AI-powered transcription path using Whisper if no native subtitles exist. The skill excels at post-processing, featuring a Python-based deduplication engine that converts messy, overlapping WebVTT files into clean, readable plain text while preserving the original narrative flow.