Acerca de
This skill provides expert guidance and code patterns for performing high-quality audio and video transcription using the OpenAI Whisper ecosystem. It covers multiple implementation paths including Python, C++, and GPU-accelerated versions, while providing specific patterns for model selection, frame-accurate timing synchronization, and subtitle generation in industry-standard formats like SRT and VTT. Whether you need batch processing for large media libraries or precise speaker diarization for interviews, this skill streamlines the integration of advanced speech-to-text capabilities into your video editing and content creation workflows.