About
The Media Understand skill integrates Gemini 2.5 Flash into your workflow to process and interpret various multimedia formats. It allows users to perform complex tasks such as Optical Character Recognition (OCR), video summarization (including YouTube URLs), and detailed audio transcription. Whether you need to explain a chart in a screenshot, identify speakers in a meeting recording, or summarize a technical video tutorial, this skill provides a powerful bridge between raw media and actionable text-based data.