Direct integration with Gemini 3 Flash for high-speed multimodal analysis
Precise timestamping for scene changes and key events
Automated transcription and speaker-identified dialogue extraction
Advanced token optimization via clipping, frame rate, and resolution control
Multi-source input support for local files and public YouTube URLs
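
To make the token-optimization and multi-source features concrete, here is a minimal sketch of how a request body with a clip window, a sampling frame rate, and a reduced resolution might be assembled. It is an illustration and not this tool's actual implementation: the helper names `classify_source` and `build_request` are hypothetical, and the JSON field names (`fileData`, `videoMetadata`, `startOffset`, `endOffset`, `fps`, `mediaResolution`) are assumptions modeled on the public Gemini API's video input format, so check them against the current API reference. The sketch only builds the payload; it sends nothing and names no model.

```python
from urllib.parse import urlparse

# Hosts treated as YouTube in this sketch (an assumption, not an exhaustive list).
YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def classify_source(source: str) -> str:
    """Return 'youtube' for an http(s) YouTube URL, otherwise 'local'."""
    parsed = urlparse(source)
    if parsed.scheme in ("http", "https") and parsed.hostname in YOUTUBE_HOSTS:
        return "youtube"
    return "local"


def build_request(file_uri, prompt, start_s=None, end_s=None, fps=None,
                  low_resolution=False) -> dict:
    """Assemble a request body; clipping, fps and resolution are all optional."""
    video_part = {"fileData": {"fileUri": file_uri}}

    # Clipping and frame-rate control: only the fields that were set are included.
    metadata = {}
    if start_s is not None:
        metadata["startOffset"] = f"{start_s}s"
    if end_s is not None:
        metadata["endOffset"] = f"{end_s}s"
    if fps is not None:
        metadata["fps"] = fps
    if metadata:
        video_part["videoMetadata"] = metadata

    body = {"contents": [{"parts": [video_part, {"text": prompt}]}]}

    # Resolution control: request lower-resolution frames to spend fewer tokens.
    if low_resolution:
        body["generationConfig"] = {"mediaResolution": "MEDIA_RESOLUTION_LOW"}
    return body


if __name__ == "__main__":
    source = "https://www.youtube.com/watch?v=VIDEO_ID"  # placeholder URL
    if classify_source(source) == "youtube":
        print(build_request(source, "List scene changes with timestamps.",
                            start_s=30, end_s=90, fps=0.5, low_resolution=True))
```

In this sketch a local file is only classified, not uploaded. A real pipeline would first need to upload it (or inline its bytes) and pass the resulting URI to `build_request`.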