Integrates Google's Gemini models into your Claude workflow to process multimodal inputs like images, video, and audio.
The Gemini CLI Assistant is a specialized skill that allows Claude to delegate complex multimodal tasks to Google's Gemini models. It provides a bridge for scenarios where users need to analyze visual assets, listen to audio recordings, or summarize video content—areas where Gemini's large context window and multimodal capabilities excel. By automating the hand-off to a dedicated gemini-executor, it streamlines the process of getting high-quality AI insights from diverse file formats directly within the Claude Code environment.
Características Principales
01Multimodal analysis support for images, video, and audio files
02Seamless delegation to the gemini-executor sub-agent
03Automated intent recognition for Gemini-specific tasks
04Deep project-wide scanning and structural analysis
051 GitHub stars
06Consolidated reporting of Gemini's insights back to the Claude terminal
Casos de Uso
01Performing high-level architectural reviews of entire project directories
02Summarizing key points from long video tutorials or meeting recordings
03Analyzing UI screenshots or design assets for implementation feedback