This skill integrates Google's Gemini Pro image generation capabilities into Claude, allowing developers to create, edit, and refine visual assets using natural language. It supports multi-turn image refinement, composition from up to 14 reference images, and Google Search grounding for real-time visual data. For photorealistic scenes, UI mockups, or logo designs, it offers control over resolution (up to 4K) and aspect ratio, so generated visuals can be dropped into a software development workflow.
Key Features
01 Multi-turn refinement for iterative image editing and chat-based styling
02 High-resolution generation up to 4K using Gemini-3-Pro-Image-Preview
03 Integrated Google Search grounding for data-driven infographics and visuals
04 Precise control over aspect ratios and camera-specific photorealistic prompting
05 Multi-image composition support using up to 14 reference files
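The limits listed above (up to 14 reference images, resolution up to 4K, selectable aspect ratios) can be checked before any request is sent. The following is a minimal sketch only: `ImageRequest`, its fields, and the accepted value sets are hypothetical names chosen for illustration and are not part of this skill or of any Google SDK. Only the 14-image cap and the 4K ceiling come from the description.

```python
from dataclasses import dataclass, field

# The 14-image cap and the 4K ceiling come from the feature list.
# The exact sets of accepted values below are illustrative assumptions.
MAX_REFERENCE_IMAGES = 14
ASSUMED_RESOLUTIONS = ("1K", "2K", "4K")
ASSUMED_ASPECT_RATIOS = ("1:1", "3:4", "4:3", "9:16", "16:9")


@dataclass
class ImageRequest:
    """Hypothetical description of one image generation request."""

    prompt: str
    aspect_ratio: str = "1:1"
    resolution: str = "1K"
    reference_images: list[str] = field(default_factory=list)
    use_search_grounding: bool = False

    def validate(self) -> None:
        """Raise ValueError if the request exceeds the assumed limits."""
        if not self.prompt.strip():
            raise ValueError("prompt must not be empty")
        if self.aspect_ratio not in ASSUMED_ASPECT_RATIOS:
            raise ValueError(f"unsupported aspect ratio: {self.aspect_ratio}")
        if self.resolution not in ASSUMED_RESOLUTIONS:
            raise ValueError(f"unsupported resolution: {self.resolution}")
        if len(self.reference_images) > MAX_REFERENCE_IMAGES:
            raise ValueError(
                f"at most {MAX_REFERENCE_IMAGES} reference images are allowed, "
                f"got {len(self.reference_images)}"
            )


if __name__ == "__main__":
    request = ImageRequest(
        prompt="Flat vector logo for a note-taking app",
        aspect_ratio="16:9",
        resolution="4K",
        reference_images=["palette.png", "sketch.png"],
    )
    request.validate()  # passes silently when all limits are respected
    print("request is within the assumed limits")
```

Validating locally like this keeps obviously invalid requests (for example, a fifteenth reference image) from reaching the image model at all; the actual accepted values should be taken from the Gemini API documentation.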
Use Cases
01 Iterative image manipulation and style transfer for existing visual assets
02 Generating unique logos, icons, and marketing graphics for applications
03 Rapid prototyping of UI/UX assets, stickers, and product mockups