Image Generator
Enables AI assistants to generate and edit images leveraging Google's Gemini 2.5 Flash Image API within MCP-compatible tools.
About
This powerful server seamlessly integrates advanced image generation and editing capabilities into AI assistants like Claude Code and Cursor. Utilizing Google's Gemini 2.5 Flash Image API, it allows users to create images from text prompts, transform existing images with natural language instructions, and apply advanced options such as multi-image blending, character consistency, and world knowledge integration. Output images are saved as files in various formats (PNG, JPEG, WebP) for easy access and workflow integration.
Key Features
- Image editing using natural language instructions
- AI-powered image generation from text prompts
- 1 GitHub stars
- Multiple output formats: PNG, JPEG, WebP support
- Generated images saved as files for easy access
- Advanced options: multi-image blending, character consistency, world knowledge integration
Use Cases
- Editing specific elements within an image, such as a person's orientation
- Generating serene landscapes or complex scenes from natural language prompts
- Maintaining consistent character appearance across multiple generated images