Support for reference images and image-to-image editing tasks.
Text-to-image generation using the Gemini 3.1 Flash, Gemini 3 Pro, and Gemini 2.5 Flash models.
Google Search grounding for accurate rendering of real-world locations and brands.
Multi-turn conversational editing for iterative image refinement.
Automatic thinking mode detection for complex spatial and stylistic prompts.