Enables high-quality text and image generation using a reverse-engineered Gemini Web API interface.
This skill provides a robust bridge to the Gemini Web API, allowing users to leverage Google's latest models for text generation, high-fidelity image creation, and vision-based tasks directly within their AI workflow. It supports multi-turn conversations through sophisticated session management, handles reference images for visual analysis, and automates cookie-based authentication via local browsers, making it an ideal choice for developers seeking Gemini's capabilities without the friction of standard API management.
主要功能
01Automated browser-based authentication and secure cookie caching
02Advanced text and image generation via Gemini 3 Pro and Flash models
030 GitHub stars
04Persistent multi-turn conversations with automated session management
05Vision input support using reference images for AI-driven visual analysis
06Flexible output options supporting both standard text and structured JSON
使用场景
01Analyzing and describing local image files using Gemini's advanced vision capabilities
02Generating high-quality AI images from descriptive text prompts for design workflows
03Maintaining context-aware AI chat sessions for complex, multi-step engineering tasks