Provides AI-powered image and video analysis capabilities through a Model Context Protocol server, leveraging Google Gemini and Vertex AI models.
The AI Vision MCP server gives Model Context Protocol clients image and video analysis backed by Google Gemini and Vertex AI models. It supports multimodal analysis, accepts files from several sources (URLs, local files, base64), and integrates with Google Cloud Storage for file storage. The server is written in TypeScript with strict type checking, validates inputs with Zod, and handles errors with retries and circuit breakers.
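
As a rough illustration of handling files from several sources, the sketch below shows one way an input string could be classified as a URL, a base64 data URI, or a local file path. The names `ImageSource` and `classifySource` are hypothetical and are not taken from the server's code; the rules (only `http`/`https` count as URLs, base64 must arrive as a data URI, everything else is treated as a path) are assumptions made for the example.

```typescript
// Hypothetical types and function names; a sketch, not the server's actual API.
type ImageSource =
  | { kind: "url"; url: string }
  | { kind: "base64"; mimeType: string; data: string }
  | { kind: "file"; path: string };

function classifySource(input: string): ImageSource {
  const trimmed = input.trim();
  if (trimmed.length === 0) {
    throw new Error("Image source must not be empty");
  }

  // Remote files: only http(s) URLs are treated as URLs in this sketch.
  if (/^https?:\/\//i.test(trimmed)) {
    return { kind: "url", url: trimmed };
  }

  // Inline data: a data URI such as "data:image/png;base64,iVBORw0...".
  const match = /^data:([\w.+-]+\/[\w.+-]+);base64,([A-Za-z0-9+/=]+)$/.exec(trimmed);
  if (match) {
    const [, mimeType = "", data = ""] = match;
    return { kind: "base64", mimeType, data };
  }

  // Assumption: anything else is a local file path.
  return { kind: "file", path: trimmed };
}
```

A caller could switch on the returned `kind` to decide whether to fetch the file, decode it, or read it from disk. A discriminated union keeps that branching checked by the TypeScript compiler.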