Vision Relay wraps locally installed multimodal CLIs, specifically Google Gemini CLI and Qwen CLI, into a Model Context Protocol (MCP) server. This innovative approach allows MCP-compatible tools, such as Claude and Codex, to directly leverage the powerful image analysis features of these underlying models. It offers an efficient and cost-effective solution for enriching the functionality of text-only AI deployments with visual understanding, providing multimodal capabilities without the need to switch primary AI providers.