Generate detailed captions, alt text, and structured JSON metadata for images using large vision models.
CV is a minimal MCP server designed for advanced computer vision tasks, focusing on image recognition and metadata generation. It leverages AI models via OpenRouter (or local backends) to process images, generating descriptive captions, concise alt text, and comprehensive structured JSON metadata. Built to be tiny and composable, it integrates seamlessly into MCP clients like Claude Desktop, providing powerful image understanding capabilities without complex database or application logic.