CV
bysamhains
0Generate detailed captions, alt text, and structured JSON metadata for images using large vision models.
Acerca de
CV is a minimal MCP server designed for advanced computer vision tasks, focusing on image recognition and metadata generation. It leverages AI models via OpenRouter (or local backends) to process images, generating descriptive captions, concise alt text, and comprehensive structured JSON metadata. Built to be tiny and composable, it integrates seamlessly into MCP clients like Claude Desktop, providing powerful image understanding capabilities without complex database or application logic.
Características Principales
- Generates alt text, dense captions, and structured JSON metadata for images.
- Leverages large vision models via OpenRouter or local backends.
- Supports processing both remote image URLs and local file paths.
- Provides a minimal, composable MCP server for various computer vision tasks.
- Highly configurable for model selection, backend preference, and metadata generation modes.
- 0 GitHub stars
Casos de Uso
- Automate the creation of detailed image captions and alt text for web content or accessibility.
- Integrate AI-powered image analysis and metadata generation into conversational AI agents.
- Generate structured JSON metadata from images for cataloging, search, or content management applications.