Provides image recognition capabilities by leveraging Anthropic and OpenAI vision APIs.
Empower your applications with image understanding using this server, which offers detailed image descriptions powered by Anthropic Claude Vision or OpenAI GPT-4 Vision. It supports various image formats, including JPEG, PNG, GIF, and WebP, and offers both base64 and file-based image input. Enhance your analysis further with optional text extraction using Tesseract OCR. Configurable primary and fallback providers ensure reliability and flexibility in your image recognition workflows.