Image Recognition
Createdmario-andreschak
Provides image recognition capabilities by leveraging Anthropic and OpenAI vision APIs.
About
Empower your applications with image understanding using this server, which offers detailed image descriptions powered by Anthropic Claude Vision or OpenAI GPT-4 Vision. It supports various image formats, including JPEG, PNG, GIF, and WebP, and offers both base64 and file-based image input. Enhance your analysis further with optional text extraction using Tesseract OCR. Configurable primary and fallback providers ensure reliability and flexibility in your image recognition workflows.
Key Features
- Image description using Anthropic Claude Vision or OpenAI GPT-4 Vision
- Base64 and file-based image input support
- Configurable primary and fallback providers
- 6 GitHub stars
- Supports multiple image formats (JPEG, PNG, GIF, WebP)
- Optional text extraction using Tesseract OCR
Use Cases
- Automated image analysis for content moderation
- Extracting text from images for data entry
- Generating image captions for accessibility