01Image description using Anthropic Claude Vision or OpenAI GPT-4 Vision
02Base64 and file-based image input support
03Configurable primary and fallback providers
046 GitHub stars
05Supports multiple image formats (JPEG, PNG, GIF, WebP)
06Optional text extraction using Tesseract OCR