01Automated document, chart, and diagram analysis with structured OCR
02Native video frame analysis and object detection with bounding boxes
03Advanced multi-image comparison and visual reasoning patterns
04Optimized PDF handling strategies for large-scale document processing
0569 GitHub stars
06Multi-provider support for GPT-5, Claude 4.5, Gemini 2.5/3, and Grok 4