01Advanced document and chart analysis with structured OCR patterns
0269 GitHub stars
03Cross-provider support for GPT-5, Claude 4.5, Gemini 2.5/3, and Grok 4
04Native video frame processing and multi-image reasoning capabilities
05Spatial awareness tools for bounding box and object detection
06Optimized token management and resolution-based cost efficiency patterns