01Deep Video Analysis for up to 1 hour of content with timestamped insights
02Native Multi-page PDF Document Extraction and structural analysis
030 GitHub stars
04Granular Media Resolution Control (low, medium, high) for token optimization
05Advanced Image Understanding including object detection and high-fidelity OCR
06Long-form Audio Processing supporting up to 9.5 hours of speech and transcription