01High-quality image generation and editing using specialized Gemini models
02Long-form media support with up to 2M token context windows
03Advanced OCR and structured data extraction from complex forms and tables
04Automated media optimization and batch processing utility scripts
05Multimodal analysis for audio, video, images, and PDF documents
060 GitHub stars