关于
Gemini Document Processing is a specialized skill designed to leverage Google Gemini's multimodal power for high-fidelity document analysis. Unlike standard OCR, this skill understands the visual context of PDFs—including charts, tables, diagrams, and images—for up to 1,000 pages. It is ideal for developers needing to automate data extraction into validated JSON formats, generate context-aware summaries of long reports, or build intelligent Q&A systems on top of unstructured documentation. With built-in support for both small inline files and large documents via the Google File API, it provides a robust framework for professional-grade document intelligence pipelines.