Overview
Many LLM vision systems automatically downscale large input images, which loses detail and can make small text unreadable. This tool is an MCP server that splits large images and full-page screenshots into smaller tiles sized for the target model. It calculates a grid from the target LLM's vision configuration, extracts each tile, and returns per-tile metadata, including token estimates, so that vision models can process the image at full resolution instead of a downscaled copy.
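
The grid calculation can be illustrated with a minimal sketch. This is not the server's actual implementation: the names `Tile`, `plan_grid`, and `estimate_tokens`, the tile size of 1024 pixels, and the flat per-tile token cost are all assumptions chosen for illustration. Real values depend on the target model's vision configuration.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Tile:
    """Position and size of one tile, in source-image pixels (hypothetical type)."""
    row: int
    col: int
    x: int
    y: int
    width: int
    height: int


def plan_grid(image_width: int, image_height: int, tile_size: int) -> list[Tile]:
    """Cover the image with a row-major grid of tiles no larger than tile_size.

    Tiles on the right and bottom edges are clipped to the image bounds,
    so no tile is padded or scaled.
    """
    if image_width <= 0 or image_height <= 0 or tile_size <= 0:
        raise ValueError("dimensions and tile_size must be positive")

    cols = math.ceil(image_width / tile_size)
    rows = math.ceil(image_height / tile_size)

    tiles = []
    for row in range(rows):
        for col in range(cols):
            x = col * tile_size
            y = row * tile_size
            tiles.append(
                Tile(
                    row=row,
                    col=col,
                    x=x,
                    y=y,
                    width=min(tile_size, image_width - x),
                    height=min(tile_size, image_height - y),
                )
            )
    return tiles


def estimate_tokens(tiles: list[Tile], tokens_per_tile: int) -> int:
    """Rough total token estimate, assuming a flat (hypothetical) cost per tile."""
    return len(tiles) * tokens_per_tile


# Example: a 2500x1000 screenshot with an assumed tile size of 1024 pixels.
tiles = plan_grid(2500, 1000, tile_size=1024)
total = estimate_tokens(tiles, tokens_per_tile=1000)  # assumed cost, not a real model's
```

Each `Tile` maps directly to a crop box. With Pillow, for instance, a tile could be extracted via `image.crop((t.x, t.y, t.x + t.width, t.y + t.height))`. Clipping edge tiles rather than padding them keeps every pixel at its original resolution, at the cost of tiles with uneven sizes.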