011 GitHub stars
02Offers customizable tiling parameters, including target model, tile size, output format (WebP/PNG), and image pre-downscaling.
03Captures full-page screenshots from URLs, including scroll-stitching for pages over 16,384px.
04Prevents LLM vision downscaling by intelligently tiling large images and screenshots.
05Provides paginated retrieval of image tiles, complete with content hints and brightness statistics for each tile.
06Compares and estimates tile counts and vision tokens for various LLM models (Claude, OpenAI, Gemini).