01Generates photorealistic smartphone photos of logistics documents with various defects (e.g., blur, folds, stains).
02Produces verified ground truth for every field and an occlusion manifest for each generated photo variation.
03Automates realistic document generation (PDF) from JSON, existing PDFs, or natural language descriptions.
04Provides multiple interfaces: CLI, Python API, FastAPI REST API, MCP Server tools, and Agent SDK plugin.
05Supports natural language prompts and a detailed schema for configuring photo variations and camera presets.
063 GitHub stars