01 0 GitHub stars
02 LLM-native design for minimal tokens and round-trips
03 Advanced 3-tier token reduction including P-frame diffing for screenshots
04 Full CJK support with IME bypass input
05 Precise 1:1 coordinate mode for direct pixel-to-screen mapping
06 UIA element extraction for semantic interaction without image analysis
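
The P-frame diffing mentioned in item 03 borrows a video-codec idea: send one full screenshot, then send only the regions that changed. The list does not say how the project implements this, so the following is only a minimal sketch of the general technique. The frame representation (rows of grayscale integers), the function name `changed_tiles`, and the `tile` parameter are all hypothetical and chosen for illustration.

```python
from typing import List, Tuple

# Hypothetical frame representation: a list of rows of grayscale pixel values.
Frame = List[List[int]]


def changed_tiles(prev: Frame, curr: Frame, tile: int = 2) -> List[Tuple[int, int]]:
    """Return the (x, y) origin of every tile that differs between two frames.

    Illustrative only: a real implementation would work on image buffers
    and would likely merge adjacent dirty tiles into larger rectangles.
    """
    if len(prev) != len(curr) or any(len(a) != len(b) for a, b in zip(prev, curr)):
        raise ValueError("frames must have identical dimensions")

    height = len(curr)
    width = len(curr[0]) if height else 0
    dirty: List[Tuple[int, int]] = []

    for ty in range(0, height, tile):
        for tx in range(0, width, tile):
            rows = range(ty, min(ty + tile, height))
            # A tile is dirty if any of its row slices differ.
            if any(prev[y][tx:tx + tile] != curr[y][tx:tx + tile] for y in rows):
                dirty.append((tx, ty))
    return dirty


# Usage: a 4x4 frame where a single pixel changes.
previous = [[0] * 4 for _ in range(4)]
current = [row[:] for row in previous]
current[3][2] = 255
dirty = changed_tiles(previous, current)
```

Only the tiles listed in `dirty` would need to be re-encoded and sent, which is where the token saving would come from when most of the screen is unchanged between captures.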