01Comprehensive device control including swipes, key events, and app launching
02UI-aware screenshots that map element coordinates to text labels
03Human-readable interactions including 'tap by text' and 'fill form'
040 GitHub stars
05Unified CLI with JSON output optimized for LLM consumption
06Reliable synchronization with wait-for logic for UI state changes