010 GitHub stars
02Enables AI agents to see, tap, type, and navigate any mobile application.
03Supports cross-platform mobile interaction on Android and iOS devices, including cloud devices via Appium.
04Offers detailed situation reports after actions, screen exploration with navigation graph mapping, and network capture.
05Provides three core MCP tools: `drengr_look` for screen observation, `drengr_do` for actions, and `drengr_query` for information retrieval.
06Utilizes a vision-first approach with annotated screenshots and an efficient text mode (approx. 300 tokens per screen).