01Multi-platform desktop control (click, type, scroll, drag) for Linux, Windows, and macOS.
02Vision-based UI automation utilizing screenshot analysis and advanced grounding models.
03Seamless integration with 100+ LLM providers including Anthropic, OpenAI, and local models.
04Built-in cost control and trajectory budgeting for autonomous task execution.
05Automated sandbox management for creating, starting, and stopping virtual execution environments.
060 GitHub stars