01Automate page navigation, input (click, type, scroll), and take screenshots.
02Securely connect to existing or launch new isolated Chromium browser sessions (Brave, Chrome, Edge).
03Extract lightweight semantic snapshots of web pages and perform precise element interactions.
04Manage and automatically clean up browser tabs opened by the AI agent upon disconnection.
050 GitHub stars
06Perform comprehensive PDF text extraction from local files, URLs, or the current tab.