01Enables LLM control over GUI applications via agentic workflows
02Implements modular cognitive layers (Perception, Memory, Decision-Making, Action) using Pydantic for agent architecture
03Explores advanced prompt engineering for predictable LLM outputs and multi-turn interactions
04Automates MS Paint interactions without direct API access
050 GitHub stars
06Integrates with Gemini API for LLM capabilities and uses Pywinauto for Windows GUI automation