GitHub topics: computer-using-agent
Clad3815/open-computer-use
AI-powered assistant that controls a Windows environment through Docker, allowing automated interaction with the desktop interface. Control your computer with natural language.
Language: JavaScript - Size: 128 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 4 - Forks: 1

OS-Copilot/ScienceBoard
Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"
Language: Python - Size: 2.98 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 71 - Forks: 4

reidbarber/webmarker
Mark web pages for use with vision-language models
Language: TypeScript - Size: 681 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 40 - Forks: 3

a-real-ai/pywinassistant
The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical user interfaces (GUIs) using only natural language. Uses Visualization-of-Thought and Chain-of-Thought reasoning to elicit spatial reasoning and perception, and emulates, plans, and simulates synthetic HID interactions.
Language: Python - Size: 681 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 1,301 - Forks: 186

OS-Agent-Survey/OS-Agent-Survey
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
Size: 11.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 271 - Forks: 12

thethinkmachine/4o-agent
A fully autonomous command-line computer-using agent based on ReAct principles
Language: Python - Size: 72.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Mihonarium/food_ordering_agent
Use an LLM agent to automate ordering food and other items from Deliveroo, Uber Eats, DoorDash, etc.
Language: Python - Size: 31.3 KB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 0 - Forks: 1
