GitHub topics: computer-use
bytebot-ai/bytebot
Bytebot is the container for desktop agents.
Language: TypeScript - Size: 33.2 MB - Last synced at: about 7 hours ago - Pushed at: about 7 hours ago - Stars: 724 - Forks: 48

web-infra-dev/midscene
Your AI Operator for Web, Android, Automation & Testing.
Language: TypeScript - Size: 380 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 9,229 - Forks: 595

e2b-dev/open-computer-use
AI computer use powered by open source LLMs and E2B Desktop Sandbox
Language: Python - Size: 1.58 MB - Last synced at: 1 day ago - Pushed at: 15 days ago - Stars: 1,286 - Forks: 177

inclusionAI/AWorld
Build, evaluate and run General Multi-Agent Assistance with ease
Language: Python - Size: 156 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 263 - Forks: 14

alaa-nadi/UI-TARS-desktop
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
Language: TypeScript - Size: 37.6 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2 - Forks: 0

bilalonur/awesome-llm-os
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
Size: 610 KB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 111 - Forks: 7

simular-ai/Agent-S
Agent S: an open agentic framework that uses computers like a human
Language: Python - Size: 39.4 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 5,465 - Forks: 559

bytedance/UI-TARS-desktop
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
Language: TypeScript - Size: 137 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 14,637 - Forks: 1,237

baryhuang/mcp-remote-macos-use
The only general AI agent that does NOT require an extra API key, giving you full control of your local and remote macOS machines from the Claude Desktop app
Language: Python - Size: 270 KB - Last synced at: 2 days ago - Pushed at: 9 days ago - Stars: 134 - Forks: 20

AB498/computer-control-mcp
MCP server that provides computer control capabilities, like mouse, keyboard, OCR, etc. using PyAutoGUI, RapidOCR, ONNXRuntime. Similar to 'computer-use' by Anthropic. With Zero External Dependencies.
Language: Python - Size: 5.11 MB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 13 - Forks: 1

TongUI-agent/TongUI-agent
Release of code, datasets and model for our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials
Language: HTML - Size: 3.84 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 29 - Forks: 3

trycua/acu
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
Size: 137 KB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 1,306 - Forks: 90

Upsonic/Upsonic
The most reliable AI agent framework that supports MCP.
Language: Python - Size: 3.88 MB - Last synced at: 5 days ago - Pushed at: 13 days ago - Stars: 7,524 - Forks: 702

trycua/cua
c/ua is the Docker Container for Computer-Use AI Agents.
Language: Python - Size: 5.65 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 8,560 - Forks: 366

cyberdesk-hq/cyberdesk
Open source virtual desktops for AI agents
Language: JavaScript - Size: 19.2 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 118 - Forks: 18

microsoft/WindowsAgentArena
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Language: Python - Size: 191 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 708 - Forks: 70

BandarLabs/clickclickclick
A framework to enable autonomous Android and computer use using any LLM (local or remote)
Language: Python - Size: 78.1 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 452 - Forks: 57

tysonthomas9/browser-operator-devtools-frontend (fork of ChromeDevTools/devtools-frontend)
Browser Operator - The Chromium browser with built-in Multi-Agent
Language: TypeScript - Size: 958 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 71 - Forks: 8

presidio-oss/factif-ai
AI-powered computer control for automated testing. Factifai uses vision models (Claude, GPT-4o, Gemini) to interact with applications naturally - clicking, typing, and verifying results just like a human would.
Language: TypeScript - Size: 108 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 37 - Forks: 24

Rajaniraiyn/ccu
Anthropic's Computer Use tools within VSCode
Language: TypeScript - Size: 55.7 KB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

someaka/wayland-mcp
MCP Server for Wayland
Language: Python - Size: 16 MB - Last synced at: about 22 hours ago - Pushed at: 2 months ago - Stars: 3 - Forks: 1

runamu/monday
[CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
Language: Python - Size: 1.59 MB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 15 - Forks: 0

SawyerHood/computer-use-extension
This is OpenAI's computer use hooked up to a Chrome extension.
Language: TypeScript - Size: 117 KB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 17 - Forks: 1

pnmartinez/simple-computer-use
Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.
Language: Python - Size: 385 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 25 - Forks: 1

reidbarber/webmarker
Mark web pages for use with vision-language models
Language: TypeScript - Size: 681 KB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 40 - Forks: 3

presidio-oss/factifai-agent-suite
AI-powered computer control for automated testing in your CI/CD pipelines. Factifai agent uses vision models (Claude, GPT-4o) to interact with applications naturally - clicking, typing, and verifying results just like a human would.
Language: TypeScript - Size: 88.4 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 8 - Forks: 3

OpenAdaptAI/OpenAdapt
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
Language: Python - Size: 28.9 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 1,277 - Forks: 181

aicompanionx/XBuddy-Desktop-Electron
A highly interesting cryptocurrency AI assistant
Language: TypeScript - Size: 247 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

showlab/ShowUI
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Language: Python - Size: 26.9 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 1,254 - Forks: 84

THUDM/CogAgent
An open-sourced end-to-end VLM-based GUI Agent
Language: Python - Size: 5.11 MB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 947 - Forks: 74

A9T9/RPA
Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use/LLM. Selenium IDE import/export.
Language: JavaScript - Size: 13.1 MB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 1,564 - Forks: 327

auto-browse/auto-browse-ts
Auto-Browse: AI Enabled Browser Automation
Language: TypeScript - Size: 750 KB - Last synced at: 1 day ago - Pushed at: 2 months ago - Stars: 6 - Forks: 0

aditya-nadkarni/spongecake
Spongecake is the easiest way to launch computer use agents.
Language: JavaScript - Size: 27.7 MB - Last synced at: 24 days ago - Pushed at: about 2 months ago - Stars: 138 - Forks: 18

OS-Agent-Survey/OS-Agent-Survey
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
Size: 11.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 271 - Forks: 12

ArchiveBox/abx-spec-behaviors
🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, Puppeteer, Playwright, extensions, AI tools, and many other contexts with minimal adjustment.
Language: JavaScript - Size: 785 KB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 18 - Forks: 0

Optexity/ComputerGYM
Foundation Model Training Using Human Demonstrations
Language: Python - Size: 291 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 104 - Forks: 4

ashbuilds/computer-use
Anthropic's Computer Use implementation in Node.js
Language: TypeScript - Size: 81.1 KB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 8 - Forks: 2

iris-networks/iris
This is the CRUD backend for our QA test application
Language: TypeScript - Size: 14.6 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 18 - Forks: 2

suitedaces/computer-agent
Desktop app powered by Claude’s computer use capability to control your computer
Language: Python - Size: 108 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 436 - Forks: 43

nottelabs/open-operator-evals
Open-source benchmark evaluating the performance of web operators/agents
Language: Python - Size: 141 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 10 - Forks: 0

philfung/awesome-computer-use
Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.
Size: 24.4 KB - Last synced at: 8 days ago - Pushed at: 7 months ago - Stars: 22 - Forks: 2

Justmalhar/claude-ubuntu-os
Claude Computer Use API with Ubuntu that enables Claude to interact with and automate desktop environments. It allows seamless command execution through VNC or noVNC, enhancing productivity with secure, containerized workflows in GitHub Codespaces.
Language: Python - Size: 15.3 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 10 - Forks: 1

Clad3815/open-computer-use
AI-powered assistant that controls a Windows environment through Docker, allowing automated interaction with the desktop interface. Control your computer with natural language.
Language: JavaScript - Size: 110 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

cloudycotton/browser-operator
Build your own AI operators like OpenAI
Language: TypeScript - Size: 33.2 KB - Last synced at: 23 days ago - Pushed at: 5 months ago - Stars: 7 - Forks: 0

lvqq/intelli-browser
✨ Use natural language to control your browser, powered by LLM and Playwright
Language: TypeScript - Size: 9.78 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 45 - Forks: 3

chatsci/Aeiva
A general AI agent framework that can be adapted to various tasks and environments.
Language: Python - Size: 85.8 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 100 - Forks: 16

webhiveos/WebHive
Meet WebHive, the AI-powered browser that takes care of tasks for you. No more endless clicks, tell it what you need, and it gets it done.
Language: Python - Size: 1.72 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 11 - Forks: 0

SALT-NLP/PopupAttack
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
Language: Python - Size: 195 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 26 - Forks: 1

Mihonarium/food_ordering_agent
Use an LLM agent to automate ordering food and other items from Deliveroo, Uber Eats, DoorDash, etc.
Language: Python - Size: 31.3 KB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

mubashir1osmani/m4
Build custom ASICs and FPGAs using LLMs.
Language: Python - Size: 50.5 MB - Last synced at: 18 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

philfung/computer-use
Try Computer Use on your Mac with a few clicks
Language: Python - Size: 105 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 24 - Forks: 2

lx-0/computer-use-nodejs-demo
🤖 LLM-powered computer control through local and Docker environments. Features VNC integration, automated interactions, and a chat interface for natural language system control.
Language: TypeScript - Size: 1.16 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 6 - Forks: 1

nicholasoxford/computer-use-mac-demo
Anthropic's computer use controlling a MacBook
Language: Python - Size: 2.99 MB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 3 - Forks: 1
