GitHub topics: gui-agent
open-compass/MMBench-GUI
Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent with a hierarchical manner across multiple platforms, including Windows, Linux, macOS, iOS, Android and Web.
Language: Python - Size: 15.9 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 40 - Forks: 1

bytedance/UI-TARS-desktop
The Open-sourced Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.
Language: TypeScript - Size: 173 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 15,277 - Forks: 1,360

Yah185/open-source-operator
Create your self-hosted, open-source Operator model.
Size: 1000 Bytes - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

jamal22552/AI-Infra
Explore the AI-Infra repository for a structured learning path and a visual landscape of modern AI infrastructure in Kubernetes and cloud-native ecosystems. 🌐💻
Size: 385 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

TongUI-agent/TongUI-agent
Release of code, datasets and model for our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials
Language: HTML - Size: 3.78 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 38 - Forks: 3

wendell0218/GVA-Survey
Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"
Size: 6.13 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 52 - Forks: 1

V-Droid-Agent/V-Droid
Source code of the paper "V-Droid: Advancing Mobile GUI Agent Through Generative Verifiers"
Language: Python - Size: 447 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 3 - Forks: 0

showlab/WorldGUI
Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.
Language: Python - Size: 74.6 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 80 - Forks: 7

trycua/acu
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
Size: 137 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 1,306 - Forks: 90

ritzz-ai/GUI-R1
Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents
Language: Python - Size: 974 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 112 - Forks: 11

lll6gg/UI-R1
Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"
Language: Python - Size: 1.04 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 104 - Forks: 6

showlab/ShowUI
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Language: Python - Size: 26.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1,254 - Forks: 84

THUDM/CogAgent
An open-sourced end-to-end VLM-based GUI Agent
Language: Python - Size: 5.11 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 947 - Forks: 74

OS-Agent-Survey/OS-Agent-Survey
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
Size: 11.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 271 - Forks: 12

iMeanAI/open-source-operator
Create your self-hosted, open-source Operator model.
Language: Python - Size: 1.92 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 91 - Forks: 5
