An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: gui-agent

open-compass/MMBench-GUI

Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent with a hierarchical manner across multiple platforms, including Windows, Linux, macOS, iOS, Android and Web.

Language: Python - Size: 15.9 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 40 - Forks: 1

bytedance/UI-TARS-desktop

The Open-sourced Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.

Language: TypeScript - Size: 173 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 15,277 - Forks: 1,360

Yah185/open-source-operator

Create your self-hosted, open-source Operator model.

Size: 1000 Bytes - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

jamal22552/AI-Infra

Explore the AI-Infra repository for a structured learning path and a visual landscape of modern AI infrastructure in Kubernetes and cloud-native ecosystems. 🌐💻

Size: 385 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

TongUI-agent/TongUI-agent

Release of code, datasets and model for our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials

Language: HTML - Size: 3.78 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 38 - Forks: 3

wendell0218/GVA-Survey

Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"

Size: 6.13 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 52 - Forks: 1

V-Droid-Agent/V-Droid

Source code of the paper "V-Droid: Advancing Mobile GUI Agent Through Generative Verifiers"

Language: Python - Size: 447 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 3 - Forks: 0

showlab/WorldGUI

Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

Language: Python - Size: 74.6 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 80 - Forks: 7

trycua/acu

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

Size: 137 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 1,306 - Forks: 90

ritzz-ai/GUI-R1

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

Language: Python - Size: 974 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 112 - Forks: 11

lll6gg/UI-R1

Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"

Language: Python - Size: 1.04 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 104 - Forks: 6

showlab/ShowUI

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Language: Python - Size: 26.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1,254 - Forks: 84

THUDM/CogAgent

An open-sourced end-to-end VLM-based GUI Agent

Language: Python - Size: 5.11 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 947 - Forks: 74

OS-Agent-Survey/OS-Agent-Survey

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".

Size: 11.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 271 - Forks: 12

iMeanAI/open-source-operator

Create your self-hosted, open-source Operator model.

Language: Python - Size: 1.92 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 91 - Forks: 5