GitHub topics: multimodal-agent
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Language: Python - Size: 383 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 4,034 - Forks: 404

om-ai-lab/OmAgent
Build multimodal language agents for fast prototype and production
Language: Python - Size: 11.4 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 2,465 - Forks: 271

bz-lab/AUITestAgent
AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification.
Size: 368 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 148 - Forks: 10

bigai-nlco/ExoViP
[COLM 2024] ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Language: Python - Size: 35.8 MB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 0
