Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: gpt4v

soulteary/amazing-openai-api

Convert different model APIs into the OpenAI API format out of the box.

Language: Go - Size: 463 KB - Last synced: about 23 hours ago - Pushed: 3 months ago - Stars: 126 - Forks: 10

easonlai/webcam_chat_with_aoai_gpt4o

Discover the GPT-4o multimodal model at Microsoft Build 2024, now with text and image capabilities. My prototype enhances chats with real-time camera snapshots, powered by Flask, OpenCV, and Azure’s OpenAI Services. It’s interactive, visual, and simple to use. Give it a try!

Language: HTML - Size: 2.03 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 2 - Forks: 0

X-PLUG/MobileAgent

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

Language: Python - Size: 20.4 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 1,904 - Forks: 159

BUAADreamer/Chinese-LLaVA-Med

Chinese medical multimodal large model: Large Chinese Language-and-Vision Assistant for BioMedicine

Language: Python - Size: 2.26 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 8 - Forks: 0

reworkd/tarsier

Vision utilities for web interaction agents 👀

Language: Jupyter Notebook - Size: 1.6 GB - Last synced: 16 days ago - Pushed: 17 days ago - Stars: 898 - Forks: 47

langgptai/Awesome-Multimodal-Prompts

Prompts for GPT-4V and DALL-E3 that fully utilize their multimodal abilities. GPT4V Prompts, DALL-E3 Prompts.

Size: 87.3 MB - Last synced: 12 days ago - Pushed: 7 months ago - Stars: 190 - Forks: 15

AmberSahdev/Open-Interface

Control Any Computer Using LLMs

Language: Python - Size: 99.2 MB - Last synced: 21 days ago - Pushed: 21 days ago - Stars: 357 - Forks: 18

martintmv-git/gpt4v-streamlit-voiceover

AI Voiceover with GPT4V

Language: Jupyter Notebook - Size: 5.47 MB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 21 - Forks: 6

kyegomez/MambaByte

Implementation of MambaByte from "MambaByte: Token-free Selective State Space Model" in PyTorch and Zeta

Language: Python - Size: 2.17 MB - Last synced: 22 days ago - Pushed: 3 months ago - Stars: 78 - Forks: 3

tiwater/flowgen

AutoGen Visualized - Visual Tools for Multi-Agent Development.

Language: TypeScript - Size: 47.8 MB - Last synced: 29 days ago - Pushed: 29 days ago - Stars: 91 - Forks: 10

roboflow/gpt-checkup

Monitor the performance of OpenAI's GPT-4V model over time.

Language: HTML - Size: 22 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 19 - Forks: 5

neka-nat/mylangrobot

Language instructions to mycobot using GPT-4V

Language: Python - Size: 3.52 MB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 13 - Forks: 0

elizabethsiegle/predict-bball-shot-sms-gpt4v

Language: JavaScript - Size: 1.63 MB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

kyegomez/HRTX

Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2

Language: Python - Size: 2.19 MB - Last synced: 22 days ago - Pushed: 3 months ago - Stars: 15 - Forks: 2

cameronking4/sketch2app

The ultimate sketch-to-code app made using GPT4 vision. Choose your desired framework (React, Next, React Native, Flutter) for your app. It instantly generates code and a preview (sandbox) from a simple hand-drawn sketch on paper captured from a webcam.

Language: JavaScript - Size: 73.4 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 38 - Forks: 8

metatatt/iso_bot

ISO 13485 Sniffer Bot, GPT4V with LlamaIndex embedded in React Bot UI

Language: TypeScript - Size: 191 KB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

mnotgod96/AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language: Python - Size: 2.83 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 3,957 - Forks: 407

gpt4api9/gpt4api9

Sparrow GPTs-API Marketplace

Size: 281 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 2 - Forks: 0

Ravi-Teja-konda/TunedLlavaDelights

Explore the rich flavors of Indian desserts with TunedLlavaDelights. Using Llava fine-tuning, our project unveils detailed nutritional profiles, taste notes, and optimal consumption times for beloved sweets. Dive into a fusion of AI innovation and culinary tradition.

Language: Python - Size: 43.2 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

sagentic-ai/cupid

Valentine's Day Cupid Agent

Language: TypeScript - Size: 39.1 KB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 2

dceluis/vacocam_render

Vision-Assisted Camera Orientation

Language: Jupyter Notebook - Size: 184 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 4 - Forks: 0

elizabethsiegle/stephensmithify-openaivision-sendgrid

Analyze a video and generate commentary about it with OpenAI's GPT-4V, text-to-speech, LangChain, Streamlit, Replit, Twilio SendGrid, and OpenCV!

Language: Python - Size: 199 MB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 5 - Forks: 1

Envedity/DAIA

Digital Artificial Intelligence Agent

Language: Python - Size: 3.35 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

pAIrprogio/vscode-ui-sketcher

Draw your projects to life

Language: TypeScript - Size: 1.58 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 160 - Forks: 8

jamesponddotco/allalt

[READ-ONLY] Describe images and generate alt tags for visually impaired users.

Language: Go - Size: 23.4 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

logicalroot/gpt-4v-demos

🤖 GPT-4V Demos • Test the model's vision capabilities in your browser using Streamlit • Easy setup

Language: Python - Size: 1.8 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 2 - Forks: 2

admineral/GPT4-Vision-React-Starter

Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description

Language: TypeScript - Size: 256 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 21 - Forks: 18

GraphPKU/CoI

Chain of Images for Intuitively Reasoning

Language: Python - Size: 5.17 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 2 - Forks: 1

yunwoong7/GPT-4V-Examples

Explore the power of GPT-4V with our curated examples and tutorials. This repository offers code snippets, step-by-step guides, and use case demonstrations for integrating GPT-4V into various applications. Perfect for both AI novices and experts!

Language: Jupyter Notebook - Size: 3.52 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

zzxslp/MM-Navigator

Size: 28.4 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 42 - Forks: 1

bdekraker/WebcamGPT-Vision

Lightweight GPT-4 Vision processing over the Webcam

Language: JavaScript - Size: 34.2 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 127 - Forks: 15

danomation/discord-vision

Proof-of-concept GPT-4 Vision bot

Language: Python - Size: 6.84 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 4 - Forks: 0