Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: gpt-4-vision
lancedb/vectordb-recipes
High quality resources & applications for LLMs, multi-modal models and VectorDBs
Language: Jupyter Notebook - Size: 82.5 MB - Last synced: about 15 hours ago - Pushed: 1 day ago - Stars: 457 - Forks: 82
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.
Language: TypeScript - Size: 140 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 31,484 - Forks: 7,390
davidmigloz/pixels2flutter
Convert a screenshot to a working Flutter app.
Language: Dart - Size: 17.8 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 139 - Forks: 35
Skythinker616/gpt-assistant-android
免费的ChatGPT API的安卓语音助手,可用音量键唤起并进行语音交流,支持联网、Vision拍照识图、提问模板等功能 | A free ChatGPT API voice assistant for Android, activated via volume keys for voice interaction, supporting features such as network connectivity, Vision photo recognition, and question templates.
Language: Java - Size: 72.4 MB - Last synced: 5 days ago - Pushed: 11 days ago - Stars: 552 - Forks: 67
Helltar/artific_intellig_bot
AI Telegram Bot, ChatGPT, Dalle2, Whisper, GPT-4 Vision, Stability AI
Language: Kotlin - Size: 627 KB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 19 - Forks: 2
roboflow/multimodal-maestro
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
Language: Python - Size: 5.12 MB - Last synced: 5 days ago - Pushed: 4 months ago - Stars: 966 - Forks: 68
tbckr/sgpt
SGPT is a command-line tool that provides a convenient way to interact with OpenAI models, enabling users to run queries, generate shell commands and produce code directly from the terminal.
Language: Go - Size: 1.17 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 181 - Forks: 21
danny-avila/LibreChat
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
Language: TypeScript - Size: 32.9 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 11,885 - Forks: 2,113
sanket98a/MetaData-Extraction
Welcome to GPT-4 Vision Apparel Metadata Extractor! 🌟 Our cutting-edge application leverages the power of GPT-4 to accurately extract detailed metadata from images, focusing specifically on apparel items.
Language: Python - Size: 8.98 MB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
kwishna/openai-smart-vision
AI apps using OpenAI Vision model.
Language: JavaScript - Size: 1.63 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 0 - Forks: 0
dfdggrss/GPT-Desktop-Client
Size: 1000 Bytes - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 0 - Forks: 0
maquenneville/PhotogChauffeur
An OpenAI Vision-powered local image search tool for complex/subjective NL queries
Language: Python - Size: 39.1 KB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 1 - Forks: 0
9vult/Raiha
Raiha Discord Accessibility Bot
Language: TypeScript - Size: 270 KB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 2 - Forks: 5
developersdigest/ai-devices
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
Language: TypeScript - Size: 8.63 MB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 116 - Forks: 19
first-automation/layoutya
いらすとやの画像をレイアウトして画像を作る
Language: Python - Size: 3.73 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 0 - Forks: 0
corbindavenport/alt-text-creator
Browser extension that generates alternate text for images using GPT-4 Vision.
Language: JavaScript - Size: 920 KB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 1 - Forks: 0
sazonovanton/SirChatalot
SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or YandexART for image creation. It can use vision capabilities or tools/functions.
Language: Python - Size: 549 KB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 60 - Forks: 10
TypingMind/typingmind
The most advanced Web UI for AI chat
Language: HTML - Size: 45.4 MB - Last synced: 30 days ago - Pushed: 30 days ago - Stars: 173 - Forks: 68
szczyglis-dev/py-gpt
Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command execution, file upload and download, speech synthesis and recognition, access to Web, memory, prompt presets, plugins, assistants & more. Linux, Windows, Mac.
Language: Python - Size: 31 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 316 - Forks: 67
WisconsinAIVision/ViP-LLaVA
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
Language: Python - Size: 17.4 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 165 - Forks: 8
aymenfurter/copilot-insurance-claim-demo
How a Picture of Car Damage Can File Your Insurance Claim
Language: Java - Size: 21.9 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 4 - Forks: 2
BenderScript/netvision
Network Topology Image Analsysis
Language: Python - Size: 32.4 MB - Last synced: 21 days ago - Pushed: 22 days ago - Stars: 0 - Forks: 0
theonlyamos/cognitrix
AI Agent equipped with tools and extensions
Language: Python - Size: 430 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 1 - Forks: 1
neka-nat/mylangrobot
Language instructions to mycobot using GPT-4V
Language: Python - Size: 3.52 MB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 13 - Forks: 0
ktutak1337/Stellar-Chat
A multi-modal chat application enabling users to create custom agents, and integrate with local LLMs (Local Language Models), as well as OpenAI models.
Language: C# - Size: 104 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 16 - Forks: 2
jahnvisikligar/Computer-Vision
This repo focuses on my works in the field of Computer Vision
Language: Jupyter Notebook - Size: 549 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
kornia/pixie
Pixie: Computer Vision AI Engineer assistant
Size: 12.7 KB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 13 - Forks: 2
reidbarber/gen-ui
Use text or image prompts to generate components and apps built with React.
Language: TypeScript - Size: 490 KB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 8 - Forks: 3
Sacred-G/ai
PDF Chatbot, Image Chatbot, Web-Site Chatbot with a Knowledge base. OpenAI , Memory, PostgreSQL
Language: Python - Size: 37.2 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
kamilkaczmareksolutions/JanAr
JanAr: GUI application leveraging GPT-4-Vision and GPT models to automatically generate engaging social media captions for artwork images. Customized for a glass workshop and picture framing business, it blends artistic insights with effective online engagement strategies.
Language: Python - Size: 614 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
mountaineerbr/shellChatGPT
Shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS. Features LocalAI, Ollama, Gemini, and Mistral integration.
Language: Shell - Size: 756 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 28 - Forks: 2
Elehiggle/ChatGPTMattermostChatbot
An AI-powered Mattermost ChatGPT chatbot that utilizes the OpenAI API to provide helpful, contextual responses to user messages, extract text from links, and describe or generate images. With Docker support!
Language: Python - Size: 470 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
SkalskiP/sports
Cool experiments at the intersection of Computer Vision and Sports ⚽🏃
Language: Jupyter Notebook - Size: 15.1 MB - Last synced: about 2 months ago - Pushed: 6 months ago - Stars: 436 - Forks: 29
ks6088ts-labs/extractor-python
A data extract tool written in Python
Language: Python - Size: 263 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
vdutts7/gpt4V-scraper
AI agent that can SEE 👁️, control, navigate, & do stuff for you on your browser.
Language: JavaScript - Size: 10.4 MB - Last synced: 2 months ago - Pushed: 3 months ago - Stars: 185 - Forks: 19
signebedi/gptty
ChatGPT wrapper in your TTY
Language: Python - Size: 726 KB - Last synced: 5 days ago - Pushed: 3 months ago - Stars: 47 - Forks: 7
mickymultani/GPT-4-Vision-Architecture-Scanner
A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insights and detailed breakdowns in an interactive chat interface.
Language: JavaScript - Size: 103 KB - Last synced: 26 days ago - Pushed: 7 months ago - Stars: 12 - Forks: 2
danomation/CanvasGPT
OpenAI Vision based drawing game.
Language: HTML - Size: 27.3 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
LazaUK/AOAI-GPT4Vision-Streamlit-SDKv1
Using Azure OpenAI deployment of GPT-4 Turbo with Vision to analyse out-of-stock situation in a fictitious retail shop.
Language: Python - Size: 3.8 MB - Last synced: 20 days ago - Pushed: 5 months ago - Stars: 18 - Forks: 5
zhuoooo/GPT4V-Simulator
探索让 GPT-4V(ision) 像人一样思考和测试的实践。比如一个人打开一个网页后,他通过眼睛看到的页面文字或图标,通过页面来识别是否存在布局错误。
Size: 507 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
image-yinyang/public
🖼️☯️ Public endpoint worker
Language: JavaScript - Size: 26.4 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
image-yinyang/genimg
🖼️☯️ Image generation worker
Language: JavaScript - Size: 29.3 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
image-yinyang/ui
🖼️☯️ UI
Language: HTML - Size: 669 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
mapluisch/GPT-4-Vision-for-HoloLens
Capture images with HoloLens and receive descriptive responses from OpenAI's GPT-4V(ision)
Language: ShaderLab - Size: 107 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 7 - Forks: 1
jacobmarks/gpt4-vision-plugin
Chat with your images using GPT-4 Vision!
Language: Python - Size: 9.77 KB - Last synced: about 2 months ago - Pushed: 7 months ago - Stars: 9 - Forks: 2
niawjunior/vision-speak
CameraVision: Capture, Analyze - Seamlessly integrate image analysis using GPT-4 Vision API and convert text to speech with Whisper AI
Language: TypeScript - Size: 1.26 MB - Last synced: about 2 months ago - Pushed: 7 months ago - Stars: 7 - Forks: 1
172478394/chatkore
chatkore为开发者提供优质稳定的OpenAI相关的API调用接口,方便国内用户使用各类开源ChatGPT项目或者AI领域的库的使用。
Size: 5.47 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 8 - Forks: 1
danigleba/solve-stem-problems-with-AI
Take a picture of any stem problem and get a detailed, step-by-step, solution. Built using OpenAI's gpt-4-vision-preview model.
Language: JavaScript - Size: 248 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
aymenfurter/azure-chat-with-your-photos-demo
Chatbot that comprehends uploaded images and engages in detailed conversations about their content.
Language: Bicep - Size: 11.6 MB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 2 - Forks: 0
olafwrieden/ai-pizza-quality-checker
An automated GPT-4 Turbo with Vision-powered pizza quality checker.
Language: TypeScript - Size: 24.4 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
jakobdylanc/Discord-LLM-Chatbot
"llmcord" | Multi-user chat | Choose your LLM | OpenAI API | Mistral API | LM Studio | GPT-4 Turbo with vision | Mixtral 8X7B | And more 🔥
Language: Python - Size: 54.7 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 51 - Forks: 16
PratikDavidson/word-power-ai
Learn Communication Language Effectively with Pictorial Story.
Language: Python - Size: 1.3 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
KF-R/ask
ask is a Python script intended for use at the command line in order to ask the OpenAI API a question, optionally including an image, and have the response read aloud by either fast local TTS or the ElevenLabs API.
Language: Python - Size: 23.4 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
supershaneski/chatgpt-with-image-sample
This sample project integrates OpenAI's GPT-4 Vision, with advanced image recognition capabilities, and DALL·E 3, the state-of-the-art image generation model, with the Chat completions API. This powerful combination allows for simultaneous image creation and analysis.
Language: JavaScript - Size: 2.68 MB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 13 - Forks: 4
franperezlopez/img2card
img2card is a unique Telegram Bot designed to simplify your contact management. It takes facade pictures or contact card pictures as input and generates vCard files
Language: Python - Size: 40 KB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
wfce/ChatGPT-OpenAI-API
全网最低价的OpenAI ChatGPT-4-32K、ChatGPT-3.5 API 最高低于官方价42倍。The lowest-priced OpenAI ChatGPT-4-32K and ChatGPT-3.5 APIs on the entire network are 42 times lower than the official price.
Size: 466 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 2 - Forks: 0
EnspiredjackDev/IDFK_Discord_Bots
All the bots running on the IDFK discord server that ive made most of which being about AI
Language: Python - Size: 69.3 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
c0mm4nd/command-windows
CommandWindows is a desktop opeating system copilot based on multi-modal large language model, supporting all-platforms which have application window
Language: TypeScript - Size: 319 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
GODGOD126/ScreenChat-GPT4
Implementing a Dialogue with the Computer Screen Based on the GPT-4 API
Language: Python - Size: 86.9 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
pranavgupta2603/SpltiwiseGPTVision
SplitwiseGPT Vision: Streamline bill splitting with AI-driven image processing and OCR. This innovative web app uses Pytesseract, GPT-4 Vision, and the Splitwise API to simplify group expense management. Upload bill images, auto-extract details, and seamlessly integrate expenses into Splitwise groups. Ideal for easy and accurate financial tracking
Language: Python - Size: 11.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
jeremy-collins/gpt4v-screenshot-analyzer
Harnessing OpenAI's GPT-4 Vision API, this tool offers an interactive way to analyze and understand your screenshots. Capture any part of your screen and engage in a dialogue with ChatGPT to uncover detailed insights, ask follow-up questions, and explore visual data in a user-friendly format.
Language: Python - Size: 5.86 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 2
scalable-dynamics/gpt-spa
A customizable GPT in a single page, using OpenAI models text-embedding-ada-002, tts-1, whisper-1, dall-e-3, and gpt-4-vision-preview
Language: JavaScript - Size: 65.4 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 6 - Forks: 1
ArtemStepanov/gpt4_vision_telegram_bot Fork of karfly/chatgpt_telegram_bot
Augmented with with a support of GPT 4 Vision
Language: Python - Size: 1 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0
ElDokmak/MultiModal-Models
Hands on some MultiModal Models
Language: Jupyter Notebook - Size: 2.29 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
GianfrancoCorrea/gpt-4-vision-chat
GPT 4 Turbo Vision with Chainlit
Language: Python - Size: 29.3 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 10 - Forks: 0
zaidmukaddam/poemthatsunset
Let AI write poems about sunsets for you!
Language: TypeScript - Size: 673 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
reedemus/teleGPT
chatGPT as a telegram bot
Language: Python - Size: 2.33 MB - Last synced: 23 days ago - Pushed: 7 months ago - Stars: 1 - Forks: 0
nateraw/openai-vision-api-for-videos
Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦
Language: Jupyter Notebook - Size: 13.7 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 34 - Forks: 2