Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: gpt-4-vision

lancedb/vectordb-recipes

High quality resources & applications for LLMs, multi-modal models and VectorDBs

Language: Jupyter Notebook - Size: 82.5 MB - Last synced: about 15 hours ago - Pushed: 1 day ago - Stars: 457 - Forks: 82

lobehub/lobe-chat

🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity), multi-modal features (Vision/TTS) and a plugin system. One-click FREE deployment of your private ChatGPT chat application.

Language: TypeScript - Size: 140 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 31,484 - Forks: 7,390

davidmigloz/pixels2flutter

Convert a screenshot to a working Flutter app.

Language: Dart - Size: 17.8 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 139 - Forks: 35

Skythinker616/gpt-assistant-android

A free ChatGPT API voice assistant for Android, activated via the volume keys for voice interaction, supporting features such as web access, Vision photo recognition, and question templates.

Language: Java - Size: 72.4 MB - Last synced: 5 days ago - Pushed: 11 days ago - Stars: 552 - Forks: 67

Helltar/artific_intellig_bot

AI Telegram Bot, ChatGPT, Dalle2, Whisper, GPT-4 Vision, Stability AI

Language: Kotlin - Size: 627 KB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 19 - Forks: 2

roboflow/multimodal-maestro

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥

Language: Python - Size: 5.12 MB - Last synced: 5 days ago - Pushed: 4 months ago - Stars: 966 - Forks: 68

tbckr/sgpt

SGPT is a command-line tool that provides a convenient way to interact with OpenAI models, enabling users to run queries, generate shell commands and produce code directly from the terminal.

Language: Go - Size: 1.17 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 181 - Forks: 21

danny-avila/LibreChat

Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development

Language: TypeScript - Size: 32.9 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 11,885 - Forks: 2,113

sanket98a/MetaData-Extraction

Welcome to GPT-4 Vision Apparel Metadata Extractor! 🌟 Our cutting-edge application leverages the power of GPT-4 to accurately extract detailed metadata from images, focusing specifically on apparel items.

Language: Python - Size: 8.98 MB - Last synced: 14 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

kwishna/openai-smart-vision

AI apps using OpenAI Vision model.

Language: JavaScript - Size: 1.63 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 0 - Forks: 0

dfdggrss/GPT-Desktop-Client

Size: 1000 Bytes - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 0 - Forks: 0

maquenneville/PhotogChauffeur

An OpenAI Vision-powered local image search tool for complex/subjective NL queries

Language: Python - Size: 39.1 KB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 1 - Forks: 0

9vult/Raiha

Raiha Discord Accessibility Bot

Language: TypeScript - Size: 270 KB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 2 - Forks: 5

developersdigest/ai-devices

AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more

Language: TypeScript - Size: 8.63 MB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 116 - Forks: 19

first-automation/layoutya

Create images by laying out Irasutoya illustrations.

Language: Python - Size: 3.73 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 0 - Forks: 0

corbindavenport/alt-text-creator

Browser extension that generates alternate text for images using GPT-4 Vision.

Language: JavaScript - Size: 920 KB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 1 - Forks: 0

sazonovanton/SirChatalot

SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or YandexART for image creation. It can use vision capabilities or tools/functions.

Language: Python - Size: 549 KB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 60 - Forks: 10

TypingMind/typingmind

The most advanced Web UI for AI chat

Language: HTML - Size: 45.4 MB - Last synced: 30 days ago - Pushed: 30 days ago - Stars: 173 - Forks: 68

szczyglis-dev/py-gpt

Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command execution, file upload and download, speech synthesis and recognition, access to Web, memory, prompt presets, plugins, assistants & more. Linux, Windows, Mac.

Language: Python - Size: 31 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 316 - Forks: 67

WisconsinAIVision/ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Language: Python - Size: 17.4 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 165 - Forks: 8

aymenfurter/copilot-insurance-claim-demo

How a Picture of Car Damage Can File Your Insurance Claim

Language: Java - Size: 21.9 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 4 - Forks: 2

BenderScript/netvision

Network Topology Image Analysis

Language: Python - Size: 32.4 MB - Last synced: 21 days ago - Pushed: 22 days ago - Stars: 0 - Forks: 0

theonlyamos/cognitrix

AI Agent equipped with tools and extensions

Language: Python - Size: 430 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 1 - Forks: 1

neka-nat/mylangrobot

Language instructions to mycobot using GPT-4V

Language: Python - Size: 3.52 MB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 13 - Forks: 0

ktutak1337/Stellar-Chat

A multi-modal chat application enabling users to create custom agents, and integrate with local LLMs (Local Language Models), as well as OpenAI models.

Language: C# - Size: 104 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 16 - Forks: 2

jahnvisikligar/Computer-Vision

This repo focuses on my works in the field of Computer Vision

Language: Jupyter Notebook - Size: 549 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

kornia/pixie

Pixie: Computer Vision AI Engineer assistant

Size: 12.7 KB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 13 - Forks: 2

reidbarber/gen-ui

Use text or image prompts to generate components and apps built with React.

Language: TypeScript - Size: 490 KB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 8 - Forks: 3

Sacred-G/ai

PDF chatbot, image chatbot, and website chatbot with a knowledge base. OpenAI, memory, PostgreSQL.

Language: Python - Size: 37.2 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

kamilkaczmareksolutions/JanAr

JanAr: GUI application leveraging GPT-4-Vision and GPT models to automatically generate engaging social media captions for artwork images. Customized for a glass workshop and picture framing business, it blends artistic insights with effective online engagement strategies.

Language: Python - Size: 614 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

mountaineerbr/shellChatGPT

Shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS. Features LocalAI, Ollama, Gemini, and Mistral integration.

Language: Shell - Size: 756 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 28 - Forks: 2

Elehiggle/ChatGPTMattermostChatbot

An AI-powered Mattermost ChatGPT chatbot that utilizes the OpenAI API to provide helpful, contextual responses to user messages, extract text from links, and describe or generate images. With Docker support!

Language: Python - Size: 470 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

SkalskiP/sports

Cool experiments at the intersection of Computer Vision and Sports ⚽🏃

Language: Jupyter Notebook - Size: 15.1 MB - Last synced: about 2 months ago - Pushed: 6 months ago - Stars: 436 - Forks: 29

ks6088ts-labs/extractor-python

A data extract tool written in Python

Language: Python - Size: 263 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

vdutts7/gpt4V-scraper

AI agent that can SEE 👁️, control, navigate, & do stuff for you on your browser.

Language: JavaScript - Size: 10.4 MB - Last synced: 2 months ago - Pushed: 3 months ago - Stars: 185 - Forks: 19

signebedi/gptty

ChatGPT wrapper in your TTY

Language: Python - Size: 726 KB - Last synced: 5 days ago - Pushed: 3 months ago - Stars: 47 - Forks: 7

mickymultani/GPT-4-Vision-Architecture-Scanner

A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insights and detailed breakdowns in an interactive chat interface.

Language: JavaScript - Size: 103 KB - Last synced: 26 days ago - Pushed: 7 months ago - Stars: 12 - Forks: 2

danomation/CanvasGPT

OpenAI Vision based drawing game.

Language: HTML - Size: 27.3 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

LazaUK/AOAI-GPT4Vision-Streamlit-SDKv1

Using Azure OpenAI deployment of GPT-4 Turbo with Vision to analyse out-of-stock situation in a fictitious retail shop.

Language: Python - Size: 3.8 MB - Last synced: 20 days ago - Pushed: 5 months ago - Stars: 18 - Forks: 5

zhuoooo/GPT4V-Simulator

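An exploration of practices for making GPT-4V(ision) think and test like a human. For example, after opening a web page, a person sees the text and icons on the page with their own eyes and uses what they see to judge whether the page has layout errors.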

Size: 507 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

image-yinyang/public

🖼️☯️ Public endpoint worker

Language: JavaScript - Size: 26.4 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

image-yinyang/genimg

🖼️☯️ Image generation worker

Language: JavaScript - Size: 29.3 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

image-yinyang/ui

🖼️☯️ UI

Language: HTML - Size: 669 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

mapluisch/GPT-4-Vision-for-HoloLens

Capture images with HoloLens and receive descriptive responses from OpenAI's GPT-4V(ision)

Language: ShaderLab - Size: 107 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 7 - Forks: 1

jacobmarks/gpt4-vision-plugin

Chat with your images using GPT-4 Vision!

Language: Python - Size: 9.77 KB - Last synced: about 2 months ago - Pushed: 7 months ago - Stars: 9 - Forks: 2

niawjunior/vision-speak

CameraVision: Capture, Analyze - Seamlessly integrate image analysis using GPT-4 Vision API and convert text to speech with Whisper AI

Language: TypeScript - Size: 1.26 MB - Last synced: about 2 months ago - Pushed: 7 months ago - Stars: 7 - Forks: 1

172478394/chatkore

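chatkore provides developers with high-quality, stable OpenAI-related API endpoints, making it easier for users in China to use open-source ChatGPT projects and AI libraries.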

Size: 5.47 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 8 - Forks: 1

danigleba/solve-stem-problems-with-AI

Take a picture of any STEM problem and get a detailed, step-by-step solution. Built using OpenAI's gpt-4-vision-preview model.

Language: JavaScript - Size: 248 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

aymenfurter/azure-chat-with-your-photos-demo

Chatbot that comprehends uploaded images and engages in detailed conversations about their content.

Language: Bicep - Size: 11.6 MB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 2 - Forks: 0

olafwrieden/ai-pizza-quality-checker

An automated GPT-4 Turbo with Vision-powered pizza quality checker.

Language: TypeScript - Size: 24.4 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

jakobdylanc/Discord-LLM-Chatbot

"llmcord" | Multi-user chat | Choose your LLM | OpenAI API | Mistral API | LM Studio | GPT-4 Turbo with vision | Mixtral 8X7B | And more 🔥

Language: Python - Size: 54.7 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 51 - Forks: 16

PratikDavidson/word-power-ai

Learn Communication Language Effectively with Pictorial Story.

Language: Python - Size: 1.3 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

KF-R/ask

ask is a Python script intended for use at the command line in order to ask the OpenAI API a question, optionally including an image, and have the response read aloud by either fast local TTS or the ElevenLabs API.

Language: Python - Size: 23.4 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

supershaneski/chatgpt-with-image-sample

This sample project integrates OpenAI's GPT-4 Vision, with advanced image recognition capabilities, and DALL·E 3, the state-of-the-art image generation model, with the Chat completions API. This powerful combination allows for simultaneous image creation and analysis.

Language: JavaScript - Size: 2.68 MB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 13 - Forks: 4

franperezlopez/img2card

img2card is a unique Telegram Bot designed to simplify your contact management. It takes facade pictures or contact card pictures as input and generates vCard files

Language: Python - Size: 40 KB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

wfce/ChatGPT-OpenAI-API

The lowest-priced OpenAI ChatGPT-4-32K and ChatGPT-3.5 APIs on the web, up to 42 times cheaper than the official price.

Size: 466 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 2 - Forks: 0

EnspiredjackDev/IDFK_Discord_Bots

All the bots I've made that run on the IDFK Discord server, most of which are about AI

Language: Python - Size: 69.3 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

c0mm4nd/command-windows

CommandWindows is a desktop operating system copilot based on a multi-modal large language model, supporting all platforms that have application windows

Language: TypeScript - Size: 319 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

GODGOD126/ScreenChat-GPT4

Implementing a Dialogue with the Computer Screen Based on the GPT-4 API

Language: Python - Size: 86.9 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

pranavgupta2603/SpltiwiseGPTVision

SplitwiseGPT Vision: Streamline bill splitting with AI-driven image processing and OCR. This innovative web app uses Pytesseract, GPT-4 Vision, and the Splitwise API to simplify group expense management. Upload bill images, auto-extract details, and seamlessly integrate expenses into Splitwise groups. Ideal for easy and accurate financial tracking

Language: Python - Size: 11.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

jeremy-collins/gpt4v-screenshot-analyzer

Harnessing OpenAI's GPT-4 Vision API, this tool offers an interactive way to analyze and understand your screenshots. Capture any part of your screen and engage in a dialogue with ChatGPT to uncover detailed insights, ask follow-up questions, and explore visual data in a user-friendly format.

Language: Python - Size: 5.86 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 2

scalable-dynamics/gpt-spa

A customizable GPT in a single page, using OpenAI models text-embedding-ada-002, tts-1, whisper-1, dall-e-3, and gpt-4-vision-preview

Language: JavaScript - Size: 65.4 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 6 - Forks: 1

ArtemStepanov/gpt4_vision_telegram_bot (fork of karfly/chatgpt_telegram_bot)

Augmented with support for GPT-4 Vision

Language: Python - Size: 1 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

ElDokmak/MultiModal-Models

Hands on some MultiModal Models

Language: Jupyter Notebook - Size: 2.29 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

GianfrancoCorrea/gpt-4-vision-chat

GPT 4 Turbo Vision with Chainlit

Language: Python - Size: 29.3 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 10 - Forks: 0

zaidmukaddam/poemthatsunset

Let AI write poems about sunsets for you!

Language: TypeScript - Size: 673 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

reedemus/teleGPT

ChatGPT as a Telegram bot

Language: Python - Size: 2.33 MB - Last synced: 23 days ago - Pushed: 7 months ago - Stars: 1 - Forks: 0

nateraw/openai-vision-api-for-videos

Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦

Language: Jupyter Notebook - Size: 13.7 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 34 - Forks: 2