GitHub topics: multimodel-large-language-model
theboringhumane/echoOLlama
🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features WebSocket streaming, voice interactions, and OpenAI API compatibility. Built with FastAPI, Redis, and PostgreSQL. Perfect for private AI conversations and custom voice assistants.
Language: Jupyter Notebook - Size: 8.2 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 101 - Forks: 4

dvlab-research/Seg-Zero
Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
Language: Python - Size: 4.4 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 298 - Forks: 7

sun-hailong/TVC
🎉 The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" in PyTorch.
Language: Python - Size: 9.68 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 8 - Forks: 1

balaji1233/AI-Radiology-Reporting
Uses MAIRA-2, a multimodal transformer designed to generate grounded or non-grounded radiology reports from chest X-rays.
Language: Jupyter Notebook - Size: 27.3 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

raminguyen/LLMP2
Evaluating ‘Graphical Perception’ with Multimodal Large Language Models
Language: Jupyter Notebook - Size: 477 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

charanhu/Assets_Youtube_Videos
This repository showcases a collection of innovative projects by Charan H U, focusing on cutting-edge technologies such as facial emotion recognition, fitness tracking, and multi-model applications. Each project demonstrates practical implementations of advanced AI/ML techniques, making it a valuable resource for developers and researchers.
Language: Jupyter Notebook - Size: 18.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

hmshb/langchain-google-gemini-integration
This repo contains an integration of LangChain with the Google Gemini LLM.
Language: Python - Size: 254 KB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Coding-Devil/AI-Multimodel-Hub
AI multi-model using RAG and Langchain
Language: Python - Size: 1010 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Nileshsan/Digital-product-feature-multimodel-large-language-model
Create a tool that uses a multimodal LLM to describe testing instructions for any digital product's features, based on the screenshots.
Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

xinyanghuang7/Basic-Visual-Language-Model
Build a simple, basic multimodal large model from scratch. 🤖
Language: Python - Size: 34.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0

spidercatfly/TacticExpert
Figuring out how to graduate
Language: Python - Size: 21.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0
