An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: multimodel-large-language-model

theboringhumane/echoOLlama

🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features WebSocket streaming, voice interactions, and OpenAI API compatibility. Built with FastAPI, Redis, and PostgreSQL. Perfect for private AI conversations and custom voice assistants.

Language: Jupyter Notebook - Size: 8.2 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 101 - Forks: 4

dvlab-research/Seg-Zero

Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"

Language: Python - Size: 4.4 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 298 - Forks: 7

sun-hailong/TVC

🎉 The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" in PyTorch.

Language: Python - Size: 9.68 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 8 - Forks: 1

balaji1233/AI-Radiology-Reporting

Using MAIRA-2 multimodal transformer designed for the generation of grounded or non-grounded radiology reports from chest X-rays.

Language: Jupyter Notebook - Size: 27.3 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

raminguyen/LLMP2

Evaluating ‘Graphical Perception’ with Multimodal Large Language Models

Language: Jupyter Notebook - Size: 477 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

charanhu/Assets_Youtube_Videos

This repository showcases a collection of innovative projects by Charan H U, focusing on cutting-edge technologies such as facial emotion recognition, fitness tracking, and multi-model applications. Each project demonstrates practical implementations of advanced AI/ML techniques, making it a valuable resource for developers and researchers.

Language: Jupyter Notebook - Size: 18.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

hmshb/langchain-google-gemini-integration

This repo contains integration of LangChain with Google Gemini LLM

Language: Python - Size: 254 KB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Coding-Devil/AI-Multimodel-Hub

AI multi-model using RAG and Langchain

Language: Python - Size: 1010 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Nileshsan/Digital-product-feature-multimodel-large-language-model

Create a tool that uses a multimodal LLM to describe testing instructions for any digital product's features, based on the screenshots.

Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

xinyanghuang7/Basic-Visual-Language-Model

Build a simple basic multimodal large model from scratch. 从零搭建一个简单的基础多模态大模型🤖

Language: Python - Size: 34.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0

spidercatfly/TacticExpert

设计一下怎么毕业

Language: Python - Size: 21.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0