GitHub topics: multi-modal-ai
Subrata090-th/sheldon-ai-showcase
π Explore cutting-edge spatial AI with Sheldon, a multi-model intelligence platform for seamless and interactive canvas conversations.
Size: 2.38 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

ShivamMishra1603/video-xplore
AI video analysis + web research in one tool. Upload videos, ask questions, get comprehensive insights with current web data.
Language: Python - Size: 7.04 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

fenilsonani/rag-document-qa
Enterprise-grade RAG system featuring dual online/offline operation, multi-modal document processing, and advanced AI capabilities including knowledge graph construction and hybrid search for intelligent document analysis.
Language: Python - Size: 298 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Ashot72/Multi-Modal-MCP-Server-Client
Multi-Modal MCP Server/Client with SSE Transport Layer
Language: TypeScript - Size: 1.12 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

delegatexai/practical-ai-agents
A curated list of AI agents (open-source & proprietary) that solve real-world problems. Updated regularly!
Size: 1000 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

sjinnovation/CollabAI
CollabAI is an open-source & self-hosted AI operation platform for small and medium-sized businesses. Itβs a customizable & team-centric platform where you can have access to custom AI agents tailored to your business needs.
Language: JavaScript - Size: 10.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 85 - Forks: 34

Aish-p/Text-Vision-Agent
Text-Vision-Agent is an AI-powered assistant that generates images from text descriptions and provides detailed image descriptions. It combines image generation using FluxPipeline with vision-based language models like ChatOllama, enabling seamless text-to-image and image interpretation interactions.
Language: Python - Size: 248 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Md-Emon-Hasan/LangChain
Powerful framework for building applications with Large Language Models (LLMs), enabling seamless integration with memory, agents, and external data sources.
Language: Jupyter Notebook - Size: 737 KB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0
