GitHub / fork123aniket / Multi-Round-VLM-powered-Multimodal-Conversational-AI-Navigation-Bot
Streamlit App Combining Vision, Language, and Audio AI Models
Stars: 3
Forks: 0
Open issues: 0
License: MIT
Language: Python
Size: 18.6 KB
Created at: 4 months ago
Updated at: about 1 month ago
Pushed at: 3 months ago
Last synced at: 4 days ago
Topics: conversational-agent, conversational-ai, conversational-bot, conversational-interface, generative-ai, internvl, internvl2, multimodal, multimodal-data, multimodal-deep-learning, multimodal-large-language-models, multimodal-learning, vision-language, vision-language-learning, vision-language-model, vision-language-models, vision-language-navigation, vision-language-transformer