GitHub / Pavansomisetty21 / Multimodal-AI-Agent-for-Video-Understanding-and-Research-using-Gemini-LLM
In this we implement Multimodal AI Agent for Video Understanding and Research we can ask any questions on video it will answer to it
Stars: 0
Forks: 0
Open issues: 0
License: mit
Language: Jupyter Notebook
Size: 4.21 MB
Dependencies parsed at: Pending
Created at: about 1 month ago
Updated at: about 1 month ago
Pushed at: about 1 month ago
Last synced at: 21 days ago
Topics: multimodal, multimodal-large-language-models, video-question-answering
Loading...