GitHub topics: audio-language-model
ALucek/multimodal-llm-breakdown
Outlines and demonstrates how language models understand image, video, and text content.
Language: Jupyter Notebook - Size: 14.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0
