An open API service providing repository metadata for many open source software ecosystems.

Topic: "image-reasoning"

The-Martyr/Awesome-Multimodal-Reasoning

Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models

Size: 133 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 23 - Forks: 0

ksm26/Introducing-Multimodal-Llama-3.2

This repository focuses on the cutting-edge features of Llama 3.2, including multimodal capabilities, advanced tokenization, and tool calling for building next-gen AI applications. It highlights Llama's enhanced image reasoning, multilingual support, and the Llama Stack API for seamless customization and orchestration.

Language: Jupyter Notebook - Size: 3.79 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0