GitHub / FoundationVision / Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/FoundationVision%2FGroma
PURL: pkg:github/FoundationVision/Groma

Stars: 575
Forks: 44
Open issues: 15

License: apache-2.0
Language: Python
Size: 13.5 MB
Dependencies parsed at: Pending

Created at: over 1 year ago
Updated at: 8 days ago
Pushed at: about 1 year ago
Last synced at: 4 days ago

Topics: foundation-models, grounding, large-language-models, llama, llama2, llm, mllm, multimodal, vision-language-model

Loading...