Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / FoundationVision / Groma
Grounded Multimodal Large Language Model with Localized Visual Tokenization
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/FoundationVision%2FGroma
Stars: 423
Forks: 54
Open Issues: 1
License: apache-2.0
Language: Python
Repo Size: 13.5 MB
Dependencies:
45
Created: about 1 month ago
Updated: 4 days ago
Last pushed: 4 days ago
Last synced: 4 days ago
Topics: foundation-models, grounding, large-language-models, llama, llama2, llm, mllm, multimodal, vision-language-model
Files
Dependencies
- python 3.7 build
- docutils ==0.16.0
- myst-parser *
- opencv-python *
- sphinx ==4.0.2
- sphinx-copybutton *
- sphinx_markdown_tables *
- torch *
- PyTurboJPEG * test
- coverage * test
- lmdb * test
- onnx ==1.7.0 test
- onnxoptimizer * test
- onnxruntime >=1.8.0 test
- pytest * test
- scipy * test
- tifffile * test
- accelerate @ git+https://github.com/huggingface/accelerate@a2d8f540c3ab37c8f84d616be1300a0572b69cf8
- deepspeed ==0.9.2
- einops *
- fastapi *
- gradio ==3.23
- lvis @ git+https://github.com/lvis-dataset/lvis-api.git
- markdown2 [all]
- numpy *
- peft ==0.3.0
- pycocoevalcap *
- pycocotools *
- requests *
- scipy *
- sentencepiece *
- shortuuid *
- terminaltables *
- tokenizers ==0.12.1
- transformers ==4.32.0
- uvicorn *