An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: qwen-vl

gokayfem/awesome-vlm-architectures

Famous Vision Language Models and Their Architectures

Language: Markdown - Size: 2.26 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 804 - Forks: 42

janelu9/EasyLLM

Run large language models easily.

Language: Python - Size: 220 MB - Last synced at: 7 days ago - Pushed at: 24 days ago - Stars: 8 - Forks: 0

zjysteven/lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v, etc.

Language: Python - Size: 13 MB - Last synced at: 27 days ago - Pushed at: 2 months ago - Stars: 284 - Forks: 29

reidbarber/webmarker

Mark web pages for use with vision-language models

Language: TypeScript - Size: 677 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 30 - Forks: 3

autodistill/autodistill-qwen-vl

Qwen-VL base model for use with Autodistill.

Language: Python - Size: 6.84 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0