An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: paligemma

roboflow/maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Language: Python - Size: 10.6 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 2,628 - Forks: 215

google-gemini/gemma-cookbook

A collection of guides and examples for the Gemma open models from Google.

Language: Jupyter Notebook - Size: 136 MB - Last synced at: 5 days ago - Pushed at: 22 days ago - Stars: 2,035 - Forks: 314

roboflow/notebooks

A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.

Language: Jupyter Notebook - Size: 691 MB - Last synced at: 7 days ago - Pushed at: 11 days ago - Stars: 8,280 - Forks: 1,305

Blaizzy/mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Language: Python - Size: 36 MB - Last synced at: 7 days ago - Pushed at: 16 days ago - Stars: 1,605 - Forks: 169

tibir123/maestro

🎶 Control Apple Music seamlessly on macOS with Maestro, a powerful music controller built for performance, security, and extensibility.

Language: Go - Size: 85 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

eshwarram07/maestro

MAESTRO is an AI-powered research application designed to streamline complex research tasks.

Language: Python - Size: 2.92 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

sikokil/maestro

Maestro simplifies job orchestration for Node.js message workflows. Easily manage producers, consumers, and monitoring in your applications. 🚀✨

Size: 1.95 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

kornia/kornia-paligemma

Rust implementation of Google Paligemma with Candle

Language: Rust - Size: 2.77 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 6 - Forks: 1

sitammeur/paligemma-docci

Image Captioning with PaliGemma 2 Vision Language Model.

Language: Python - Size: 1.26 MB - Last synced at: 23 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

sayedmohamedscu/Vision-language-models-VLM

vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....)

Language: Jupyter Notebook - Size: 16.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 27 - Forks: 7

altaf1444/notebooks

<div align="center"> <a href="https://unsloth.ai"><picture> <source media="(prefers-color-scheme: dark)" srcset="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20logo%20white%20text.png"> <source media="(prefers-color-scheme: light)" srcset="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%

Language: Jupyter Notebook - Size: 9.26 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ejlnmusic/PaliGemma-flickr8k-finetuning

# PaliGemma-flickr8k-finetuningThis repository provides a method to fine-tune the PaliGemma model on the Flickr8k dataset for improved image captioning. Explore the features and utilities designed for efficient training and testing. 🐙🌟

Language: Jupyter Notebook - Size: 375 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

AHMEDSANA/PaliGemma-flickr8k-finetuning

This repository contains code for fine-tuning Google's PaliGemma vision-language model on the Flickr8k dataset for image captioning tasks

Language: Jupyter Notebook - Size: 401 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

BUAADreamer/MLLM-Finetuning-Demo

使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory

Language: Python - Size: 61.5 KB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 32 - Forks: 2

sitammeur/paligemma2-mix-litserve

Leverage PaliGemma 2 mix model variant capabilities using LitServe.

Language: Python - Size: 768 KB - Last synced at: 23 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

sitammeur/paligemma2-docci-litserve

Leverage PaliGemma 2's DOCCI fine-tuned variant capabilities using LitServe.

Language: Python - Size: 468 KB - Last synced at: 23 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

shaadclt/Fine-tune-PaliGemma-Image-Captioning

This project demonstrates how to fine-tune PaliGemma model for image captioning. The PaliGemma model, developed by Google Research, is designed to handle images and generate corresponding captions.

Language: Jupyter Notebook - Size: 408 KB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 6 - Forks: 0

MaxLSB/mini-paligemma2

Minimalist implementation of PaliGemma 2 & PaliGemma VLM from scratch

Language: Python - Size: 4.22 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

adithya-s-k/YoloGemma

Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detection and segmentation.

Language: Python - Size: 11.8 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 80 - Forks: 5

GURPREETKAURJETHRA/PaliGemma-FineTuning

PaliGemma FineTuning

Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 4

anamabo/SegmentWaterWithPaligemma

Segmentation of water in Satellite images using Paligemma

Language: Jupyter Notebook - Size: 226 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

autodistill/autodistill-paligemma

Use PaliGemma to auto-label data for use in training fine-tuned vision models.

Language: Python - Size: 31.3 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 12 - Forks: 2

osmajic-mihaela/vqa-paligemma

Fine tunned PaliGemma vision-language models using the ScienceQA dataset for visual question answering.

Language: Jupyter Notebook - Size: 24.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Related Keywords
paligemma 23 python 8 fine-tuning 7 vision-language-model 7 machine-learning 6 deep-learning 6 transformers 6 computer-vision 6 image-captioning 5 pytorch 5 vlm 4 artificial-intelligence 3 blackbox-testing 3 vision-and-language 3 objectdetection 3 captioning 3 image-annotation 2 data-science 2 flickr8k-dataset 2 flax 2 data-orchestrator 2 data-pipelines 2 compter-vision 2 generative-ai 2 ios 2 mlops 2 operating-system 2 orchestration 2 ui-automation 2 workflow-engine 2 transfer-learning 2 multimodal-ai 2 kaggle 2 rust 2 jax 2 image-processing 2 unix 2 gemma 2 google-colab 2 image-classification 2 image-segmentation 2 vqa 2 qwen 2 tutorial 2 yolov5 2 lora 2 zero-shot-classification 2 litserve 2 lightning-ai 2 fastapi 2 llava 2 multimodal 2 florence-2 2 medgemma 1 medical-imaging 1 qlora 1 visionlanguage 1 aws 1 data-analysis 1 jupyter-notebook 1 matplotlib 1 scikit-learn 1 tutorials 1 tensorflow 1 visual-question-answering 1 scienceqa 1 zero-shot-object-detection 1 fine-tuning-computer-vision 1 autodistill 1 satellite-imagery 1 remote-sensing 1 openai 1 llms 1 large-language-models 1 optical-character-recognition 1 yi-vl 1 supervised-finetuning 1 pretraining 1 mllm 1 llama-factory 1 huggingface-datasets 1 finetune-llm 1 natural-language-processing 1 vision-transformer 1 vision-framework 1 pixtral 1 molmo 1 mlx 1 local-ai 1 llm 1 idefics 1 florence2 1 apple-silicon 1 zero-shot-detection 1 yolov8 1 open-vocabulary-segmentation 1 open-vocabulary-detection 1 object-detection 1 deep-neural-networks 1 automatic-labeling-system 1