Topic: "florence-2"
roboflow/maestro
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
Language: Python - Size: 10.6 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2,600 - Forks: 210

jhc13/taggui
Tag manager and captioner for image datasets
Language: Python - Size: 22.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 994 - Forks: 46

D-Ogi/WatermarkRemover-AI
AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to effectively remove watermarks from images with a user-friendly PyQt6 interface.
Language: Python - Size: 48.8 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 510 - Forks: 80

autodistill/autodistill-grounded-sam-2
Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.
Language: Python - Size: 30.3 KB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 125 - Forks: 18

Ravi-Teja-konda/Surveillance_Video_Summarizer
VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
Language: Python - Size: 2.57 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 115 - Forks: 13

autodistill/autodistill-florence-2
Use Florence 2 to auto-label data for use in training fine-tuned object detection models.
Language: Python - Size: 41 KB - Last synced at: 9 days ago - Pushed at: 11 months ago - Stars: 64 - Forks: 13

retkowsky/florence-2
Florence-2
Language: Jupyter Notebook - Size: 45.3 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 62 - Forks: 12

Damarcreative/rem-wm
Rem-WM, a powerful watermark remover tool that leverages the capabilities of Microsoft Florence and Lama Cleaner models.
Language: Python - Size: 188 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 52 - Forks: 9

fireicewolf/wd-llm-caption-cli
A Python base cli tool for caption images with WD series, Joy-caption-pre-alpha,meta Llama 3.2 Vision Instruct and Qwen2 VL Instruct models.
Language: Python - Size: 1.92 MB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 37 - Forks: 9

sayedmohamedscu/Vision-language-models-VLM
vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....)
Language: Jupyter Notebook - Size: 16.3 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 27 - Forks: 7

Iteranya/AktivaAI
Local LLM Discord Bot
Language: Python - Size: 1.67 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 15 - Forks: 1

jacobmarks/fiftyone_florence2_plugin
Run SOTA Vision-Language Model Florence-2 on your data!
Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 9 - Forks: 1

mithunparab/text2segment_video
Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes advanced AI models, specifically Florence2 and SAM2, to detect and segment specific objects or activities in a video based on textual descriptions.
Language: Python - Size: 8.48 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 8 - Forks: 2

regiellis/ecko-cli
ecko-cli is a simple CLI tool that streamlines the process of processing images in a directory, generating captions, and saving them as text files. Additionally, it provides functionalities to create a JSONL file from images in the directory you specify. Images will be captioned using the Microsoft Florence-2-large model and ONNX
Language: Python - Size: 551 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 5 - Forks: 0

PRITHIVSAKTHIUR/Florence-2-Image-Caption
This application utilizes the powerful Florence-2 vision-language model from Microsoft to generate comprehensive captions for images. The model is capable of understanding visual content and expressing it in natural language.
Language: Python - Size: 15.6 KB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 4 - Forks: 0

sitamgithub-MSIT/TextSnap
TextSnap: Demo for Florence 2 model used in OCR tasks to extract and visualize text from images.
Language: Python - Size: 3.34 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 2

Ambruk-chan/DiscordBot Fork of badgids/OpenKlyde 📦
The Ultimate Local LLM Discord Bot!!!
Language: Python - Size: 4.47 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 4 - Forks: 3

Gabriellgpc/computer-vision-dataset-maker
The Power of Florence-2 with OpenVINO & FiftyOne: Real-World Applications in Image Analysis
Language: Python - Size: 11.7 KB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

Kazuhito00/Florence-2-Colaboratory-Sample
Microsoft の軽量VLMのFlorence-2のColaboratory上でのサンプル
Language: Jupyter Notebook - Size: 69.5 MB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

ohjho/hfs-florence-2
code for Hugging Space Florence 2 Demo
Language: Python - Size: 141 KB - Last synced at: 25 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

jkawamoto/mcp-florence2
An MCP server for processing images using Florence-2
Language: Python - Size: 405 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 2

Zuellni/Qt-Caption
Image captioning GUI.
Language: Python - Size: 2.18 MB - Last synced at: 16 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

Zuellni/Image-Tools
Various image processing scripts.
Language: Python - Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Rm1n90/Florence2Onnx
ONNX deploys for Florence 2 visual multimodal
Language: Python - Size: 196 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

nsourlos/wave_segmentation
Notebooks to segment wave contour using RunPod
Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Abdeen-A-AI/Image-Feature-Extraction-Using-GenAI
This project implements an advanced generative AI pipeline for extracting and rating features from images. It combines the power of Florence-2, a state-of-the-art vision-language model, with a fine-tuned version of Mistral-v3, a cutting-edge large language model.
Language: Jupyter Notebook - Size: 2.84 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

antonio-f/Florence-2-test
Florence-2 quick test
Language: Jupyter Notebook - Size: 3.91 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

phamkinhquoc2002/florence2-football-analysis
Language: Python - Size: 47.4 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0
