Topic: "open-vocabulary-detection"
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language: Jupyter Notebook - Size: 152 MB - Last synced at: 12 days ago - Pushed at: 9 months ago - Stars: 16,333 - Forks: 1,493

roboflow/notebooks
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.
Language: Jupyter Notebook - Size: 463 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 7,724 - Forks: 1,213

roboflow/awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
Language: Python - Size: 40.5 MB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 1,679 - Forks: 132

FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Language: Python - Size: 22.3 MB - Last synced at: 10 days ago - Pushed at: 7 months ago - Stars: 1,126 - Forks: 70

IDEA-Research/Grounding-DINO-1.5-API
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Language: Python - Size: 38.1 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 950 - Forks: 36

SkalskiP/awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Language: Python - Size: 58.6 KB - Last synced at: about 1 hour ago - Pushed at: over 1 year ago - Stars: 618 - Forks: 46

segments-ai/panoptic-segment-anything
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
Language: Jupyter Notebook - Size: 32.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 329 - Forks: 22

wanghao9610/OV-DINO
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Language: Python - Size: 5.58 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 296 - Forks: 19

Charles-Xie/awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.
Size: 40 KB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 273 - Forks: 22

FoundationVision/GenerateU
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
Language: Python - Size: 14.4 MB - Last synced at: 8 days ago - Pushed at: 2 months ago - Stars: 168 - Forks: 7

jaychempan/LAE-DINO
🦕 [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"
Language: Jupyter Notebook - Size: 16.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 145 - Forks: 8

shikras/d-cube
A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).
Language: Python - Size: 835 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 123 - Forks: 7

naver/shine
[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
Language: Python - Size: 73.8 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 92 - Forks: 8

CVMI-Lab/CoDet
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Language: Python - Size: 13 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 84 - Forks: 4

rohit901/cooperative-foundational-models
[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"
Language: Python - Size: 6.33 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 69 - Forks: 4

lorebianchi98/FG-OVD
[CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."
Language: Python - Size: 96.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 52 - Forks: 3

om-ai-lab/OVDEval
A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
Language: Python - Size: 5.79 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 44 - Forks: 2

wusize/CLIM
[AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation
Language: Python - Size: 12.8 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 3

lorebianchi98/FG-CLIP
[CBMI2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".
Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 26 - Forks: 0

lartpang/OVCamo
(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation
Language: Python - Size: 46.9 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 22 - Forks: 1

jaychempan/ETS
🥈🐉 [CVPRW'25] Official Code for “Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection”
Language: Jupyter Notebook - Size: 22.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 15 - Forks: 0

mala-lab/SIC-CADS
Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)
Language: Python - Size: 20.4 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 1

hpc203/GroundingDINO-onnxrun
使用onnxruntime部署GroundingDINO开放世界目标检测,包含C++和Python两个版本的程序
Language: Python - Size: 2.66 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

EasyWalk-PRIN/OpenNav
Official code for the OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation - ACVR Workshop at ECCV'24
Language: Python - Size: 400 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 4 - Forks: 1

hpc203/Open-Vocabulary-Object-Detection-opencv-onnxrun
使用OpenCV+onnxruntime部署开放域目标检测,包含C++和Python两个版本的程序
Language: C++ - Size: 2.44 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

macorisd/TALOS
Modular pipeline for automatic open-vocabulary instance segmentation using a combination state-of-the-art models.
Language: Python - Size: 13.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

superZ678/OV-FGHAD
The official repo for the technical report "Open-Vocabulary Fine-Grained Hand Action Detection"
Size: 2.21 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0
