An open API service providing repository metadata for many open source software ecosystems.

Topic: "open-vocabulary-detection"

IDEA-Research/Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook - Size: 152 MB - Last synced at: 12 days ago - Pushed at: 9 months ago - Stars: 16,333 - Forks: 1,493

roboflow/notebooks

This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.

Language: Jupyter Notebook - Size: 463 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 7,724 - Forks: 1,213

roboflow/awesome-openai-vision-api-experiments

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

Language: Python - Size: 40.5 MB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 1,679 - Forks: 132

FoundationVision/GLEE

[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Language: Python - Size: 22.3 MB - Last synced at: 10 days ago - Pushed at: 7 months ago - Stars: 1,126 - Forks: 70

IDEA-Research/Grounding-DINO-1.5-API

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language: Python - Size: 38.1 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 950 - Forks: 36

SkalskiP/awesome-foundation-and-multimodal-models

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

Language: Python - Size: 58.6 KB - Last synced at: about 1 hour ago - Pushed at: over 1 year ago - Stars: 618 - Forks: 46

segments-ai/panoptic-segment-anything

Combining Segment Anything (SAM) with Grounding DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation

Language: Jupyter Notebook - Size: 32.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 329 - Forks: 22

wanghao9610/OV-DINO

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Language: Python - Size: 5.58 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 296 - Forks: 19

Charles-Xie/awesome-described-object-detection

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.

Size: 40 KB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 273 - Forks: 22

FoundationVision/GenerateU

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

Language: Python - Size: 14.4 MB - Last synced at: 8 days ago - Pushed at: 2 months ago - Stars: 168 - Forks: 7

jaychempan/LAE-DINO

🦕 [AAAI'25] Official Code for "Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"

Language: Jupyter Notebook - Size: 16.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 145 - Forks: 8

shikras/d-cube

A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).

Language: Python - Size: 835 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 123 - Forks: 7

naver/shine

[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection

Language: Python - Size: 73.8 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 92 - Forks: 8

CVMI-Lab/CoDet

(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

Language: Python - Size: 13 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 84 - Forks: 4

rohit901/cooperative-foundational-models

[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"

Language: Python - Size: 6.33 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 69 - Forks: 4

lorebianchi98/FG-OVD

[CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."

Language: Python - Size: 96.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 52 - Forks: 3

om-ai-lab/OVDEval

A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)

Language: Python - Size: 5.79 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 44 - Forks: 2

wusize/CLIM

[AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation

Language: Python - Size: 12.8 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 3

lorebianchi98/FG-CLIP

[CBMI2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".

Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 26 - Forks: 0

lartpang/OVCamo

(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation

Language: Python - Size: 46.9 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 22 - Forks: 1

jaychempan/ETS

🥈🐉 [CVPRW'25] Official Code for “Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection”

Language: Jupyter Notebook - Size: 22.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 15 - Forks: 0

mala-lab/SIC-CADS

Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)

Language: Python - Size: 20.4 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 1

hpc203/GroundingDINO-onnxrun

Deploying GroundingDINO open-world object detection with onnxruntime, including both C++ and Python versions of the program

Language: Python - Size: 2.66 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

EasyWalk-PRIN/OpenNav

Official code for the OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation - ACVR Workshop at ECCV'24

Language: Python - Size: 400 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 4 - Forks: 1

hpc203/Open-Vocabulary-Object-Detection-opencv-onnxrun

Deploying open-domain object detection with OpenCV + onnxruntime, including both C++ and Python versions of the program

Language: C++ - Size: 2.44 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

macorisd/TALOS

Modular pipeline for automatic open-vocabulary instance segmentation using a combination of state-of-the-art models.

Language: Python - Size: 13.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

superZ678/OV-FGHAD

The official repo for the technical report "Open-Vocabulary Fine-Grained Hand Action Detection"

Size: 2.21 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

Related Topics
object-detection (13), open-vocabulary-segmentation (9), zero-shot-object-detection (6), computer-vision (5), open-world (4), open-vocabulary (3), segment-anything (3), open-world-object-detection (3), grounding-dino (3), clip (3), referring-expression-comprehension (3), deep-learning (3), zero-shot-detection (2), foundation-model (2), pytorch (2), vision-language (2), automatic-labeling-system (2), locate-anything-on-earth (1), remote-sensing (1), detection (1), fine-grained-understanding (1), fg-ovd (1), evaluation-study (1), zero-shot-classification (1), deep-neural-networks (1), yolov8 (1), yolov5 (1), vlm (1), google-colab (1), tutorial (1), qwen (1), image-classification (1), paligemma (1), machine-learning (1), image-segmentation (1), computer-vison (1), ros2-humble (1), 3d-object-detection (1), speech (1), image-editing (1), data-generation (1), caption (1), 3d-whole-body-pose-estimation (1), video-object-segmentation (1), video-instance-segmentation (1), tracking (1), referring-video-object-segmentation (1), referring-expression-segmentation (1), open-vocabulary-video-segmentation (1), interactive-segmentation (1), zero-shot (1), openai (1), classification (1), chatgpt (1), open-set (1), multimodal (1), llava (1), image-captioning (1), foundational-models (1), blip (1), vision-and-language (1), visual-grounding (1), awesome-list (1), awesome (1), multimodality (1), mllm (1), fine-grained-open-vocabulary-object-detection (1), artificial-intelligence (1), text-prompt (1), prompt-engineering (1), open-world-detection (1), groundingdino (1), opencv-dnn (1), segmentation (1), vision-and-language-pre-training (1), lae-dino (1), fudational-detector (1), camouflaged-target-detection (1), camouflaged-object-detection (1), camouflage-images (1), camouflage-detection (1), ov-dino (1), fundation-models (1), open-set-object-detection (1), novel-objects (1), ntire-2025-cd-fsod-challenge (1), ntire (1), cvprw2025 (1), cross-domain-few-shot-object-detection (1), augmentation-search-strategy (1), hand-action-detection (1), fine-grained (1), multi-modal-learning (1), dataset (1), nlp (1)