Topic: "zero-shot-object-detection"
om-ai-lab/OmDet
Real-time and accurate open-vocabulary end-to-end object detection
Language: Python - Size: 9.75 MB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 1,318 - Forks: 110

FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Language: Python - Size: 22.3 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 1,122 - Forks: 69

IDEA-Research/Grounding-DINO-1.5-API
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Language: Python - Size: 38.1 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 944 - Forks: 35

wanghao9610/OV-DINO
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Language: Python - Size: 5.58 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 296 - Forks: 19

KennithLi/Awesome-Zero-Shot-Object-Detection
Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 117 - Forks: 7

rohit901/cooperative-foundational-models
[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"
Language: Python - Size: 6.33 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 69 - Forks: 4

autodistill/autodistill-florence-2
Use Florence 2 to auto-label data for use in training fine-tuned object detection models.
Language: Python - Size: 41 KB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 63 - Forks: 12

lorebianchi98/FG-OVD
[CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."
Language: Python - Size: 96.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 52 - Forks: 3

ai-forever/fusion_brain_aij2021
Creating multimodal multitask models
Language: Jupyter Notebook - Size: 4.29 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 50 - Forks: 15

rhysdg/vision-at-a-clip
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
Language: Jupyter Notebook - Size: 24 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 36 - Forks: 1

capjamesg/sam-clip
Use Grounding DINO, Segment Anything, and CLIP to label objects in images.
Language: Python - Size: 7.81 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 31 - Forks: 5

deepmancer/clip-object-detection
Zero-shot object detection with CLIP, utilizing Faster R-CNN for region proposals.
Language: Jupyter Notebook - Size: 27.5 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 20 - Forks: 4

sandipan211/ZSD-SC-Resolver
Resolving semantic confusions for improved zero-shot detection (BMVC 2022)
Language: Python - Size: 77 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 4

autodistill/autodistill-paligemma
Use PaliGemma to auto-label data for use in training fine-tuned vision models.
Language: Python - Size: 31.3 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 12 - Forks: 2

autodistill/autodistill-efficient-yolo-world
EfficientSAM + YOLO World base model for use with Autodistill.
Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 0

witnessai/Awesome-Zero-Shot-Object-Detection
A curated list of papers, datasets and resources pertaining to zero-shot object detection.
Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 0

hpc203/GroundingDINO-onnxrun
使用onnxruntime部署GroundingDINO开放世界目标检测,包含C++和Python两个版本的程序
Language: Python - Size: 2.66 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

autodistill/autodistill-owlv2
OWLv2 base model for use with Autodistill.
Language: Python - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 6

autodistill/autodistill-codet
CoDet base model for use with Autodistill.
Language: Python - Size: 37.1 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

autodistill/autodistill-yolo-world
YOLO World base module for use with Autodistill.
Language: Python - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

bladeszasza/SOWLv2
SOWLv2 (SegmentedOWLv2) is a powerful command-line tool for text-prompted object segmentation for video and images.
Language: Jupyter Notebook - Size: 6.92 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

MinLee0210/Smart-Evaluation-Solution
A project for AngelHack competition - h4ckhcmc 2024
Language: Python - Size: 444 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 2

capjamesg/image-collage
Generate an image collage with computer vision.
Language: Python - Size: 7.81 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

iKrishneel/zsis
CLIP based Zero Shot Instance Segmentation
Language: Jupyter Notebook - Size: 1.48 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

HassanBinHaroon/GroundingDINO-Inference
This project represents a GroundingDINO Inference (zero-shot object detection) procedure with both methods (CLI and Script). This implementation will help the reader to know the sequence of commands and exemplifying commands for running a quick zero-shot object detection. Additionally, the reader may get insight into code (script) execution.
Language: Jupyter Notebook - Size: 1.04 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

iAnisdev/assistive-ai
Zero-shot object detection system for visually impaired users using CLIP, OWL-ViT, and real-time audio feedback.
Size: 3.91 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

SvenPfiffner/SegmentBot
SegmentBot is a user-friendly web application for zero-shot image segmentation and editing. The app supports GPU acceleration and is designed for research and personal use, with a modular system for easy extension.
Language: Python - Size: 102 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

parnerka/CV-EyeTracking
Pranav's code contribution during his internship at ARL-W. Includes gaze-based object detection GUI and screen tracking script.
Language: Jupyter Notebook - Size: 235 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

sefaburakokcu/autodetectify
Autodectify: Detect and Export Objects with Zero-Shot Object Detection Models
Language: Python - Size: 5.38 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

autodistill/autodistill-qwen-vl
Qwen-VL base model for use with Autodistill.
Language: Python - Size: 6.84 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
