An open API service providing repository metadata for many open source software ecosystems.

Topic: "zero-shot-object-detection"

om-ai-lab/OmDet

Real-time and accurate open-vocabulary end-to-end object detection

Language: Python - Size: 9.75 MB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 1,318 - Forks: 110

FoundationVision/GLEE

[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Language: Python - Size: 22.3 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 1,122 - Forks: 69

IDEA-Research/Grounding-DINO-1.5-API

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language: Python - Size: 38.1 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 944 - Forks: 35

wanghao9610/OV-DINO

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Language: Python - Size: 5.58 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 296 - Forks: 19

KennithLi/Awesome-Zero-Shot-Object-Detection

Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 117 - Forks: 7

rohit901/cooperative-foundational-models

[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"

Language: Python - Size: 6.33 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 69 - Forks: 4

autodistill/autodistill-florence-2

Use Florence 2 to auto-label data for use in training fine-tuned object detection models.

Language: Python - Size: 41 KB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 63 - Forks: 12

lorebianchi98/FG-OVD

[CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."

Language: Python - Size: 96.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 52 - Forks: 3

ai-forever/fusion_brain_aij2021

Creating multimodal multitask models

Language: Jupyter Notebook - Size: 4.29 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 50 - Forks: 15

rhysdg/vision-at-a-clip

Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts

Language: Jupyter Notebook - Size: 24 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 36 - Forks: 1

capjamesg/sam-clip

Use Grounding DINO, Segment Anything, and CLIP to label objects in images.

Language: Python - Size: 7.81 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 31 - Forks: 5

deepmancer/clip-object-detection

Zero-shot object detection with CLIP, utilizing Faster R-CNN for region proposals.

Language: Jupyter Notebook - Size: 27.5 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 20 - Forks: 4

sandipan211/ZSD-SC-Resolver

Resolving semantic confusions for improved zero-shot detection (BMVC 2022)

Language: Python - Size: 77 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 4

autodistill/autodistill-paligemma

Use PaliGemma to auto-label data for use in training fine-tuned vision models.

Language: Python - Size: 31.3 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 12 - Forks: 2

autodistill/autodistill-efficient-yolo-world

EfficientSAM + YOLO World base model for use with Autodistill.

Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 0

witnessai/Awesome-Zero-Shot-Object-Detection

A curated list of papers, datasets and resources pertaining to zero-shot object detection.

Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 0

hpc203/GroundingDINO-onnxrun

Deploy GroundingDINO open-world object detection with onnxruntime, including both C++ and Python versions of the program

Language: Python - Size: 2.66 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

autodistill/autodistill-owlv2

OWLv2 base model for use with Autodistill.

Language: Python - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 6

autodistill/autodistill-codet

CoDet base model for use with Autodistill.

Language: Python - Size: 37.1 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

autodistill/autodistill-yolo-world

YOLO World base module for use with Autodistill.

Language: Python - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

bladeszasza/SOWLv2

SOWLv2 (SegmentedOWLv2) is a powerful command-line tool for text-prompted object segmentation for video and images.

Language: Jupyter Notebook - Size: 6.92 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

MinLee0210/Smart-Evaluation-Solution

A project for AngelHack competition - h4ckhcmc 2024

Language: Python - Size: 444 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 2

capjamesg/image-collage

Generate an image collage with computer vision.

Language: Python - Size: 7.81 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

iKrishneel/zsis

CLIP based Zero Shot Instance Segmentation

Language: Jupyter Notebook - Size: 1.48 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

HassanBinHaroon/GroundingDINO-Inference

This project demonstrates GroundingDINO inference (zero-shot object detection) using both the CLI and a script. It walks the reader through the sequence of commands, with example commands for running a quick zero-shot object detection, and gives insight into how the script executes.

Language: Jupyter Notebook - Size: 1.04 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

iAnisdev/assistive-ai

Zero-shot object detection system for visually impaired users using CLIP, OWL-ViT, and real-time audio feedback.

Size: 3.91 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

SvenPfiffner/SegmentBot

SegmentBot is a user-friendly web application for zero-shot image segmentation and editing. The app supports GPU acceleration and is designed for research and personal use, with a modular system for easy extension.

Language: Python - Size: 102 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

parnerka/CV-EyeTracking

Pranav's code contribution during his internship at ARL-W. Includes gaze-based object detection GUI and screen tracking script.

Language: Jupyter Notebook - Size: 235 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

sefaburakokcu/autodetectify

Autodetectify: Detect and Export Objects with Zero-Shot Object Detection Models

Language: Python - Size: 5.38 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

autodistill/autodistill-qwen-vl

Qwen-VL base model for use with Autodistill.

Language: Python - Size: 6.84 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Related Topics
object-detection (15), computer-vision (12), autodistill (6), open-vocabulary-detection (6), clip (5), deep-learning (5), grounding-dino (4), open-world (3), python (3), zero-shot-learning (3), pytorch (3), foundation-model (2), open-vocabulary-segmentation (2), groundingdino (2), streamlit (2), segmentation (2), segment-anything (2), faster-rcnn (2), owlv2 (2), yolo-world (2), sam2 (1), vlm (1), florence-2 (1), region-proposal (1), rcnn (1), openai-clip (1), vision-language-model (1), open-set (1), art (1), ov-dino (1), fundation-models (1), open-set-object-detection (1), novel-objects (1), paligemma (1), fine-tuning-computer-vision (1), image-processing (1), zero-shot-segmentation (1), react-native (1), qdrant (1), owl-vit (1), huggingface (1), fastapi (1), edge-ai (1), docker (1), assistive-technology (1), accessibility (1), video-object-segmentation (1), video-instance-segmentation (1), tracking (1), referring-video-object-segmentation (1), referring-expression-segmentation (1), referring-expression-comprehension (1), open-vocabulary-video-segmentation (1), interactive-segmentation (1), zero-shot (1), vision-and-language (1), real-time (1), open-vocabulary (1), lvis (1), coco (1), machine-learning (1), foundation-models (1), visual-question-answering (1), multitask (1), multimodal-fusion (1), java-to-python (1), handwritten-text-recognition (1), bilingual (1), text-prompt (1), prompt-engineering (1), open-world-object-detection (1), open-world-detection (1), openclip (1), inferenece (1), accurate-algorithm (1), huggingface-transformers (1), triplet-loss (1), pytorch-implementation (1), transformers (1), multi-modal-learning (1), unsupervised-learning (1), conditional-gan (1), vision-language (1), efficientsam (1), fine-grained-open-vocabulary-object-detection (1), artificial-intelligence (1), qwen-vl (1), codet (1), detectron2 (1), opencv (1), eye-tracking (1), tunneling (1), maskrcnn (1), paddleocr (1), ocr (1), nlp (1), ngrok (1), llm (1), hackathon-project (1), groq-api (1)