An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: open-vocabulary

om-ai-lab/OmDet

Real-time and accurate open-vocabulary end-to-end object detection

Language: Python - Size: 9.75 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 1,313 - Forks: 111

iris0329/SeeGround

[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

Language: Python - Size: 97.9 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 95 - Forks: 2

jianzongwu/Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

Size: 952 KB - Last synced at: 10 days ago - Pushed at: 27 days ago - Stars: 914 - Forks: 52

zhang-tao-whu/DVIS_Plus

Language: Python - Size: 442 KB - Last synced at: 11 days ago - Pushed at: 10 months ago - Stars: 116 - Forks: 11

wusize/ovdet

[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection

Language: Python - Size: 4.75 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 181 - Forks: 5

clin1223/VLDet

[ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)

Language: Python - Size: 1.56 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 186 - Forks: 11

FoundationVision/GenerateU

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

Language: Python - Size: 14.4 MB - Last synced at: 14 days ago - Pushed at: 21 days ago - Stars: 167 - Forks: 7

coderonion/awesome-open-world-object-detection

This repository lists some awesome public Open World object detection series projects.

Size: 3.91 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 1

JunweiZheng93/OPS

Official repository for paper "Open Panoramic Segmentation" (OPS) at ECCV 2024

Language: Python - Size: 5.24 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 24 - Forks: 2

jinyanglii/OVTR

🏄 [ICLR 2025] OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer

Language: Python - Size: 28.9 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 39 - Forks: 2

boschresearch/RelationField

[CVPR 2025] RelationField: Relate Anything in Radiance Fields

Language: Python - Size: 99.6 KB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 32 - Forks: 3

CVMI-Lab/PLA

(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

Language: Python - Size: 17.8 MB - Last synced at: 30 days ago - Pushed at: 10 months ago - Stars: 277 - Forks: 11

NVlabs/ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language: Python - Size: 16.4 MB - Last synced at: 22 days ago - Pushed at: 10 months ago - Stars: 888 - Forks: 49

xmed-lab/CLIP_Surgery

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

Language: Jupyter Notebook - Size: 18.9 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 397 - Forks: 25

aminebdj/OpenYOLO3D

[ICLR 2025 (Oral 📢) ] Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up ∼16x speedup compared to the best existing method in literature.

Language: Python - Size: 7.37 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 113 - Forks: 8

Jiahao000/MosaicFusion

[IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

Language: Python - Size: 28.5 MB - Last synced at: 22 days ago - Pushed at: 6 months ago - Stars: 120 - Forks: 3

hustvl/MaskAdapter

[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"

Language: Python - Size: 15.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 50 - Forks: 0

witnessai/Awesome-Open-Vocabulary-Object-Detection

A curated list of papers, datasets and resources pertaining to open vocabulary object detection.

Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 299 - Forks: 19

wusize/CLIPSelf

[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

Language: Python - Size: 32 MB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 183 - Forks: 9

lartpang/OVCamo

(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation

Language: Python - Size: 46.9 KB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 22 - Forks: 1

UVA-Computer-Vision-Lab/ovmono3d

Code for "Open Vocabulary Monocular 3D Object Detection"

Language: Python - Size: 52.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 29 - Forks: 0

yangcaoai/CoDA_NeurIPS2023

Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection

Language: Jupyter Notebook - Size: 71.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 188 - Forks: 17

hovsg/HOV-SG

[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"

Language: Python - Size: 19 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 230 - Forks: 19

Surrey-UP-Lab/RegionSpot

Recognize Any Regions

Language: Python - Size: 2.16 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 122 - Forks: 4

balabooooo/AED

Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown

Language: Python - Size: 36.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 22 - Forks: 0

chenxi52/FrozenSeg

Open-Vocabulary Panoptic Segmentation

Language: Python - Size: 1.11 MB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 18 - Forks: 1

ngthanhtin/owlvit_segment_anything

Combining OwlViT with Segment Anything - Open-vocabulary Detection and Segmentation (Text-conditioned, and Image-conditioned)

Language: Jupyter Notebook - Size: 17.2 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 156 - Forks: 14

sunanhe/MKT

Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".

Language: Python - Size: 1.81 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 119 - Forks: 6

ruohaoguo/ovavss

Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].

Language: Python - Size: 4.97 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 14 - Forks: 2

ucas-vg/Sambor

Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning

Size: 274 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 0

jessemelpolio/AnytimeCL

[ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification

Language: Python - Size: 94.7 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 8 - Forks: 1

ProvenceStar/PartGLEE

[ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Language: Python - Size: 37.5 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 15 - Forks: 0

boschresearch/Open3DSG

[CVPR 2024] Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships

Language: Python - Size: 140 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 43 - Forks: 1

HKUST-LongGroup/Awesome-Open-Vocabulary-Detection-and-Segmentation

Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future

Size: 1.06 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 81 - Forks: 5

VinAIResearch/Open3DIS

Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)

Language: Python - Size: 119 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 51 - Forks: 3

Fsoft-AIC/WAVER

[ICASSP 2024 Oral] WAVER: Writing-Style Agnostic Text-Video Retrieval Via Distilling Vision-Language Models Through Open-Vocabulary Knowledge

Language: Python - Size: 15.5 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ok-robot/ok-robot

An open, modular framework for zero-shot, language conditioned pick-and-drop tasks in arbitrary homes.

Language: Python - Size: 218 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 363 - Forks: 27

ldkong1205/OpenESS

[CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies

Size: 11.4 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 7 - Forks: 0

CVMI-Lab/CoDet

(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

Language: Python - Size: 13 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 84 - Forks: 4

xuanlinli17/large_vlm_distillation_ood

Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)

Language: Python - Size: 3.69 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 41 - Forks: 4

ajzhai/NeRF2Physics

[CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields

Language: Python - Size: 27 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

ArrowLuo/SegCLIP

PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"

Language: Python - Size: 2.48 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 35 - Forks: 1

xmed-lab/FreeSeg

FreeSeg: Free Mask from Interpretable Contrastive Language-Image Pretraining for Semantic Segmentation

Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 0

Related Keywords
open-vocabulary 43 object-detection 12 semantic-segmentation 7 zero-shot 7 deep-learning 6 instance-segmentation 6 3d-scene-understanding 6 pytorch 5 segment-anything 5 vision-and-language 4 clip 4 open-vocabulary-semantic-segmentation 4 computer-vision 4 segmentation 4 open-world 4 vision-language-model 4 open-vocabulary-segmentation 4 transformer 3 panoptic-segmentation 3 open-vocabulary-detection 3 detection 3 3d-scene-graph 2 diffusion-models 2 paper-resource 2 bcai 2 multimodal 2 multi-object-tracking 2 3d-vision 2 zero-shot-learning 2 transfer-learning 2 interpretability 2 open-world-object-detection 2 cvpr2023 2 cvpr2024 2 3d-computer-vision 2 autonomous-driving 2 vision-language-pretraining 2 multi-modal-learning 1 natural-language-understanding 1 owl-vit 1 robot-navigation 1 robot-planning 1 small-object 1 sound-localization 1 auto-labeling 1 audio-visual 1 multimodal-representation-learning 1 vision-language-foundation-model 1 vision-foundation-model 1 stable-diffusion 1 multi-label-classification 1 zero-shot-semantic-segmentation 1 contrastive-learning 1 scene-understanding 1 nerf 1 gpt-4 1 out-of-distribution-generalization 1 out-of-distribution 1 machine-learning 1 few-shot-learning 1 event-camera 1 robotics 1 home-robots 1 writing-style-agnostic 1 text-video-retrieval 1 knowledge-distillation 1 icassp2024 1 3d-point-clouds 1 3d-instance-segmentation 1 video-understanding 1 scene-graph 1 part-segmentation 1 hierarchical-models 1 foundation-models 1 flexible-inference 1 continual-learning 1 anytimecl 1 vision-language 1 video-processing 1 yolo 1 owod 1 gpt4 1 few-shot 1 cvpr 1 chatgpt 1 awesome 1 multimodality 1 mllm 1 multi-modal 1 iclr2023 1 video-semantic-segmentation 1 video-segmentation 1 video-instance-segmentation 1 tpami-2024 1 vlm 1 embodied-ai 1 embodied-agent 1 3d-visual-grounding 1 zero-shot-object-detection 1 real-time 1