An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: video-retrieval

TIGER-AI-Lab/VLM2Vec

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]

Language: Python - Size: 12.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 392 - Forks: 30

lijun2005/ICCV25-HLFormer

[ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.

Language: Python - Size: 8.21 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 34 - Forks: 2

Adamouization/Content-Based-Video-Retrieval-Code

Undergraduate Dissertation: Content-based video retrieval prototype for movies written in Python using OpenCV.

Language: Python - Size: 408 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 16 - Forks: 5

OpenGVLab/InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language: Python - Size: 53.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,986 - Forks: 120

gimpong/AAAI25-S5VH

The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25).

Language: Python - Size: 2.57 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 2

Arun-George-Zachariah/awesome-video-retrieval-papers

List of resources for video retrieval.

Language: TeX - Size: 25.4 KB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 1

gimpong/ICCV25-HLFormer Fork of lijun2005/ICCV25-HLFormer

The code for the paper "HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning" (ICCV'25).

Language: Python - Size: 7.48 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

roothch/PreenCut

AI-Powered Video Retrieval & Clipping Tool

Language: Python - Size: 684 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 222 - Forks: 26

willyfh/awesome-video-text-datasets

A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.

Size: 48.8 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 38 - Forks: 3

foolwood/DRL

[arXiv22] Disentangled Representation Learning for Text-Video Retrieval

Language: Python - Size: 6.04 MB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 96 - Forks: 5

willyfh/msvd-indonesian

MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).

Size: 2.55 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

jpthu17/HBI

[CVPR 2023 Highlight & TPAMI] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

Language: Python - Size: 51 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 119 - Forks: 5

albanie/collaborative-experts

Video embeddings for retrieval with natural language queries

Language: Python - Size: 4.26 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 342 - Forks: 55

Vision-CAIR/MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Language: Python - Size: 38.7 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 619 - Forks: 67

j-min/HiREST

Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)

Language: Python - Size: 3.64 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 101 - Forks: 10

X-PLUG/Youku-mPLUG

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

Language: Python - Size: 15.1 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 297 - Forks: 11

jayleicn/ClipBERT

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language: Python - Size: 73.2 KB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 720 - Forks: 86

jayleicn/moment_detr

[NeurIPS 2021] Moment-DETR code and QVHighlights dataset

Language: Python - Size: 34.4 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 306 - Forks: 51

X-PLUG/mPLUG-2

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)

Language: Python - Size: 2.36 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 227 - Forks: 20

jpthu17/EMCL

[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

Language: Python - Size: 23.9 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 134 - Forks: 10

jpthu17/DiffusionRet

[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

Language: Python - Size: 5.36 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 131 - Forks: 7

li-xirong/w2vvpp

W2VV++: A fully deep learning solution for ad-hoc video search

Language: Python - Size: 107 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 29 - Forks: 15

gkordo/s2vs

Authors official PyTorch implementation of the "Self-Supervised Video Similarity Learning" [CVPRW 2023]

Language: Python - Size: 17.2 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 41 - Forks: 2

jayleicn/TVRetrieval

[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

Language: Python - Size: 52.9 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 157 - Forks: 24

jpthu17/DiCoSA

[IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment

Language: Python - Size: 5.56 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 51 - Forks: 2

minjoong507/BM-DETR

[WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"

Language: Python - Size: 3.07 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 0

TXH-mercury/COSA

[ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

Language: Python - Size: 84.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 39 - Forks: 3

trungdangtapcode/Video-Retrieval-System

Video Retrieval System

Language: HTML - Size: 25.6 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

danielchyeh/this-is-my

Official This-Is-My Dataset published in CVPR 2023

Language: Python - Size: 4.89 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 1

minjoong507/MPGN

[EMNLP 2022] Pytorch code for "Modal-specific Pseudo Query Generation for Video Corpus Moment Retrieval"

Language: Python - Size: 73.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

callsys/TextVR

A large Cross-Modal Video Retrieval Dataset with Reading Comprehension

Language: Python - Size: 35.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

mlvlab/MELTR

MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)

Language: Python - Size: 1.13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 6

MKLab-ITI/ndvr-dml

Authors official Tensorflow implementation of the "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]

Language: Python - Size: 3.15 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 119 - Forks: 19

MKLab-ITI/visil

Authors official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]

Language: Python - Size: 67.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 198 - Forks: 37

mever-team/distill-and-select

Authors official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 2022]

Language: Python - Size: 118 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 66 - Forks: 9

Nhathuy1305/BetterDay-Tool

The advanced Video Retrieval Tool developed for the competition in AI Challenge 2023 - Ho Chi Minh City. Seamlessly search, locate, and analyze videos using state-of-the-art AI techniques. Elevate your multimedia experience with our innovative solution.

Language: Java - Size: 86.9 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 3

zchoi/PKOL

[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”

Language: Python - Size: 505 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 44 - Forks: 0

Sy-Zhang/MMC-PCFG

Video-aided Unsupervised Grammar Induction, NAACL‘21 [best long paper]

Language: Python - Size: 1.46 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 40 - Forks: 4

tsujuifu/pytorch_violet

A PyTorch implementation of VIOLET

Language: Python - Size: 115 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 130 - Forks: 7

tsujuifu/pytorch_empirical-mvm

A PyTorch implementation of EmpiricalMVM

Language: Python - Size: 449 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 2

wjun0830/QD-DETR

Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)

Language: Python - Size: 1.23 GB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 110 - Forks: 5

mrtrieuphong/Badger-TeamX-Retrieval

An official open-source Image/Video retrieval engine, developed by Badger Team X in AI Challenge 2022

Language: Python - Size: 8.59 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

lyuchenyang/Dialogue-to-Video-Retrieval

Code for ECIR 2023 paper "Dialogue-to-Video Retrieval"

Language: Python - Size: 34.2 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 1

swarna97/Deep-See-Crime

An AI based surveillance system to track down suspects using gestures and facial attributes using UCF Crime Dataset 128 hours long real-world surveillance videos 13 realistic anomalies includes fighting, assault, road accidents.

Language: Python - Size: 168 MB - Last synced at: 10 months ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

subha-ilamathy/SIH_DeepSeeCrime

To signal an activity that deviates normal patterns with time window. Video annotation, Video retrieval, and Real-time monitoring. Identify and track down the suspects.

Language: Python - Size: 169 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

martinetoering/ViCC

[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.

Language: Python - Size: 5.08 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 37 - Forks: 8

xwen99/temporal_context_aggregation

Temporal Context Aggregation for Video Retrieval with Contrastive Learning, WACV 2021

Language: Python - Size: 5.21 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 23 - Forks: 5

Tramac/sth-2-sth

一个基于内容的图像检索系统

Language: CSS - Size: 4.74 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 10 - Forks: 0

4ML-platform/ndvr

Near Duplicate Video Retrieval

Language: Python - Size: 265 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 28 - Forks: 1

buraksatar/RoME_video_retrieval

It includes our two recent papers on text-to-video retrieval along with a technical report.

Size: 3.79 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

XinshaoAmosWang/OSM_CAA_WeightedContrastiveLoss

Deep Metric Learning by Online Soft Mining and Class-Aware Attention, AAAI 2019 Oral

Language: Shell - Size: 18.6 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 4

Adamouization/Content-Based-Video-Retrieval-Dissertation

Final Year Undergraduate Dissertation Report written in LaTeX for a content-based video retrieval prototype for movies

Language: TeX - Size: 65.9 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 2

toan01-uet/simple-video-retrieval

content-based video retrieval for human actions

Language: Python - Size: 49.3 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

Aveek-Saha/VIRALIQ

Code implementation & CLI tool for the paper: "Graph Based Temporal Aggregation for Video Retrieval"

Language: Python - Size: 1010 KB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

KunpengLi1994/PsTuts

PyTorch code for the CVPR'2020 paper "Screencast Tutorial Video Understanding"

Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 1

XFeiF/ComputerVision_PaperNotes

📚 Paper Notes (Computer vision)

Size: 3.17 MB - Last synced at: 9 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Related Keywords
video-retrieval 56 video-question-answering 11 video-captioning 7 pytorch 6 cross-modal-retrieval 6 contrastive-learning 5 video-search 5 deep-learning 5 vision-and-language 5 ndvr 5 self-supervised-learning 4 video 4 duplicate-videos 4 image-retrieval 4 multimodal 4 computer-vision 4 benchmark 3 dataset 3 video-understanding 3 clip 3 representation-learning 3 action-recognition 3 fivr 3 near-duplicate-video-retrieval 3 video-similarity-learning 3 video-similarity-search 3 video-representation-learning 3 deep-metric-learning 2 iccv 2 text-video-retrieval 2 video-recognition 2 multi-modal 2 cvpr2023 2 video-description 2 video-text 2 video-to-text 2 video-grounding 2 machine-learning 2 vision-language 2 multimodal-learning 2 cvpr 2 moment-retrieval 2 mllm 2 multimodal-pretraining 2 pre-training 2 vqa 2 near-duplicates 2 foundation-models 2 video-dataset 2 content-based-video-retrieval 2 iccv2025 2 cbvr 2 video-text-retrieval 2 video-clip 2 pyqt5-desktop-application 1 pyqt5-gui 1 clip-model 1 badger-teamx 1 search-engine 1 video-browser-showdown 1 ai-challenges 1 video-summarization 1 video-highlight-detection 1 mmeb 1 detection-transformer 1 rag 1 visual-document-retrieval 1 grammar-induction 1 latex 1 vision-language-pretraining 1 backend 1 frontend 1 retrieval 1 personalization 1 text 1 partial-order-alignment 1 meta-learning 1 lorentz-self-attention 1 dml 1 hyperbolic-learning 1 vlm 1 knowledge-distillation 1 ai 1 java 1 reactjs 1 weaviate 1 pytorch-implementation 1 pattern-matching 1 extract-keys-frame 1 image2vec 1 katna 1 mobile-nets 1 graphsage 1 msr-vtt 1 resnet 1 temporal-cluster 1 video-embedding 1 screencast-tutorials 1 cv 1 eccv 1