GitHub topics: video-retrieval
TIGER-AI-Lab/VLM2Vec
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
Language: Python - Size: 12.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 392 - Forks: 30

lijun2005/ICCV25-HLFormer
[ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.
Language: Python - Size: 8.21 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 34 - Forks: 2

Adamouization/Content-Based-Video-Retrieval-Code
Undergraduate Dissertation: Content-based video retrieval prototype for movies written in Python using OpenCV.
Language: Python - Size: 408 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 16 - Forks: 5

OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language: Python - Size: 53.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,986 - Forks: 120

gimpong/AAAI25-S5VH
The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25).
Language: Python - Size: 2.57 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 2

Arun-George-Zachariah/awesome-video-retrieval-papers
List of resources for video retrieval.
Language: TeX - Size: 25.4 KB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 1

gimpong/ICCV25-HLFormer (fork of lijun2005/ICCV25-HLFormer)
The code for the paper "HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning" (ICCV'25).
Language: Python - Size: 7.48 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

roothch/PreenCut
AI-Powered Video Retrieval & Clipping Tool
Language: Python - Size: 684 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 222 - Forks: 26

willyfh/awesome-video-text-datasets
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
Size: 48.8 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 38 - Forks: 3

foolwood/DRL
[arXiv22] Disentangled Representation Learning for Text-Video Retrieval
Language: Python - Size: 6.04 MB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 96 - Forks: 5

willyfh/msvd-indonesian
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).
Size: 2.55 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

jpthu17/HBI
[CVPR 2023 Highlight & TPAMI] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
Language: Python - Size: 51 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 119 - Forks: 5

albanie/collaborative-experts
Video embeddings for retrieval with natural language queries
Language: Python - Size: 4.26 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 342 - Forks: 55

Vision-CAIR/MiniGPT4-video
Official code for the Goldfish model for long video understanding and MiniGPT4-video for short video understanding
Language: Python - Size: 38.7 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 619 - Forks: 67

j-min/HiREST
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
Language: Python - Size: 3.64 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 101 - Forks: 10

X-PLUG/Youku-mPLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
Language: Python - Size: 15.1 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 297 - Forks: 11

jayleicn/ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
Language: Python - Size: 73.2 KB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 720 - Forks: 86

jayleicn/moment_detr
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
Language: Python - Size: 34.4 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 306 - Forks: 51

X-PLUG/mPLUG-2
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Language: Python - Size: 2.36 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 227 - Forks: 20

jpthu17/EMCL
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Language: Python - Size: 23.9 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 134 - Forks: 10

jpthu17/DiffusionRet
[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Language: Python - Size: 5.36 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 131 - Forks: 7

li-xirong/w2vvpp
W2VV++: A fully deep learning solution for ad-hoc video search
Language: Python - Size: 107 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 29 - Forks: 15

gkordo/s2vs
Authors' official PyTorch implementation of "Self-Supervised Video Similarity Learning" [CVPRW 2023]
Language: Python - Size: 17.2 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 41 - Forks: 2

jayleicn/TVRetrieval
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Language: Python - Size: 52.9 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 157 - Forks: 24

jpthu17/DiCoSA
[IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
Language: Python - Size: 5.56 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 51 - Forks: 2

minjoong507/BM-DETR
[WACV 2025] Official PyTorch code for "Background-aware Moment Detection for Video Moment Retrieval"
Language: Python - Size: 3.07 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 0

TXH-mercury/COSA
[ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Language: Python - Size: 84.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 39 - Forks: 3

trungdangtapcode/Video-Retrieval-System
Video Retrieval System
Language: HTML - Size: 25.6 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

danielchyeh/this-is-my
Official This-Is-My Dataset published in CVPR 2023
Language: Python - Size: 4.89 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 1

minjoong507/MPGN
[EMNLP 2022] PyTorch code for "Modal-specific Pseudo Query Generation for Video Corpus Moment Retrieval"
Language: Python - Size: 73.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

callsys/TextVR
A large Cross-Modal Video Retrieval Dataset with Reading Comprehension
Language: Python - Size: 35.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

mlvlab/MELTR
MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
Language: Python - Size: 1.13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 6

MKLab-ITI/ndvr-dml
Authors' official TensorFlow implementation of "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]
Language: Python - Size: 3.15 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 119 - Forks: 19

MKLab-ITI/visil
Authors' official PyTorch implementation of "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]
Language: Python - Size: 67.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 198 - Forks: 37

mever-team/distill-and-select
Authors' official PyTorch implementation of "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 2022]
Language: Python - Size: 118 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 66 - Forks: 9

Nhathuy1305/BetterDay-Tool
The advanced Video Retrieval Tool developed for the competition in AI Challenge 2023 - Ho Chi Minh City. Seamlessly search, locate, and analyze videos using state-of-the-art AI techniques. Elevate your multimedia experience with our innovative solution.
Language: Java - Size: 86.9 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 3

zchoi/PKOL
[TIP 2022] Official code for the paper "Video Question Answering with Prior Knowledge and Object-sensitive Learning"
Language: Python - Size: 505 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 44 - Forks: 0

Sy-Zhang/MMC-PCFG
Video-aided Unsupervised Grammar Induction, NAACL'21 [Best Long Paper]
Language: Python - Size: 1.46 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 40 - Forks: 4

tsujuifu/pytorch_violet
A PyTorch implementation of VIOLET
Language: Python - Size: 115 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 130 - Forks: 7

tsujuifu/pytorch_empirical-mvm
A PyTorch implementation of EmpiricalMVM
Language: Python - Size: 449 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 2

wjun0830/QD-DETR
Official PyTorch repository for "QD-DETR: Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)
Language: Python - Size: 1.23 GB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 110 - Forks: 5

mrtrieuphong/Badger-TeamX-Retrieval
An official open-source Image/Video retrieval engine, developed by Badger Team X in AI Challenge 2022
Language: Python - Size: 8.59 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

lyuchenyang/Dialogue-to-Video-Retrieval
Code for ECIR 2023 paper "Dialogue-to-Video Retrieval"
Language: Python - Size: 34.2 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 1

swarna97/Deep-See-Crime
An AI-based surveillance system that tracks down suspects using gestures and facial attributes. It uses the UCF Crime Dataset: 128 hours of real-world surveillance video covering 13 realistic anomalies, including fighting, assault, and road accidents.
Language: Python - Size: 168 MB - Last synced at: 10 months ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

subha-ilamathy/SIH_DeepSeeCrime
Signals activity that deviates from normal patterns within a time window. Supports video annotation, video retrieval, and real-time monitoring to identify and track down suspects.
Language: Python - Size: 169 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

martinetoering/ViCC
[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.
Language: Python - Size: 5.08 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 37 - Forks: 8

xwen99/temporal_context_aggregation
Temporal Context Aggregation for Video Retrieval with Contrastive Learning, WACV 2021
Language: Python - Size: 5.21 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 23 - Forks: 5

Tramac/sth-2-sth
A content-based image retrieval system
Language: CSS - Size: 4.74 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 10 - Forks: 0

4ML-platform/ndvr
Near Duplicate Video Retrieval
Language: Python - Size: 265 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 28 - Forks: 1

buraksatar/RoME_video_retrieval
It includes our two recent papers on text-to-video retrieval along with a technical report.
Size: 3.79 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

XinshaoAmosWang/OSM_CAA_WeightedContrastiveLoss
Deep Metric Learning by Online Soft Mining and Class-Aware Attention, AAAI 2019 Oral
Language: Shell - Size: 18.6 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 4

Adamouization/Content-Based-Video-Retrieval-Dissertation
Final Year Undergraduate Dissertation Report written in LaTeX for a content-based video retrieval prototype for movies
Language: TeX - Size: 65.9 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 2

toan01-uet/simple-video-retrieval
Content-based video retrieval for human actions
Language: Python - Size: 49.3 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

Aveek-Saha/VIRALIQ
Code implementation & CLI tool for the paper: "Graph Based Temporal Aggregation for Video Retrieval"
Language: Python - Size: 1010 KB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

KunpengLi1994/PsTuts
PyTorch code for the CVPR'2020 paper "Screencast Tutorial Video Understanding"
Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 1

XFeiF/ComputerVision_PaperNotes
📚 Paper Notes (Computer vision)
Size: 3.17 MB - Last synced at: 9 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0
