An open API service providing repository metadata for many open source software ecosystems.

Topic: "text-video-retrieval"

wjun0830/QD-DETR

Official PyTorch repository for "QD-DETR: Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)

Language: Python - Size: 1.23 GB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 110 - Forks: 5

wjun0830/CGDETR

Official PyTorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"

Language: Python - Size: 23.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 105 - Forks: 11

Jiamian-Wang/T-MASS-text-video-retrieval

Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"

Language: Python - Size: 6.42 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 36 - Forks: 1

zchoi/GLSCL

[TIP25] Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"

Language: Python - Size: 5.41 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 13 - Forks: 0

zzezze/NeighborRetr

Official implementation of "NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval (CVPR 2025)"

Language: Python - Size: 4.71 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 12 - Forks: 1

Blacksujit/100X-Engineers-GenAI-Hackathon-Submission

Dataviz AI is an AI-powered web application that lets users generate animated infographic videos from input data and files. This MVP uses generative AI models for video content and incorporates advanced natural language processing (NLP) techniques, including LangChain and Stable Diffusion techniques, to analyze the input and create visual impact.

Language: Jupyter Notebook - Size: 154 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 4 - Forks: 1

gimpong/ICCV25-HLFormer (fork of lijun2005/ICCV25-HLFormer)

The code for the paper "HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning" (ICCV'25).

Language: Python - Size: 7.48 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Jiamian-Wang/DITS-text-video-retrieval

Official implementation of "Diffusion-Inspired Truncated Sampler for Text-Video Retrieval (NeurIPS 2024)"

Size: 2.93 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

saleemhamo/coarse-to-fine-dataset

Coarse-to-Fine Grained Text-based Video-moment Retrieval pipeline utilizing T-MASS and MESM models for efficient multi-stage text-video alignment.

Language: Python - Size: 14.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Fsoft-AIC/WAVER

[ICASSP 2024 Oral] WAVER: Writing-Style Agnostic Text-Video Retrieval Via Distilling Vision-Language Models Through Open-Vocabulary Knowledge

Language: Python - Size: 15.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0