image-text-matching | Topic | Ecosyste.ms: Repos

Topic: "image-text-matching"

NVlabs/GroupViT

Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.

Language: Python - Size: 8.04 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 761 - Forks: 54

Paranioar/Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

Size: 369 KB - Last synced at: 8 days ago - Pushed at: 8 months ago - Stars: 424 - Forks: 47

Paranioar/SGRAF

[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”

Language: Python - Size: 794 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 197 - Forks: 37

woodfrog/vse_infty

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021

Language: Python - Size: 3.91 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 136 - Forks: 18

kywen1119/DSRAN

Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.

Language: Python - Size: 30.7 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 68 - Forks: 12

naver-ai/eccv-caption

Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)

Language: Python - Size: 771 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 2

jaisidhsingh/LoRA-CLIP

Easy wrapper for inserting LoRA layers in CLIP.

Language: Python - Size: 60.5 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 35 - Forks: 2

weiyx16/CLIP-pytorch 📦

A non-JIT version implementation / replication of CLIP of OpenAI in pytorch

Language: Python - Size: 233 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 34 - Forks: 4

jaisidhsingh/CoN-CLIP

Implementation of the "Learn No to Say Yes Better" paper.

Language: Python - Size: 4.36 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 31 - Forks: 2

slavabarkov/tidy

Offline semantic Text-to-Image and Image-to-Image search on Android powered by quantized state-of-the-art vision-language pretrained CLIP model and ONNX Runtime inference engine

Language: Kotlin - Size: 99.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 27 - Forks: 5

eric-ai-lab/ComCLIP

Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"

Language: Python - Size: 7.86 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 0

Paranioar/RCAR

[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”

Language: Python - Size: 1.72 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 2

MartinYuanNJU/SEMScene

Code implementation of paper "SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval" (ACM TOMM 2024).

Language: Python - Size: 36.6 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 20 - Forks: 0

alipay/PC2-NoiseofWeb

Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text matching/retrieval models.

Language: Python - Size: 13.6 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 12 - Forks: 1

zabir-nabil/bangla-image-search

A dead-simple image search / retrieval and image-text matching system for Bangla using CLIP

Language: Python - Size: 234 KB - Last synced at: 4 days ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 4

zabir-nabil/bangla-CLIP

CLIP (Contrastive Language–Image Pre-training) for Bangla.

Language: Python - Size: 1.27 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 3

nhtlongcs/AIC2022-VER

Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding

Language: Python - Size: 12.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 1

cuiaiyu/Text-to-Image-ReIdentification

Unofficial code of paper "Improving description-based person re-identification by multi-granularity image-text alignment." by Niu et al. (partially implemented)

Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3

Paranioar/DBL

[TIP2024] The code of “Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching”

Language: Python - Size: 783 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

kaylode/tern

Cross-modal Retrieval using Transformer Encoder Reasoning Networks (TERN). With use of Metric Learning and FAISS for fast similarity search on GPU

Language: Jupyter Notebook - Size: 7.23 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1

Paranioar/GSSF

[TIP2024] The code of "GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning"

Size: 5.86 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

marialymperaiou/knowledge-enhanced-multimodal-learning

A list of research papers on knowledge-enhanced multimodal learning

Size: 20.5 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

Paranioar/Awesome_Image_Text_Retrieval_Benchmark

The Unified Code of Image-Text Retrieval for Further Exploration.

Language: Python - Size: 41 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

mrzjy/GenshinCLIP

A simple open-sourced SigLIP model finetuned on Genshin Impact's image-text pairs.

Size: 1.06 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

basic-go-ahead/wikipedia-image-caption-matching

The 3rd place solution code for the Wikipedia - Image/Caption Matching Competition on Kaggle

Language: Jupyter Notebook - Size: 1.67 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

hthoai/image-text-matching

Image-Text Matching Model Zoo

Language: Python - Size: 12.7 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 2

shayan55579/CMSL-MLP-ImageText-Matching

A novel image-text matching model using Cross-Modal Space Learning with MLP aggregation, designed to bridge the semantic gap between images and texts for improved recall and matching efficiency.

Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

gaurav104/Image-Text-Matching

Language: Python - Size: 61.5 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Cbhihe/NLP_clip-bleu-meteor

Python Implementation of lexical vector embedding similarity scoring, zero-shot classification of images and n-gram based scoring to compare textual summaries

Language: Jupyter Notebook - Size: 4.81 MB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos