Topic: "visual-semantic"
aimagelab/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
Language: Python - Size: 7.07 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 531 - Forks: 135

kuanghuei/SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Language: Python - Size: 34.2 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 490 - Forks: 106

layumi/Image-Text-Embedding
TOMM2020 Dual-Path Convolutional Image-Text Embedding with Instance Loss :feet: https://arxiv.org/abs/1711.05535
Language: MATLAB - Size: 6.02 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 290 - Forks: 73

aimagelab/show-control-and-tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Language: Python - Size: 1.71 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 281 - Forks: 61

woodfrog/vse_infty
Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021
Language: Python - Size: 3.91 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 136 - Forks: 18

oravus/lostX
(RSS 2018) LoST - Visual Place Recognition using Visual Semantics for Opposite Viewpoints across Day and Night
Language: Python - Size: 310 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 61 - Forks: 11

aimagelab/speaksee
PyTorch library for Visual-Semantic tasks
Language: Python - Size: 68.7 MB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 28 - Forks: 8

hthoai/image-text-matching
Image-Text Matching Model Zoo
Language: Python - Size: 12.7 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 2
