GitHub topics: cross-modality

Repositories

heitorrapela/HalluciDet

[WACV2024] HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information (Accepted at WACV 2024 and LatinX@CVPR2024 Extended Abstract)

Language: Python - Size: 11.6 MB - Last synced at: about 17 hours ago - Pushed at: about 17 hours ago - Stars: 21 - Forks: 0

THUDM/CogVLM

A state-of-the-art open visual language model | Multimodal pre-trained model

Language: Python - Size: 25.8 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 6,619 - Forks: 430

chandan1145/Cog

Tiny HTTP framework built on node:http

Language: TypeScript - Size: 177 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

jina-ai/clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language: Python - Size: 27.4 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 12,701 - Forks: 2,077

AdityaLab/MM4TSA

A professional list of Multi-Modalities For Time Series Analysis (MM4TSA) papers and resources.

Size: 457 KB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 52 - Forks: 2

movienet/movienet-tools

Tools for movie and video research

Language: C++ - Size: 6.56 MB - Last synced at: 5 days ago - Pushed at: about 3 years ago - Stars: 290 - Forks: 37

haofanwang/awesome-conditional-content-generation

Up-to-date resources for conditional content generation, including human motion generation and image or video generation and editing.

Size: 129 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 275 - Forks: 27

bismex/Awesome-cross-modality-person-re-identification

Awesome Cross-modality Person Re-identification

Size: 43.9 KB - Last synced at: 16 days ago - Pushed at: about 3 years ago - Stars: 147 - Forks: 32

M-3LAB/awesome-multimodal-brain-image-systhesis

Size: 20.5 KB - Last synced at: 24 days ago - Pushed at: over 2 years ago - Stars: 41 - Forks: 7

zjzsliyang/CrossLeak

Code for the WWW'20 paper "Nowhere to Hide: Cross-modal Identity Leakage between Biometrics and Devices"

Language: Python - Size: 47.9 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 23 - Forks: 5

sail-sg/ptp

[CVPR2023] The code for "Position-guided Text Prompt for Vision-Language Pre-training"

Language: Python - Size: 2.37 MB - Last synced at: 19 days ago - Pushed at: about 2 years ago - Stars: 152 - Forks: 4

KimMeen/Time-LLM

[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

Language: Python - Size: 1.06 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 2,033 - Forks: 353

layumi/Image-Text-Embedding

TOMM2020 Dual-Path Convolutional Image-Text Embedding with Instance Loss :feet: https://arxiv.org/abs/1711.05535

Language: MATLAB - Size: 6.02 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 292 - Forks: 73

WinfredGe/T2S

[IJCAI 2025] Official implementation of "T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models"

Language: Python - Size: 37.1 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 10 - Forks: 1

hangzhaomit/Sound-of-Pixels

Codebase for ECCV18 "The Sound of Pixels"

Language: Python - Size: 1.24 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 380 - Forks: 74

Event-AHU/EventVOT_Benchmark

[CVPR-2024] The First High Definition (HD) Event based Visual Object Tracking Benchmark Dataset

Language: Python - Size: 41.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 104 - Forks: 5

JDAI-CV/CM-NAS

CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification (ICCV2021)

Language: Python - Size: 33.2 KB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 48 - Forks: 13

chenjingong/DN-ReID

[CVPR2024] Day-Night Cross-domain Vehicle Re-identification

Language: Python - Size: 9.46 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 21 - Forks: 1

mx-mark/SPMNet

Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)

Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

BEAM-Labs/CrossBind

Official PyTorch implementation of CrossBind: Collaborative Cross-Modal Identification of Protein Nucleic-Acid-Binding Residues.

Language: Python - Size: 7.79 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

llcing/Cross-modal-hashing-SRLCH

Code for our work SRLCH

Language: MATLAB - Size: 1.02 MB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 3

GuiyuZhao/VRHCF

[ICME 2024] VRHCF: Cross-Source Point Cloud Registration via Voxel Representation and Hierarchical Correspondence Filtering

Language: Python - Size: 482 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

ZYK100/LLCM

[CVPR 2023] Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identification

Language: Python - Size: 4.99 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 66 - Forks: 8

AnjanDutta/sem-pcyc

PyTorch implementation of the paper "Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval", CVPR 2019.

Language: Python - Size: 23 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 111 - Forks: 23

JacobYuan7/OCN-HOI-Benchmark

[AAAI 2022] Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics.

Language: Python - Size: 1.33 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 13 - Forks: 1

catalina17/VideoNavQA

An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)

Language: Python - Size: 5.17 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 23 - Forks: 1

rhgao/co-separation

Co-Separating Sounds of Visual Objects (ICCV 2019)

Language: Python - Size: 465 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 78 - Forks: 24

ZYK100/MMN

PyTorch code for "Towards a Unified Middle Modality Learning for Visible-Infrared Person Re-Identification"

Language: Python - Size: 45.9 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 3

Mithunjha/EarEEG_KnowledgeDistillation

Official implementation of "A Knowledge Distillation Framework for Enhancing Ear-EEG based Sleep Staging with Scalp-EEG Data"

Language: Jupyter Notebook - Size: 174 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 2

mangye16/Visible-Thermal-Person-Re-Identification

Demo code for visible thermal (cross-modality) person re-identification

Language: Python - Size: 195 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 87 - Forks: 18

Related Keywords

cross-modality (30), deep-learning (8), time-series (3), visible-infrared (3), multi-modal (3), language-model (3), multimodal-time-series (3), person-reidentification (3), awesome-list (3), machine-learning (3), multimodal (2), large-language-models (2), awesome (2), cross-modal-retrieval (2), re-identification (2), low-light (2), vireid (2), survey (2), dataset (2), computer-vision (2), video-understanding (2), person-re-identification (2), sound-separation (2), re-id (2), reid (2), time-series-analysis (2), cross-modal-learning (2), multimodal-deep-learning (2), cycle-gan (1), video (1), nas (1), cm-nas (1), visual-tracking (1), visual-object-tracking (1), single-object-tracking (1), rgb-event (1), high-definition (1), event-based-tracking (1), benchmark-dataset (1), videonavqa (1), visual-reasoning (1), vqa (1), audio-visual-learning (1), ear-eeg (1), self-supervised-learning (1), time-series-generation (1), visual-semantic (1), matlab (1), matconvnet (1), language-retrieval (1), image-search (1), image-retrieval (1), bidirectional-retrieval (1), time-series-forecasting (1), time-series-forecast (1), eeg-signals-processing (1), prompt-tuning (1), knowledge-distillation (1), sleep-research (1), sleep-staging (1), t-sne (1), llcm (1), cvpr2023 (1), point-cloud-registration (1), lidar (1), generative-model (1), cross-source (1), retrieval (1), hashing (1), protein-rna-interactions (1), protein-dna-interactions (1), visual-to-sound (1), visual-audio (1), vas (1), sketch-based-image-retrieval (1), zero-shot-learning (1), synchronization (1), audioset (1), detection (1), detr (1), audio-generation (1), end-to-end-pipeline (1), human-object-interaction (1), benchmark (1), conditioning (1), deep-neural-networks (1), embodied (1), navigation (1), cvpr2024 (1), question-answering (1), neural-architecture-search (1), detetection (1), docker (1), elixir-phoenix (1), graph-classification (1), graph-database (1), graph-neural-networks (1), link-prediction (1), slack (1), transformer (1)
