An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: multi-modal-fusion

FIVEYOUNGWOO/IEEE-802.11n-CSI-Camera-Synchronization-Toolkit

IEEE 802.11n CSI and camera synchronization toolkit.

Language: C - Size: 229 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 12 - Forks: 4

SMARTlab-Purdue/Husformer

This repository contains the source code for our paper: "Husformer: A Multi-Modal Transformer for Multi-Modal Human State Recognition". For more details, please refer to our paper at https://arxiv.org/abs/2209.15182.

Language: Python - Size: 2.94 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 91 - Forks: 26

zjukg/KG-MM-Survey

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

Size: 82.3 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 401 - Forks: 19

kyegomez/MoE-Mamba

Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Zeta

Language: Python - Size: 2.17 MB - Last synced at: 1 day ago - Pushed at: 14 days ago - Stars: 102 - Forks: 5

zjukg/MoMoK

[Paper][ICLR 2025] Multiple Heads are Better than One: Mixture of Modality Knowledge Experts for Entity Representation Learning

Language: Python - Size: 6.99 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 24 - Forks: 2

zjukg/MyGO

[Paper][AAAI 2025] (MyGO)Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation

Language: Python - Size: 90.8 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 242 - Forks: 4

Event-AHU/VTF_PAR

[CVPR-2023 Workshop@NFVLR] Official PyTorch implementation of Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition

Language: Python - Size: 2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 23 - Forks: 2

kyegomez/MegaVIT

The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"

Language: Python - Size: 211 KB - Last synced at: 1 day ago - Pushed at: 16 days ago - Stars: 28 - Forks: 1

kyegomez/the-compiler

Seed, Code, Harvest: Grow Your Own App with Tree of Thoughts!

Language: Python - Size: 10.4 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 145 - Forks: 17

zylbuaa/TFormer

The official implementation of "TFormer: A throughout fusion transformer for multi-modal skin lesion diagnosis"

Language: Python - Size: 521 KB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 14 - Forks: 2

BubbleWang-wly/EIEA

Explicit-Implicit Entity Alignment Method in Multi-modal Knowledge Graphs

Language: Python - Size: 755 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

SeyedMuhammadHosseinMousavi/Multi-Modal-Fusion

Early Fusion, Late Fusion, and Hybrid Fusion

Language: Python - Size: 3.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 1

SeyedMuhammadHosseinMousavi/Comprehensive-Machine-Learning-Techniques-Metrics-Classifiers-and-Evaluation

Comprehensive Machine Learning Techniques: Metrics, Classifiers, and Evaluation

Language: Python - Size: 14.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

zjukg/NATIVE

[Paper][SIGIR 2024] NativE: Multi-modal Knowledge Graph Completion in the Wild

Language: Python - Size: 10.2 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 18 - Forks: 1

Cimy-wang/AM3Net_Multimodal_Data_Fusion

Code for J. Wang, J. Li, Y. Shi, J. Lai and X. Tan, "AM3Net: Adaptive Mutual-learning-based Multimodal Data Fusion Network," in IEEE TCSVT, 2022. We conducted the experiments on the hyperspectral and lidar dataset(Houston and Trento) and multispectral and synthetic aperture radar data (grss-dfc-2007 datasets).

Language: Python - Size: 40.1 MB - Last synced at: 9 months ago - Pushed at: about 2 years ago - Stars: 31 - Forks: 3

GuanRunwei/Achelous

Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave Radar

Language: Python - Size: 67.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 135 - Forks: 7

ecovision-uzh/sat-sinr

[ISPRS 2024] Sat-SINR: High-Resolution Species Distribution Models through Satellite Imagery

Language: Python - Size: 2.48 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 1

tudelft-iv/UniBEV

[IVS'24] UniBEV: the official implementation of UniBEV

Language: Python - Size: 12.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 10 - Forks: 1

yuntaoshou/pami

Size: 42 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

wangxiao5791509/VisEvent_SOT_Benchmark

[IEEE TCYB 2023] The first large-scale tracking dataset by fusing RGB and Event cameras.

Language: Python - Size: 19.9 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 105 - Forks: 8

gaosaroma/MM-AHGNN

Multi-Modal Attention-based Hierarchical Graph Neural Network for Object Interaction Recommendation in Internet of Things (IoT)

Language: Python - Size: 14.7 MB - Last synced at: 8 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

zjukg/AdaMF-MAT

[Paper][LREC-COLING 2024] Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion

Language: Python - Size: 1.91 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 1

nhattruongpham/SER-Fuse

SER-Fuse: An Emotion Recognition Application Utilizing Multi-Modal, Multi-Lingual, and Multi-Feature Fusion

Language: Jupyter Notebook - Size: 819 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

chenzpstar/Multi-Modal-Image-Fusion

Training for multi-modal image fusion with PyTorch.

Language: Python - Size: 10.2 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 3

HackerHyper/ACMVH

Adaptive Confidence Multi-View Hashing

Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 0

kdhht2334/Hidden_Emotion_Detection_using_MM_Signals

[CHI2021] Hidden emotion detection using multi-modal signals

Language: Python - Size: 48 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

Related Keywords
multi-modal-fusion 26 multi-modal-learning 6 knowledge-graph 5 multi-modal-knowledge-graph 5 multi-modality 4 multi-modal 4 knowledge-graph-completion 4 emotion-recognition 4 transformer 3 deep-learning 3 entity-alignment 2 gsr 2 adversarial-learning 2 pytorch 2 imbalanced-learning 2 contrastive-learning 2 knowledge-graph-embeddings 2 object-detection 1 multi-task-learning 1 jaccard 1 object-tracking 1 panoptic-perception 1 point-cloud-segmentation 1 semantic-segmentation 1 earth-observation 1 geo-spatial 1 sentinel-2 1 species-distribution-modelling 1 3d-object-detection 1 leave-one-out-cross-validation 1 machine-learning 1 metrics 1 physiological-signals 1 plot 1 pre-processing 1 seyed-muhammad-hossein-mousavi 1 shape 1 t-sne 1 universita-della-svizzera-italiana 1 violinplot 1 incomplete-data 1 hyperspectral-image-classification 1 hyperspectral-lidar-fusion 1 hyperspectral-sar-fusion 1 4d-mmwave-radar 1 object-interaction-recommendation 1 object-social-network 1 pytorch-geometric 1 imbalanced-data 1 negative-sampling 1 multi-feature-extraction 1 multi-lingual 1 image-fusion 1 infrared 1 polarization 1 unsupervised-learning 1 multi-view-learning 1 eeg-analysis 1 hidden-emotions 1 human-computer-interaction 1 valence-arousal 1 autonomous-driving 1 intelligent-vehicles 1 robustness 1 python 1 dvs 1 dynamic-vision-sensors 1 event 1 frame-event-tracking 1 neuromorphic 1 neuromorphic-vision 1 pengchenglab 1 single-object-tracking 1 spike 1 visevent 1 graph-embedding 1 hierarchical-graph-neural-network 1 feature-extraction 1 artificial-intelligence 1 visual-text-fusion 1 video-based-attribute-recognition 1 pedestrian-attribute-recognition 1 tokenization 1 mygo 1 mutual-information 1 mixture-of-experts 1 swarms 1 moe 1 ml 1 ai 1 visual-question-answering 1 surveys 1 survey 1 paper-list 1 large-language-models 1 information-extraction 1 image-generation 1 image-classification 1 entity-linking 1 cross-modal-retrieval 1