An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: multimodal-representation

declare-lab/BBFN

This repository contains the implementation of the paper -- Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment Analysis

Language: Python - Size: 1.25 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 71 - Forks: 14

shamanez/BERT-like-is-All-You-Need

The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion Recognition

Language: Python - Size: 6.73 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 120 - Forks: 11

wanglab-broad/FuseMap

FuseMap: Integrate spatial transcripomics with universal gene, cell, and tissue embeddings.

Language: Jupyter Notebook - Size: 70.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

JunweiLiang/FVTA_MemexQA

Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19

Language: Python - Size: 723 KB - Last synced at: 2 months ago - Pushed at: almost 6 years ago - Stars: 32 - Forks: 15

Bekyilma/MRL_VA_RecSys

Together Yet Apart: Multimodal Representation Learning for Personalised Visual Art Recommendation

Language: Jupyter Notebook - Size: 4.78 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0

GAIR-Lab/IISAN

IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT

Language: Python - Size: 2.04 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 0

usc-sail/mica-deep-mcca

Deep Multiset Canonical Correlation Analysis - An extension of CCA to multiple datasets

Language: Python - Size: 103 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 31 - Forks: 14

PrithivirajDamodaran/vision-language-modelling-series

Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations

Language: Jupyter Notebook - Size: 6.15 MB - Last synced at: about 21 hours ago - Pushed at: almost 3 years ago - Stars: 14 - Forks: 4

usc-sail/mica-multimodal-ads

Segment-level autoencoders for multimodal representation

Language: Python - Size: 226 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 9 - Forks: 1

bryanbocao/open-papernotes

Yet another Ph.D. adventure.

Size: 1010 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 4

kafku/mmeigenwords

Python implementation of the Multimodal Eigenwords (MM-Eigenwords) :snake:

Language: Jupyter Notebook - Size: 6.67 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

gorjanradevski/SMHA

My master thesis: Siamese multi-hop attention for cross-modal retrieval.

Language: Python - Size: 2.76 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 0

dali-does/vse-probing

Code for COLING2020 paper: Probing Multimodal Embeddings for Linguistic Properties.

Language: Python - Size: 37.1 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 9 - Forks: 0

wxjiao/Multimodal-Feature-Extraction

A detailed description on how to extract and align text, audio, and video features at word-level.

Size: 45.9 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 2

harmanpreet93/user_modelling

User modelling using Multi-modal fusion

Language: Python - Size: 21.3 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

Damorgal/Multimodal-Research-experiments

All experiments were done to classify multimodal data.

Size: 161 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

OlehOnyshchak/pyWikiMM

Collects a multimodal dataset of Wikipedia articles and their images

Language: Python - Size: 7.78 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

pokarats/LAP-final-project

Multimodal Bi-Transformers (MMBT) in Biomedical Text/Image Classification

Language: Jupyter Notebook - Size: 156 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

arpytanshu/HUSE-PyTorch

PyTorch Implementation of HUSE: Hierarchical Universal Semantic Embeddings ( https://arxiv.org/pdf/1911.05978.pdf )

Language: Python - Size: 1.65 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

ijcruic/Gowers-Method

Gowers Method for finding latent networks of multi-modal data

Language: Python - Size: 43 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

Related Keywords
multimodal-representation 20 multimodal-deep-learning 7 deep-learning 5 multimodal-learning 4 multimodal-datasets 3 multimodal 3 vision-and-language 2 bert 2 attention-mechanism 2 transfer-learning 2 multimodal-emotion-recognition 2 representation-learning 2 huggingface-transformers 2 data-collection 1 data-processing 1 database 1 data-cleaning 1 transformer-models 1 personality-trait 1 vse 1 multimodal-data 1 nlp 1 machine-learning 1 embeddings 1 tensorflow 1 image-text-search 1 cross-modal-retrieval 1 word-embedding 1 python3 1 eigenwords 1 sensor-fusion 1 paper-arxiv 1 paper 1 unsupervised-learning 1 network-analysis 1 universal-semantic-embedding 1 pytorch-implementation 1 transformer 1 text-classification 1 sparse-data-learning 1 multimodal-models 1 mmbt-model 1 image-classification 1 biomedical-image-processing 1 attention-visualization 1 wikipedia-viewer 1 wikipedia-search 1 wikipedia-scraper 1 wikipedia-page 1 wikipedia-entries 1 wikipedia-dump 1 wikipedia-corpus 1 wikipedia-bot 1 wikipedia-api 1 wikipedia 1 multimodality 1 personalization 1 painting 1 lda 1 contrastive-learning 1 clip 1 blip 1 visual-question-answering 1 memexqa-dataset 1 memex-question-answering 1 variational-autoencoder 1 spatial-transcriptomics 1 spatial-brain-atlas 1 graph-neural-networks 1 foundation-models 1 contextualized-representation 1 speech-emotion-recognition 1 sentiment-analysis 1 self-supervised-learning 1 pretrained-models 1 fine-tuning 1 bert-model 1 multimodal-sentiment-analysis 1 mutil-modal 1 multimodal-correlation 1 multimodal-association 1 segment-level-autoencoders 1 autoencoders 1 audio-visual 1 advertisements 1 vision-and-language-pre-training 1 vision-and-language-navigation 1 multimodal-interactions 1 multiset-cca 1 deep-representation-learning 1 canonical-correlation-analysis 1 sequential-recommendation 1 peft 1 multimodal-recommendation 1 memory-efficient 1 iisan 1 efficiency-analysis 1 visual-arts 1 resnet-50 1 recommender-systems 1