GitHub topics: mutli-modal
VachanVY/Transfusion.torch
PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
Language: Python - Size: 2.07 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 21 - Forks: 5

krantiparida/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
Size: 58.6 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 715 - Forks: 68

atlas-2192/Multi-AI-Chat-APP
Language: Python - Size: 722 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 16 - Forks: 2

LittlePey/SFD
Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion (CVPR 2022, Oral)
Language: Python - Size: 1.78 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 271 - Forks: 39

Rajeeb321123/Daily_Tasks
Daily tasks completed during an AI internship at princelab
Language: Jupyter Notebook - Size: 6.34 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

zjukg/MANS
[Paper][IJCNN2023] Modality-Aware Negative Sampling for Multi-modal Knowledge Graph Embedding
Language: Python - Size: 7.64 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

youngbin-ro/audiotext-transformer
Multimodal Transformer for Korean Sentiment Analysis with Audio and Text Features
Language: Python - Size: 804 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 20 - Forks: 7
