GitHub topics: mutli-modal

PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Language: Python - Size: 2.07 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 21 - Forks: 5

A curated list of different papers and datasets in various areas of audio-visual processing

Size: 58.6 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 715 - Forks: 68

Language: Python - Size: 722 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 16 - Forks: 2

Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion (CVPR 2022, Oral)

Language: Python - Size: 1.78 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 271 - Forks: 39

During AI internship at princelab

Language: Jupyter Notebook - Size: 6.34 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

[Paper][IJCNN2023] Modality-Aware Negative Sampling for Multi-modal Knowledge Graph Embedding

Language: Python - Size: 7.64 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

Multimodal Transformer for Korean Sentiment Analysis with Audio and Text Features

Language: Python - Size: 804 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 20 - Forks: 7

Related Keywords

ecosyste.ms