GitHub topics: mutli-modal

VachanVY/Transfusion.torch

PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Language: Python - Size: 2.07 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 21 - Forks: 5

krantiparida/awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

Size: 58.6 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 715 - Forks: 68

atlas-2192/Multi-AI-Chat-APP

Language: Python - Size: 722 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 16 - Forks: 2

LittlePey/SFD

Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion (CVPR 2022, Oral)

Language: Python - Size: 1.78 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 271 - Forks: 39

Rajeeb321123/Daily_Tasks

During AI internship at princelab

Language: Jupyter Notebook - Size: 6.34 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

zjukg/MANS

[Paper][IJCNN2023] Modality-Aware Negative Sampling for Multi-modal Knowledge Graph Embedding

Language: Python - Size: 7.64 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

youngbin-ro/audiotext-transformer

Multimodal Transformer for Korean Sentiment Analysis with Audio and Text Features

Language: Python - Size: 804 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 20 - Forks: 7