An open API service providing repository metadata for many open source software ecosystems.

Topic: "multimodal-data"

ilaria-manco/multimodal-ml-music

List of academic resources on Multimodal ML for Music

Language: TeX - Size: 268 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 295 - Forks: 11

IIGROUP/MM-CelebA-HQ-Dataset

[CVPR 2021] Multi-Modal-CelebA-HQ: A Large-Scale Text-Driven Face Generation and Understanding Dataset

Language: Python - Size: 3.41 MB - Last synced at: 11 days ago - Pushed at: 12 months ago - Stars: 240 - Forks: 20

scverse/muon

muon is a multimodal omics Python framework

Language: Python - Size: 5.05 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 235 - Forks: 31

friedrichor/Awesome-Multimodal-Papers

A curated list of awesome Multimodal studies.

Language: HTML - Size: 63.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 192 - Forks: 19

BlueQuartzSoftware/DREAM3D

Data analysis program and framework for materials science data analytics, based on the SIMPL managing framework.

Language: C++ - Size: 149 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 163 - Forks: 76

google/space 📦

Unified storage framework for the entire machine learning lifecycle

Language: Python - Size: 825 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 8

machine-intelligence-laboratory/TopicNet

Interface for easier topic modelling.

Language: Python - Size: 10.5 MB - Last synced at: 15 days ago - Pushed at: 10 months ago - Stars: 139 - Forks: 17

willxxy/awesome-mmps

Corpus of resources for multimodal machine learning with physiological signals (mmps).

Size: 1.17 MB - Last synced at: 7 days ago - Pushed at: 10 days ago - Stars: 80 - Forks: 2

akashe/Multimodal-action-recognition

Code for selecting an action based on multimodal inputs. In this case, the inputs are voice and text.

Language: Python - Size: 64.7 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 69 - Forks: 11

ai4colonoscopy/IntelliScope

Frontiers in Intelligent Colonoscopy [ColonSurvey | ColonINST | ColonGPT]

Language: Python - Size: 32.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 66 - Forks: 4

kyegomez/EXA-1 (fork of pliang279/awesome-multimodal-ml)

An EXA-Scale repository of Multi-Modality AI resources, from papers and models to foundational libraries!

Language: Jupyter Notebook - Size: 1.15 GB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 2

ChenHongruixuan/SRGCAE

[IEEE TGRS 2022] Official PyTorch implementation for Unsupervised Multimodal Change Detection Based on Structural Relationship Graph Representation Learning

Language: Python - Size: 2.41 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 31 - Forks: 0

srinadh99/Transformer-Models-for-Multimodal-Remote-Sensing-Data

Study of Transformer based models for Multimodal Remote Sensing Image Classification

Language: Jupyter Notebook - Size: 253 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 21 - Forks: 2

PaccMann/fdsa

A fully differentiable set autoencoder

Language: Python - Size: 6.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 3

aclai-lab/SoleData.jl

Manage logical datasets!

Language: Julia - Size: 1.88 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 13 - Forks: 2

dhchenx/mmkit-features

A multimodal architecture to build multimodal knowledge graphs with flexible multimodal feature extraction and dynamic multimodal concept generation

Language: Python - Size: 324 MB - Last synced at: 4 days ago - Pushed at: about 2 years ago - Stars: 9 - Forks: 0

OlehOnyshchak/pyWikiMM

Collects a multimodal dataset of Wikipedia articles and their images

Language: Python - Size: 7.78 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

BlueQuartzSoftware/simplnx

The backend algorithms and framework associated with DREAM3DNX, a data analysis program for materials science data analytics

Language: C++ - Size: 158 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 8 - Forks: 11

pdx-labs/pdx

Prompt Engineering and Dev-Ops toolkit for applications powered by Language Models

Language: Python - Size: 2.68 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 1

Eurus-Holmes/Tumor2Graph

Tumor2Graph: a novel Overall-Tumor-Profile-derived virtual graph deep learning for predicting tumor typing and subtyping.

Language: Python - Size: 3.83 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 0

kyegomez/Odin

SOTA Classification at scale for UAVs, Drones, and much more

Language: Python - Size: 211 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

ZhihaoZhang97/RU-AI

[WWW'25] Official repo for paper: RU-AI: A Large Multimodal Dataset for Machine Generated Content Detection

Language: Jupyter Notebook - Size: 3.36 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

nobel-postech/M2CoSC

Code and data for "Multimodal Cognitive Reframing Therapy via Multi-hop Psychotherapeutic Reasoning" (NAACL 2025)

Size: 493 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

fork123aniket/Multi-Round-VLM-powered-Multimodal-Conversational-AI-Navigation-Bot

Streamlit App Combining Vision, Language, and Audio AI Models

Language: Python - Size: 18.6 KB - Last synced at: 26 days ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

sitamgithub-MSIT/streamlit-app-builder

A Streamlit-based AI assistant that generates custom Streamlit app code from user-provided images or text using the Google Gemini model.

Language: Python - Size: 934 KB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 3

basiralab/GmTE-Net

Predicting the multi-trajectory evolution of multimodal brain connectivity.

Language: Python - Size: 1.43 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

fork123aniket/Agentic-RAG-Story-Generation-with-Multimodal-GenAI

Multimodal Agentic GenAI Workflow – Seamlessly blends retrieval and generation for intelligent storytelling

Language: Python - Size: 94.7 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

manuelpagliuca/pain-recognition-ml

Project for the courses of Natural Interaction and Affective Computing, University of Milan, M.Sc. in Computer Science, A.Y. 2022/2023. Predicting pain given a multi-modal dataset.

Language: Python - Size: 133 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

michelecafagna26/HL-dataset

[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rationale.

Size: 5.67 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

sitamgithub-MSIT/well-being

Reducing neonatal and under-5 mortality rates via an AI-driven awareness platform with a Gradio app, Gemini API integration, and essential project utilities. #AIForGood

Language: Python - Size: 487 KB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 1

sarkadava/FLESH_Effort

This repository stores the coding pipeline to process and analyze data associated with the project "Putting in the Effort: Modulation of Multimodal Effort in Communicative Breakdowns during a Gestural-Vocal Referential Game" (FLESH).

Language: Jupyter Notebook - Size: 22.1 GB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

srinadh99/VISION-TRANSFORMER-DRIVEN-LIDAR-DATA-FUSION-FOR-ENHANCED-HYPERSPECTRAL-IMAGE-CLASSIFICATION

Study of Self Attention and Cross Attention-based Transformer models for Multimodal Remote Sensing Image Classification

Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

mdh266/speech2image

A Streamlit App For Speech To Image

Language: Python - Size: 279 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

sitamgithub-MSIT/TechSage

Language: Python - Size: 256 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

sggao/multimodal-pnc

Official Implementation of "Multimodal Analysis of PNC Data via Sparse GCA"

Language: MATLAB - Size: 1.21 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

GOALCLEOPATRA/MLM

Multitask Learning with Multiple Languages and Modalities

Language: Jupyter Notebook - Size: 19 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 3

mateuszkochanek/reprezentacja-projekt

Project created for the Representation Learning course at the University of Technology in Wrocław.

Language: Jupyter Notebook - Size: 9.72 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

GOALCLEOPATRA/MLM_Geo

Multimodal and Multilingual Georeferencing and News Retrieval

Language: Python - Size: 1.83 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 2

slipnitskaya/FAVSeq

FAVSeq is a machine learning-based pipeline for identifying factors affecting the difference between bulk and scRNA-Seq experiments.

Language: Python - Size: 93.8 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

cleopatra-itn/GOAL

Multimodal and Multilingual Georeferencing and News Retrieval

Language: Python - Size: 1.84 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 1

vaskanas/SemiSupervised-Algorithms

Language: Python - Size: 14.5 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 1

Related Topics
machine-learning (10), multimodal (9), multimodal-learning (8), generative-ai (7), multimodal-deep-learning (7), multimodal-large-language-models (7), deep-learning (4), python (4), dataset (3), multimodality (3), multitask-learning (3), multilingual (3), artificial-intelligence (3), vision-language (3), streamlit (2), gemini-api (2), hyperspectral-image-classification (2), scrna-seq (2), gemini-15-pro (2), deeplearning (2), gradio (2), remotesensing (2), computer-vision (2), huggingface-spaces (2), microstructure (2), graph-convolutional-networks (2), materials-informatics (2), filter (2), chatbot (2), analysis (2), internvl2 (2), vision-language-transformer (2), data-analysis (2), autoencoder (2), vision-language-model (2), vision-language-learning (2), multimodal-graphs (1), graph-deep-learning (1), gnn (1), conversational-agent (1), tcga-data (1), agentic-ai (1), agentic-rag (1), agentic-workflow (1), generative-ai-model (1), story-generation (1), apache-arrow (1), apache-parquet (1), data-warehouse (1), dataops (1), dml (1), biosignals (1), pypi (1), topic-modeling (1), topic-modelling (1), c-plus-plus (1), data-science (1), materials-science (1), anthropic (1), anthropic-claude (1), cohere (1), gpt-3 (1), gpt-4 (1), llm (1), llmops (1), llms (1), openai (1), prompt (1), prompt-engineering (1), prompt-toolkit (1), cluster (1), cnn (1), gcn (1), physiological-signals (1), signal-processing (1), wearable (1), wearable-devices (1), multimodal-feature (1), multimodal-knowledge-graph (1), colonoscopy (1), colonoscopy-survey (1), endoscopy (1), medical-ai (1), medical-image-analysis (1), multimodal-colonoscopy (1), polyp (1), polyp-survey (1), anndata (1), cite-seq (1), mudata (1), multi-omics (1), multimodal-omics-analysis (1), muon (1), scanpy (1), scatac-seq (1), scverse (1), lakehouse (1), mlops (1), olap (1), ray (1)