GitHub topics: multimodal-data
ai4colonoscopy/IntelliScope
Frontiers in Intelligent Colonoscopy [ColonSurvey | ColonINST | ColonGPT]
Language: Python - Size: 30.9 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 64 - Forks: 5

willxxy/awesome-mmps
Corpus of resources for multimodal machine learning with physiological signals (mmps).
Size: 1.03 MB - Last synced at: about 10 hours ago - Pushed at: 18 days ago - Stars: 73 - Forks: 2

aclai-lab/SoleData.jl
Manage logical datasets!
Language: Julia - Size: 1.88 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 13 - Forks: 2

sarkadava/FLESH_Effort
This repository stores coding pipeline to process and analyze data associated with project "Putting in the Effort: Modulation of Multimodal Effort in Communicative Breakdowns during a Gestural-Vocal Referential Game" (FLESH).
Language: Jupyter Notebook - Size: 22.1 GB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

scverse/muon
muon is a multimodal omics Python framework
Language: Python - Size: 5.05 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 231 - Forks: 31

dhchenx/mmkit-features
A multimodal architecture to build multimodal knowledge graphs with flexible multimodal feature extraction and dynamic multimodal concept generation
Language: Python - Size: 324 MB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 0

ilaria-manco/multimodal-ml-music
List of academic resources on Multimodal ML for Music
Language: TeX - Size: 268 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 293 - Forks: 11

friedrichor/Awesome-Multimodal-Papers
A curated list of awesome Multimodal studies.
Language: HTML - Size: 63.2 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 172 - Forks: 16

BlueQuartzSoftware/simplnx
The backend algorithms and framework associated with DREAM3DNX, a data analysis program for materials science data analytics
Language: C++ - Size: 157 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 8 - Forks: 10

fork123aniket/Multi-Round-VLM-powered-Multimodal-Conversational-AI-Navigation-Bot
Streamlit App Combining Vision, Language, and Audio AI Models
Language: Python - Size: 18.6 KB - Last synced at: about 10 hours ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

google/space
Unified storage framework for the entire machine learning lifecycle
Language: Python - Size: 825 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 156 - Forks: 8

ZhihaoZhang97/RU-AI
[WWW'25] Official repo for paper: RU-AI: A Large Multimodal Dataset for Machine Generated Content Detection
Language: Jupyter Notebook - Size: 3.36 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

nobel-postech/M2CoSC
Code and data for "Multimodal Cognitive Reframing Therapy via Multi-hop Psychotherapeutic Reasoning" (NAACL 2025)
Size: 493 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

fork123aniket/Agentic-RAG-Story-Generation-with-Multimodal-GenAI
Multimodal Agentic GenAI Workflow – Seamlessly blends retrieval and generation for intelligent storytelling
Language: Python - Size: 94.7 KB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

sggao/multimodal-pnc
Official Implementation of "Multimodal Analysis of PNC Data via Sparse GCA"
Language: MATLAB - Size: 1.21 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

srinadh99/VISION-TRANSFORMER-DRIVEN-LIDAR-DATA-FUSION-FOR-ENHANCED-HYPERSPECTRAL-IMAGE-CLASSIFICATION
Study of Self Attention and Cross Attention-based Transformer models for Multimodal Remote Sensing Image Classification
Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

srinadh99/Transformer-Models-for-Multimodal-Remote-Sensing-Data
Study of Transformer based models for Multimodal Remote Sensing Image Classification
Language: Jupyter Notebook - Size: 253 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 21 - Forks: 2

mdh266/speech2image
A Streamlit App For Speech To Image
Language: Python - Size: 279 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

machine-intelligence-laboratory/TopicNet
Interface for easier topic modelling.
Language: Python - Size: 10.5 MB - Last synced at: 11 days ago - Pushed at: 9 months ago - Stars: 138 - Forks: 17

kyegomez/EXA-1 Fork of pliang279/awesome-multimodal-ml
An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!
Language: Jupyter Notebook - Size: 1.15 GB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 42 - Forks: 2

sitamgithub-MSIT/well-being
Reducing neonatal and under-5 mortality rates via an AI-driven awareness platform with a Gradio app, Gemini API integration, and essential project utilities. #AIForGood
Language: Python - Size: 487 KB - Last synced at: 23 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

BlueQuartzSoftware/DREAM3D
Data Analysis program and framework for materials science data analytics, based on the managing framework SIMPL framework.
Language: C++ - Size: 149 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 163 - Forks: 76

sitamgithub-MSIT/streamlit-app-builder
A Streamlit-based AI assistant generates custom Streamlit app code from user-provided images or text using the Google Gemini model.
Language: Python - Size: 934 KB - Last synced at: 23 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 3

PaccMann/fdsa
A fully differentiable set autoencoder
Language: Python - Size: 6.1 MB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 3

kyegomez/Odin
SOTA Classification at scale for UAVs, Drones, and much more
Language: Python - Size: 211 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

sitamgithub-MSIT/TechSage
Language: Python - Size: 256 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

manuelpagliuca/pain-recognition-ml
Project for the courses of Natural Interaction and Affective Computing, University of Milan, M.Sc. in Computer Science, A.Y. 2022/2023. Predicting pain given a multi-modal dataset.
Language: Python - Size: 133 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

ChenHongruixuan/SRGCAE
[IEEE TGRS 2022] Official Pytorch implementation for Unsupervised Multimodal Change Detection Based on Structural Relationship Graph Representation Learning
Language: Python - Size: 2.41 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 31 - Forks: 0

michelecafagna26/HL-dataset
[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rationale.
Size: 5.67 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

akashe/Multimodal-action-recognition
Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.
Language: Python - Size: 64.7 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 69 - Forks: 11

pdx-labs/pdx
Prompt Engineering and Dev-Ops toolkit for applications powered by Language Models
Language: Python - Size: 2.68 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 1

cleopatra-itn/GOAL
Multimodal and Multilingual Georeferencing and News Retrieval
Language: Python - Size: 1.84 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 1

slipnitskaya/FAVSeq
FAVSeq is a machine learning-based pipeline for identifying factors affecting the difference between bulk and scRNA-Seq experiments.
Language: Python - Size: 93.8 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Eurus-Holmes/Tumor2Graph
Tumor2Graph: a novel Overall-Tumor-Profile-derived virtual graph deep learning for predicting tumor typing and subtyping.
Language: Python - Size: 3.83 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 0

mateuszkochanek/reprezentacja-projekt
Project created for Representation Learning course on the University of Technology in Wrocław.
Language: Jupyter Notebook - Size: 9.72 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

OlehOnyshchak/pyWikiMM
Collects a multimodal dataset of Wikipedia articles and their images
Language: Python - Size: 7.78 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

GOALCLEOPATRA/MLM
Multitask Learning with Multiple Languages and Modalities
Language: Jupyter Notebook - Size: 19 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 3

basiralab/GmTE-Net
Predicting the multi-trajectory evolution of multimodal brain connectivity.
Language: Python - Size: 1.43 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

GOALCLEOPATRA/MLM_Geo
Multimodal and Multilingual Georeferencing and News Retrieval
Language: Python - Size: 1.83 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 2

vaskanas/SemiSupervised-Algorithms
Language: Python - Size: 14.5 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 1
