An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: multimodal-data

ai4colonoscopy/IntelliScope

Frontiers in Intelligent Colonoscopy [ColonSurvey | ColonINST | ColonGPT]

Language: Python - Size: 30.9 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 64 - Forks: 5

willxxy/awesome-mmps

Corpus of resources for multimodal machine learning with physiological signals (mmps).

Size: 1.03 MB - Last synced at: about 10 hours ago - Pushed at: 18 days ago - Stars: 73 - Forks: 2

aclai-lab/SoleData.jl

Manage logical datasets!

Language: Julia - Size: 1.88 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 13 - Forks: 2

sarkadava/FLESH_Effort

This repository stores the coding pipeline used to process and analyze data associated with the project "Putting in the Effort: Modulation of Multimodal Effort in Communicative Breakdowns during a Gestural-Vocal Referential Game" (FLESH).

Language: Jupyter Notebook - Size: 22.1 GB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

scverse/muon

muon is a multimodal omics Python framework

Language: Python - Size: 5.05 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 231 - Forks: 31

dhchenx/mmkit-features

A multimodal architecture to build multimodal knowledge graphs with flexible multimodal feature extraction and dynamic multimodal concept generation

Language: Python - Size: 324 MB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 0

ilaria-manco/multimodal-ml-music

List of academic resources on Multimodal ML for Music

Language: TeX - Size: 268 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 293 - Forks: 11

friedrichor/Awesome-Multimodal-Papers

A curated list of awesome Multimodal studies.

Language: HTML - Size: 63.2 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 172 - Forks: 16

BlueQuartzSoftware/simplnx

The backend algorithms and framework associated with DREAM3DNX, a data analysis program for materials science data analytics

Language: C++ - Size: 157 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 8 - Forks: 10

fork123aniket/Multi-Round-VLM-powered-Multimodal-Conversational-AI-Navigation-Bot

Streamlit App Combining Vision, Language, and Audio AI Models

Language: Python - Size: 18.6 KB - Last synced at: about 10 hours ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

google/space

Unified storage framework for the entire machine learning lifecycle

Language: Python - Size: 825 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 156 - Forks: 8

ZhihaoZhang97/RU-AI

[WWW'25] Official repo for paper: RU-AI: A Large Multimodal Dataset for Machine Generated Content Detection

Language: Jupyter Notebook - Size: 3.36 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

nobel-postech/M2CoSC

Code and data for "Multimodal Cognitive Reframing Therapy via Multi-hop Psychotherapeutic Reasoning" (NAACL 2025)

Size: 493 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

fork123aniket/Agentic-RAG-Story-Generation-with-Multimodal-GenAI

Multimodal Agentic GenAI Workflow – Seamlessly blends retrieval and generation for intelligent storytelling

Language: Python - Size: 94.7 KB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

sggao/multimodal-pnc

Official Implementation of "Multimodal Analysis of PNC Data via Sparse GCA"

Language: MATLAB - Size: 1.21 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

srinadh99/VISION-TRANSFORMER-DRIVEN-LIDAR-DATA-FUSION-FOR-ENHANCED-HYPERSPECTRAL-IMAGE-CLASSIFICATION

Study of Self Attention and Cross Attention-based Transformer models for Multimodal Remote Sensing Image Classification

Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

srinadh99/Transformer-Models-for-Multimodal-Remote-Sensing-Data

Study of Transformer based models for Multimodal Remote Sensing Image Classification

Language: Jupyter Notebook - Size: 253 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 21 - Forks: 2

mdh266/speech2image

A Streamlit App For Speech To Image

Language: Python - Size: 279 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

machine-intelligence-laboratory/TopicNet

Interface for easier topic modelling.

Language: Python - Size: 10.5 MB - Last synced at: 11 days ago - Pushed at: 9 months ago - Stars: 138 - Forks: 17

kyegomez/EXA-1 (fork of pliang279/awesome-multimodal-ml)

An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!

Language: Jupyter Notebook - Size: 1.15 GB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 42 - Forks: 2

sitamgithub-MSIT/well-being

Reducing neonatal and under-5 mortality rates via an AI-driven awareness platform with a Gradio app, Gemini API integration, and essential project utilities. #AIForGood

Language: Python - Size: 487 KB - Last synced at: 23 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

BlueQuartzSoftware/DREAM3D

Data analysis program and framework for materials science data analytics, based on the SIMPL framework.

Language: C++ - Size: 149 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 163 - Forks: 76

sitamgithub-MSIT/streamlit-app-builder

A Streamlit-based AI assistant that generates custom Streamlit app code from user-provided images or text using the Google Gemini model.

Language: Python - Size: 934 KB - Last synced at: 23 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 3

PaccMann/fdsa

A fully differentiable set autoencoder

Language: Python - Size: 6.1 MB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 3

kyegomez/Odin

SOTA Classification at scale for UAVs, Drones, and much more

Language: Python - Size: 211 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

sitamgithub-MSIT/TechSage

Language: Python - Size: 256 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

manuelpagliuca/pain-recognition-ml

Project for the courses of Natural Interaction and Affective Computing, University of Milan, M.Sc. in Computer Science, A.Y. 2022/2023. Predicting pain given a multi-modal dataset.

Language: Python - Size: 133 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

ChenHongruixuan/SRGCAE

[IEEE TGRS 2022] Official PyTorch implementation for Unsupervised Multimodal Change Detection Based on Structural Relationship Graph Representation Learning

Language: Python - Size: 2.41 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 31 - Forks: 0

michelecafagna26/HL-dataset

[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rationale.

Size: 5.67 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

akashe/Multimodal-action-recognition

Code for selecting an action based on multimodal inputs. In this case, the inputs are voice and text.

Language: Python - Size: 64.7 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 69 - Forks: 11

pdx-labs/pdx

Prompt Engineering and Dev-Ops toolkit for applications powered by Language Models

Language: Python - Size: 2.68 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 1

cleopatra-itn/GOAL

Multimodal and Multilingual Georeferencing and News Retrieval

Language: Python - Size: 1.84 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 1

slipnitskaya/FAVSeq

FAVSeq is a machine learning-based pipeline for identifying factors affecting the difference between bulk and scRNA-Seq experiments.

Language: Python - Size: 93.8 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Eurus-Holmes/Tumor2Graph

Tumor2Graph: a novel Overall-Tumor-Profile-derived virtual graph deep learning for predicting tumor typing and subtyping.

Language: Python - Size: 3.83 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 0

mateuszkochanek/reprezentacja-projekt

Project created for the Representation Learning course at the University of Technology in Wrocław.

Language: Jupyter Notebook - Size: 9.72 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

OlehOnyshchak/pyWikiMM

Collects a multimodal dataset of Wikipedia articles and their images

Language: Python - Size: 7.78 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

GOALCLEOPATRA/MLM

Multitask Learning with Multiple Languages and Modalities

Language: Jupyter Notebook - Size: 19 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 3

basiralab/GmTE-Net

Predicting the multi-trajectory evolution of multimodal brain connectivity.

Language: Python - Size: 1.43 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

GOALCLEOPATRA/MLM_Geo

Multimodal and Multilingual Georeferencing and News Retrieval

Language: Python - Size: 1.83 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 2

vaskanas/SemiSupervised-Algorithms

Language: Python - Size: 14.5 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 1

Related Keywords
multimodal-data (40), machine-learning (10), multimodal (9), multimodal-learning (8), generative-ai (7), multimodal-large-language-models (7), multimodal-deep-learning (7), python (4), deep-learning (4), artificial-intelligence (3), multitask-learning (3), multilingual (3), vision-language (3), multimodality (3), dataset (3), scrna-seq (2), remotesensing (2), hyperspectral-image-classification (2), deeplearning (2), autoencoder (2), graph-convolutional-networks (2), analysis (2), data-analysis (2), filter (2), vision-language-transformer (2), vision-language-model (2), vision-language-learning (2), internvl2 (2), microstructure (2), materials-informatics (2), gemini-api (2), gemini-15-pro (2), streamlit (2), chatbot (2), gradio (2), huggingface-spaces (2), computer-vision (2), code-generation (1), materials-science (1), data-science (1), huggingface-datasets (1), image-captioning (1), image2text (1), multimodal-grounding (1), c-plus-plus (1), vision-and-language (1), cross-attention (1), multimodal-action-recognition (1), multimodal-fusion (1), anthropic (1), anthropic-claude (1), affective-computing (1), mediapipe (1), natural-interaction (1), techbot (1), opencv (1), gemini-pro-vision (1), gemini-pro (1), pain-detection (1), phuselab (1), svm-classifier (1), swarm-intelligence (1), change-detection (1), remote-sensing (1), structural-relationship (1), set-autoencoder (1), unsupervised-learning (1), weave (1), wandb (1), llm-tracing (1), data-collection (1), data-processing (1), database (1), multimodal-datasets (1), multimodal-representation (1), wikipedia (1), wikipedia-api (1), wikipedia-bot (1), wikipedia-corpus (1), wikipedia-dump (1), wikipedia-entries (1), wikipedia-page (1), wikipedia-scraper (1), wikipedia-search (1), wikipedia-viewer (1), brain-connectivity (1), brain-wiring (1), functional-connectome (1), graph-neural-networks (1), morphological-connectome (1), multitrajectory-prediction (1), student-teacher-learning (1), active-learning (1), semisupervised-learning (1), cohere (1), gpt-3 (1), gpt-4 (1), llm (1), llmops (1), llms (1)