An open API service providing repository metadata for many open source software ecosystems.

Topic: "multimodal-machine-learning"

dqqcasia/awesome-speech-translation Fork of ucaslyc/speech_translation-papers

Size: 296 KB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 178 - Forks: 1

vincentlux/Awesome-Multimodal-LLM

Reading list for Multimodal Large Language Models

Size: 110 KB - Last synced at: 5 days ago - Pushed at: almost 2 years ago - Stars: 68 - Forks: 7

thuiar/MIntRec2.0

MIntRec2.0 is the first large-scale dataset for multimodal intent recognition and out-of-scope detection in multi-party conversations (ICLR 2024)

Language: Python - Size: 2.34 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 46 - Forks: 5

aclai-lab/MultiData.jl

Multimodal datasets for Machine-Learning

Language: Julia - Size: 389 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 3 - Forks: 0

Rajeevveera24/LatentAlignmentProcedural

This repository is cloned from https://github.com/HLR/LatentAlignmentProcedural. This is a potential baseline explored for the textual_cloze task on the RecipeQA Dataset - https://hucvl.github.io/recipeqa/

Language: Jupyter Notebook - Size: 47 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

andre-pereira/ICMI2024LLMsEnjoymentDetection

This repository contains the code, dataset, and model outputs for the ICMI 2024 paper Multimodal User Enjoyment Detection in Human-Robot Conversation: The Power of Large Language Models. It includes scripts for prompting LLMs, training supervised models, and evaluating multimodal enjoyment detection.

Language: Python - Size: 152 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0