GitHub topics: visual-audio

Repositories

gusanmaz/echosight

EchoSight is a tool that helps visually impaired individuals by audibly describing images taken with a Raspberry Pi Camera or inputted via image path or URL across different operating systems.

Language: Python - Size: 213 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

yiyiyi0817/Visual-Audio-Signal-Processing

哈工大视听觉信号处理实验作业Visual-auditory signal processing lab assignments

Language: Python - Size: 15.8 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

geminate/mwave

A Music Player that can show audio waveform

Language: JavaScript - Size: 3.6 MB - Last synced at: 2 months ago - Pushed at: almost 7 years ago - Stars: 69 - Forks: 16

MuSAELab/Multimodal-dataset-catalog

This repository lists publicly available datasets for visual-audio, speech and audio, and biomedical signal related tasks.

Size: 86.9 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

mx-mark/SPMNet

Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)

Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

MinglangQiao/MVVA-Database

Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020

Language: Python - Size: 130 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 7 - Forks: 1

MinglangQiao/visual_audio_saliency

Code for "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020

Language: Python - Size: 271 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

Related Keywords

visual-audio 7 visual-audio-saliency 2 multi-modal 2 cogvl 1 deepfake 1 healthcare 1 speech 1 audio-generation 1 audioset 1 cross-modality 1 synchronization 1 vas 1 video-understanding 1 visual-to-sound 1 multi-modal-database 1 talking-face 1 saliency-detection 1 saliency-prediction 1 coqui-tts 1 llm 1 llms 1 raspberry-pi 1 replicate 1 replicate-api 1 seamlessm4t 1 visual-audio-navigation 1 vllm 1 harbin-institute-of-technology 1 lab-assignment 1 signal-processing 1 electron-vue 1 player 1 biomedical-signal 1 dataset 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos