An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: visual-audio

gusanmaz/echosight

EchoSight is a tool that helps visually impaired individuals by audibly describing images captured with a Raspberry Pi Camera or supplied via an image path or URL, and it works across different operating systems.

Language: Python - Size: 213 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

yiyiyi0817/Visual-Audio-Signal-Processing

Harbin Institute of Technology (HIT) visual-auditory signal processing lab assignments

Language: Python - Size: 15.8 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

geminate/mwave

A music player that can show the audio waveform

Language: JavaScript - Size: 3.6 MB - Last synced at: 2 months ago - Pushed at: almost 7 years ago - Stars: 69 - Forks: 16

MuSAELab/Multimodal-dataset-catalog

This repository lists publicly available datasets for visual-audio, speech and audio, and biomedical signal related tasks.

Size: 86.9 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

mx-mark/SPMNet

Source code for "Visually aligned sound generation via sound-producing motion parsing" (published in Neurocomputing)

Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

MinglangQiao/MVVA-Database

Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020

Language: Python - Size: 130 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 7 - Forks: 1

MinglangQiao/visual_audio_saliency

Code for "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020

Language: Python - Size: 271 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1