Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: audio-visual
Libvisual/libvisual
Libvisual Audio Visualization
Language: C - Size: 20.6 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 81 - Forks: 30
krantiparida/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
Size: 58.6 KB - Last synced: 3 days ago - Pushed: 4 months ago - Stars: 620 - Forks: 70
jerosoler/waveform-path
🎙 Generator waveform paths for SVG 🎶
Language: JavaScript - Size: 226 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 67 - Forks: 6
samhirtarif/react-audio-visualize
An audio visualizer for React. Provides separate components to visualize both live audio and audio blobs.
Language: TypeScript - Size: 693 KB - Last synced: 7 days ago - Pushed: 5 months ago - Stars: 52 - Forks: 9
MacTirney/Audio-Visual-Scripts-and-Plugins
A compilation of audio-visual scripts and plugins primarily tailored for GrandMA2, GrandMA3, and MagicQ control software.
Language: Lua - Size: 33.2 KB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 1 - Forks: 0
Rudxain/bookmarks
Curated collection of links to a wide gamut of content
Size: 3.93 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 2 - Forks: 0
v-iashin/Synchformer
Efficient synchronization from sparse cues
Language: Python - Size: 92.9 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 13 - Forks: 3
satelllte/remotion-audio-visualizer
Programmatic minimalistic audio visualizations.
Language: TypeScript - Size: 52.1 MB - Last synced: 28 days ago - Pushed: 8 months ago - Stars: 31 - Forks: 0
guyyariv/TempoTokens
This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation
Language: Python - Size: 10.7 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 79 - Forks: 10
bfidatadigipres/BFI_scripts
Respository for BFI National Archive open source preservation workflow scripts
Language: Python - Size: 888 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 4 - Forks: 0
dkurzend/ClipClap-GZSL
Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models
Language: Python - Size: 27.6 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3 - Forks: 0
SMIL-SPCRAS/DAVIS
Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-based Method" in ICASSP 2024
Language: JavaScript - Size: 5.82 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 6 - Forks: 0
MCG-NJU/JoMoLD
[ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing
Language: Python - Size: 529 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 26 - Forks: 2
polygonjs/tutorial_audio_analysers
🎵 Tutorial showing how to use audio analysers to update a WebGL scene 🔊
Language: JavaScript - Size: 79.2 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 4 - Forks: 1
jinxiang-liu/anno-free-AVS
Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"
Language: Python - Size: 15.6 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 11 - Forks: 3
cogmhear/Intelligibility-Oriented-Audio-Visual-Speech-Enhancement
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Language: Python - Size: 27.3 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 13 - Forks: 2
tutaru99/Internet-Radio-Player-Vue
Internet Radio Player with an Audio Visualizer made using VueJS, Vuetify & Howler.JS frameworks. The Player has a bunch of radio stations. Check out the demo below.
Language: Vue - Size: 32.1 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 10 - Forks: 5
YitingLiu97/sun-pos-sounds
An interactive audio experience that creates a symphony of field recordings from the sun and us.
Language: JavaScript - Size: 73.2 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
dialogtekgeek/AVSD-DSTC10_Official
Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)
Size: 5.95 MB - Last synced: 2 months ago - Pushed: almost 2 years ago - Stars: 27 - Forks: 2
deeplsd/Syncnet_Analysis
This code is part of the paper: "A Deep Dive Into Neural Synchrony Evaluation for Audio-visual Translation" published at ACM ICMI 2022.
Language: Python - Size: 57.9 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
OpenGVLab/perception_test_iccv2023
Champion Solutions repository for Perception Test challenges in ICCV2023 workshop.
Language: Python - Size: 16.9 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 9 - Forks: 0
justVikram/av-emotion-recognition
Minor Project, VI Semester, 2021.
Language: Python - Size: 89.8 KB - Last synced: 5 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 1
v-iashin/SparseSync
Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)
Language: Python - Size: 72.4 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 38 - Forks: 7
usc-sail/mica-multimodal-ads
Segment-level autoencoders for multimodal representation
Language: Python - Size: 226 KB - Last synced: 12 days ago - Pushed: about 4 years ago - Stars: 9 - Forks: 1
TaoRuijie/TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Language: Python - Size: 52.3 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 209 - Forks: 49
Yu-Wu/Modaily-Aware-Audio-Visual-Video-Parsing
Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing
Language: Python - Size: 1 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 20 - Forks: 0
sripathisridhar/tau-av
CS677 final project: A study in audio-visual scene classification
Language: Python - Size: 11.8 MB - Last synced: 8 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
hmartelb/avlit
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model" (AVLIT)
Language: Python - Size: 422 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 8 - Forks: 1
ItsT3K/pixel-racks
A pixel art collection of A/V rackmount gear
Size: 0 Bytes - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
anonymous-demos/Multimodal-All-In-one-deprecated
Multi-Modal Speech Recognition, Separation and Diarization, Everything Streaming All at Once
Size: 24.4 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
dialogtekgeek/AudioVisualSceneAwareDialog
Language: Python - Size: 40 KB - Last synced: 2 months ago - Pushed: about 4 years ago - Stars: 27 - Forks: 9
georgesterpu/Taris
Transformer-based online speech recognition system with TensorFlow 2
Language: Python - Size: 5.4 MB - Last synced: 24 days ago - Pushed: over 3 years ago - Stars: 25 - Forks: 6
dedobbin/img_stripper
Library to convert image files to audio files and vice versa
Language: C++ - Size: 20.5 KB - Last synced: 11 months ago - Pushed: about 3 years ago - Stars: 6 - Forks: 0
MengyuanChen21/CVPR2023-CMPAE
[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
Language: Python - Size: 1.4 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 10 - Forks: 0
Overcautious/ADENet
Accepted by TMM 2022
Language: Python - Size: 4.71 MB - Last synced: 11 months ago - Pushed: almost 2 years ago - Stars: 11 - Forks: 1
TIB-Digital-Preservation/FilmConservationMetadata
a standardized way to record and store the finding of an inspection of an analogue film in order to document the state at the moment of digitization
Size: 1.43 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 5 - Forks: 0
ekazakos/temporal-binding-network
Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch
Language: Python - Size: 43.5 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 98 - Forks: 25
joannahong/AV-RelScore
Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" in CVPR23
Language: Python - Size: 24.1 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 5 - Forks: 0
m-onz/fakedac-videos
code and patches for fakedac~ music video's
Language: Processing - Size: 6.01 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
ankurbhatia24/MULTIMODAL-EMOTION-RECOGNITION
Human Emotion Understanding using multimodal dataset.
Language: Jupyter Notebook - Size: 5.72 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 50 - Forks: 17
magdalenafuentes/urbansas
Urban Sound & Sight dataset and baseline
Language: Jupyter Notebook - Size: 63.9 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 4 - Forks: 0
SAGNIKMJR/move2hear-active-AV-separation
Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)
Language: Python - Size: 1.31 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 12 - Forks: 0
FannyChao/AVS360_audiovisual_saliency_360
Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio
Language: Python - Size: 2.64 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 8 - Forks: 3
Edinburgh-College-of-Art/awesome-edinburgh-audio-vision
A curated list of real-time audio-vision resources.
Size: 60.5 KB - Last synced: 3 days ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 1
aprilcoffee/lift-off
Audio-Visual performance using MaxMsp & Processing3
Language: Max - Size: 283 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 3 - Forks: 0
MXZEHN/Transcontinental_Whispers
Transcontinental, collaborative, audio-visual performance
Size: 16.4 MB - Last synced: 12 months ago - Pushed: about 3 years ago - Stars: 2 - Forks: 1
markus-wa/av-clj
Audio Visual stuff in Clojure with Shadertone / GLSL
Language: GLSL - Size: 2.7 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 5 - Forks: 0
itec-hust/OMAPS
The OMAPS (Ordinary MIDI Aligned Piano Sounds) dataset was recorded from Yamaha electric piano P115 to evaluate audio-visual fusion piano transcription models.
Size: 4.03 GB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
aniskchaou/GCINEMAS-FRONTEND-USER
This project consists of developing a J2EE web application that allows viewers to book movies ticket using simple and Interactive GUI. The system is so simple and attractive which will make the audiences/viewers comfortable to use and choose their movie along with desired seat number and position.
Language: CSS - Size: 15.7 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0
sutterlaird/opinball
Open Pinspot for Ballrooms (OPinBall) is an open source DMX over Art-Net lighting controller designed to make it easier to pinspot centerpieces for banquet functions in ballrooms.
Language: Python - Size: 109 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
tridivb/attention_based_tbn
Attention-based Temporal Binding Network
Language: Python - Size: 8.36 MB - Last synced: 11 months ago - Pushed: almost 4 years ago - Stars: 8 - Forks: 1
janastu/maaya Fork of salus-sage/maaya
AV presentation from webVTT metadata
Language: JavaScript - Size: 360 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 3 - Forks: 1
Remi-Gau/AV-Attention-7T_code
Code to run and analyse of audiovisual/attention 7T experiment
Language: MATLAB - Size: 811 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 1
Remi-Gau/AV_Attention-Presentation_code-fMRI
code to run the AV attention fMRI experiment (7T)
Language: MATLAB - Size: 201 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0