Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: audio-visual

Repositories

Libvisual/libvisual

Libvisual Audio Visualization

Language: C - Size: 20.6 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 81 - Forks: 30

krantiparida/awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

Size: 58.6 KB - Last synced: 3 days ago - Pushed: 4 months ago - Stars: 620 - Forks: 70

jerosoler/waveform-path

🎙 Generator of waveform paths for SVG 🎶

Language: JavaScript - Size: 226 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 67 - Forks: 6

samhirtarif/react-audio-visualize

An audio visualizer for React. Provides separate components to visualize both live audio and audio blobs.

Language: TypeScript - Size: 693 KB - Last synced: 7 days ago - Pushed: 5 months ago - Stars: 52 - Forks: 9

MacTirney/Audio-Visual-Scripts-and-Plugins

A compilation of audio-visual scripts and plugins primarily tailored for GrandMA2, GrandMA3, and MagicQ control software.

Language: Lua - Size: 33.2 KB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 1 - Forks: 0

Rudxain/bookmarks

Curated collection of links to a wide gamut of content

Size: 3.93 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 2 - Forks: 0

v-iashin/Synchformer

Efficient synchronization from sparse cues

Language: Python - Size: 92.9 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 13 - Forks: 3

satelllte/remotion-audio-visualizer

Programmatic minimalistic audio visualizations.

Language: TypeScript - Size: 52.1 MB - Last synced: 28 days ago - Pushed: 8 months ago - Stars: 31 - Forks: 0

guyyariv/TempoTokens

This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Language: Python - Size: 10.7 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 79 - Forks: 10

bfidatadigipres/BFI_scripts

Repository for BFI National Archive open source preservation workflow scripts

Language: Python - Size: 888 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 4 - Forks: 0

dkurzend/ClipClap-GZSL

Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models

Language: Python - Size: 27.6 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3 - Forks: 0

SMIL-SPCRAS/DAVIS

Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-based Method" in ICASSP 2024

Language: JavaScript - Size: 5.82 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 6 - Forks: 0

MCG-NJU/JoMoLD

[ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing

Language: Python - Size: 529 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 26 - Forks: 2

polygonjs/tutorial_audio_analysers

🎵 Tutorial showing how to use audio analysers to update a WebGL scene 🔊

Language: JavaScript - Size: 79.2 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 4 - Forks: 1

jinxiang-liu/anno-free-AVS

Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"

Language: Python - Size: 15.6 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 11 - Forks: 3

cogmhear/Intelligibility-Oriented-Audio-Visual-Speech-Enhancement

Towards Intelligibility-Oriented Audio-Visual Speech Enhancement

Language: Python - Size: 27.3 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 13 - Forks: 2

tutaru99/Internet-Radio-Player-Vue

Internet Radio Player with an Audio Visualizer made using VueJS, Vuetify & Howler.JS frameworks. The Player has a bunch of radio stations. Check out the demo below.

Language: Vue - Size: 32.1 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 10 - Forks: 5

YitingLiu97/sun-pos-sounds

An interactive audio experience that creates a symphony of field recordings from the sun and us.

Language: JavaScript - Size: 73.2 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

dialogtekgeek/AVSD-DSTC10_Official

Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)

Size: 5.95 MB - Last synced: 2 months ago - Pushed: almost 2 years ago - Stars: 27 - Forks: 2

deeplsd/Syncnet_Analysis

This code is part of the paper: "A Deep Dive Into Neural Synchrony Evaluation for Audio-visual Translation" published at ACM ICMI 2022.

Language: Python - Size: 57.9 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

OpenGVLab/perception_test_iccv2023

Champion Solutions repository for Perception Test challenges in ICCV2023 workshop.

Language: Python - Size: 16.9 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 9 - Forks: 0

justVikram/av-emotion-recognition

Minor Project, VI Semester, 2021.

Language: Python - Size: 89.8 KB - Last synced: 5 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 1

v-iashin/SparseSync

Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)

Language: Python - Size: 72.4 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 38 - Forks: 7

usc-sail/mica-multimodal-ads

Segment-level autoencoders for multimodal representation

Language: Python - Size: 226 KB - Last synced: 12 days ago - Pushed: about 4 years ago - Stars: 9 - Forks: 1

TaoRuijie/TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language: Python - Size: 52.3 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 209 - Forks: 49

Yu-Wu/Modaily-Aware-Audio-Visual-Video-Parsing

Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing

Language: Python - Size: 1 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 20 - Forks: 0

sripathisridhar/tau-av

CS677 final project: A study in audio-visual scene classification

Language: Python - Size: 11.8 MB - Last synced: 8 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

hmartelb/avlit

Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model" (AVLIT)

Language: Python - Size: 422 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 8 - Forks: 1

ItsT3K/pixel-racks

A pixel art collection of A/V rackmount gear

Size: 0 Bytes - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

anonymous-demos/Multimodal-All-In-one-deprecated

Multi-Modal Speech Recognition, Separation and Diarization, Everything Streaming All at Once

Size: 24.4 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

dialogtekgeek/AudioVisualSceneAwareDialog

Language: Python - Size: 40 KB - Last synced: 2 months ago - Pushed: about 4 years ago - Stars: 27 - Forks: 9

georgesterpu/Taris

Transformer-based online speech recognition system with TensorFlow 2

Language: Python - Size: 5.4 MB - Last synced: 24 days ago - Pushed: over 3 years ago - Stars: 25 - Forks: 6

dedobbin/img_stripper

Library to convert image files to audio files and vice versa

Language: C++ - Size: 20.5 KB - Last synced: 11 months ago - Pushed: about 3 years ago - Stars: 6 - Forks: 0

MengyuanChen21/CVPR2023-CMPAE

[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception

Language: Python - Size: 1.4 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 10 - Forks: 0

Overcautious/ADENet

Accepted by TMM 2022

Language: Python - Size: 4.71 MB - Last synced: 11 months ago - Pushed: almost 2 years ago - Stars: 11 - Forks: 1

TIB-Digital-Preservation/FilmConservationMetadata

A standardized way to record and store the findings of an inspection of an analogue film, in order to document its state at the moment of digitization

Size: 1.43 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 5 - Forks: 0

ekazakos/temporal-binding-network

Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch

Language: Python - Size: 43.5 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 98 - Forks: 25

joannahong/AV-RelScore

Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" in CVPR23

Language: Python - Size: 24.1 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 5 - Forks: 0

m-onz/fakedac-videos

code and patches for fakedac~ music videos

Language: Processing - Size: 6.01 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

ankurbhatia24/MULTIMODAL-EMOTION-RECOGNITION

Human Emotion Understanding using multimodal dataset.

Language: Jupyter Notebook - Size: 5.72 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 50 - Forks: 17

magdalenafuentes/urbansas

Urban Sound & Sight dataset and baseline

Language: Jupyter Notebook - Size: 63.9 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 4 - Forks: 0

SAGNIKMJR/move2hear-active-AV-separation

Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)

Language: Python - Size: 1.31 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 12 - Forks: 0

FannyChao/AVS360_audiovisual_saliency_360

Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio

Language: Python - Size: 2.64 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 8 - Forks: 3

Edinburgh-College-of-Art/awesome-edinburgh-audio-vision

A curated list of real-time audio-vision resources.

Size: 60.5 KB - Last synced: 3 days ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 1

aprilcoffee/lift-off

Audio-Visual performance using MaxMsp & Processing3

Language: Max - Size: 283 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 3 - Forks: 0

MXZEHN/Transcontinental_Whispers

Transcontinental, collaborative, audio-visual performance

Size: 16.4 MB - Last synced: 12 months ago - Pushed: about 3 years ago - Stars: 2 - Forks: 1

markus-wa/av-clj

Audio Visual stuff in Clojure with Shadertone / GLSL

Language: GLSL - Size: 2.7 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 5 - Forks: 0

itec-hust/OMAPS

The OMAPS (Ordinary MIDI Aligned Piano Sounds) dataset was recorded from Yamaha electric piano P115 to evaluate audio-visual fusion piano transcription models.

Size: 4.03 GB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

aniskchaou/GCINEMAS-FRONTEND-USER

This project consists of developing a J2EE web application that allows viewers to book movie tickets using a simple, interactive GUI. The system is designed to be simple and attractive, so that viewers can comfortably choose their movie along with the desired seat number and position.

Language: CSS - Size: 15.7 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

sutterlaird/opinball

Open Pinspot for Ballrooms (OPinBall) is an open source DMX over Art-Net lighting controller designed to make it easier to pinspot centerpieces for banquet functions in ballrooms.

Language: Python - Size: 109 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

tridivb/attention_based_tbn

Attention-based Temporal Binding Network

Language: Python - Size: 8.36 MB - Last synced: 11 months ago - Pushed: almost 4 years ago - Stars: 8 - Forks: 1

janastu/maaya (fork of salus-sage/maaya)

AV presentation from webVTT metadata

Language: JavaScript - Size: 360 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 3 - Forks: 1

Remi-Gau/AV-Attention-7T_code

Code to run and analyse the audiovisual/attention 7T experiment

Language: MATLAB - Size: 811 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 1

Remi-Gau/AV_Attention-Presentation_code-fMRI

code to run the AV attention fMRI experiment (7T)

Language: MATLAB - Size: 201 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

Related Keywords

audio-visual (54), audio-visualizer (6), audio (6), deep-learning (5), audio-visualization (4), multi-modal (4), pytorch (4), creative-coding (3), multimodal-deep-learning (3), multimodal (3), speech-enhancement (3), fusion (3), attention (3), speech-recognition (3), python (2), action-recognition (2), video (2), archive (2), video-understanding (2), synchronization (2), glsl (2), audio-visual-speech-recognition (2), awesome-list (2), active-speaker-detection (2), live (2), av (2), preservation (2), dialog (2), tensorflow (2), avsr (2), audio-visual-learning (2), transformer (2), multisensory-integration (2), graphics (2), emotion-recognition (1), jpg (1), opencv (1), nonsense (1), steganography (1), wav (1), audio-visual-video-parsing (1), cvpr2023 (1), multimodel (1), convolutional-networks (1), egocentric (1), algorave (1), gem (1), livecoding (1), puredata (1), deeplearning (1), bmvc (1), lrs (1), sparse (1), vggsound (1), advertisements (1), autoencoders (1), multimodal-representation (1), segment-level-autoencoders (1), awesome-asd (1), multimedia (1), cvpr (1), cvpr2021 (1), dcase2021 (1), iterative (1), lightweight (1), pytorch-lightning (1), speech-separation (1), rackmount (1), asr (1), diarization (1), multi-talker (1), multichannel-microphone-arrays (1), separation (1), dstc7 (1), scene-aware-dialog (1), live-caption (1), mahcine-learning (1), online (1), speech-recognizer (1), taris (1), tensorflow2 (1), keras (1), amt (1), angular (1), angular-cli (1), book-movies-ticket (1), cinema (1), express (1), film (1), full-stack (1), microservices (1), movie (1), node-js (1), ticket-management (1), typescript (1), art-net (1), banquet (1), dmx (1), event (1), lighting (1)