Topic: "sound-source-localization"
aishoot/Sound_Localization_Algorithms
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
Language: Jupyter Notebook - Size: 3.62 MB - Last synced at: 9 months ago - Pushed at: over 5 years ago - Stars: 369 - Forks: 103

Audio-WestlakeU/RealMAN
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
Language: Python - Size: 62.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 98 - Forks: 11

Audio-WestlakeU/FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization
Language: Python - Size: 210 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 85 - Forks: 9

BrownsugarZeer/Multi_SSL
Combine sound source separation with SRP-PHAT to achieve multi-source localization.
Language: Python - Size: 17.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 59 - Forks: 11

stoneMo/DeepAVFusion
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
Language: Python - Size: 26.4 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 12 - Forks: 0

ishaaniwani/GCC-PHAT-SSL
MATLAB Simulation Framework For Basic Sound Source Localization Using the GCC PHAT Algorithm
Language: MATLAB - Size: 942 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 12 - Forks: 2

sutdcv/Chaotic-World
[ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events
Size: 23.4 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 9 - Forks: 1

wattai/sound-source-position-estimation
This scripts estimate Sound Source Position based on Cross-power Spectrum Phase (CSP) or Multiple Signal Classification (MUSIC).
Language: Python - Size: 39.1 KB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 6 - Forks: 0

axeber01/wav2pos
3D Sound Source Localization using Masked Autoencoders
Language: Jupyter Notebook - Size: 1010 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 3

linfeng-feng/Unbiased_Label_Distribution
Eliminating Quantization Errors in Classification-Based Sound Source Localization
Language: Python - Size: 10.8 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

Gl0dny/hexapod
This project develops an autonomous hexapod robot using auditory scene analysis for navigation. It integrates sound source localization (DOA) and beamforming via ODAS with a circular microphone array for precise spatial detection. A machine learning-based Keyword Spotting (KWS) module enables voice command recognition for human-robot interaction.
Language: Python - Size: 9.5 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0

RobertoAlessandri/CNN_DOA
Test of the ability of a Convolutional Neural Network (CNN) trained to localize the Direction Of Arrival (DOA), to generalize in different environments.
Language: Jupyter Notebook - Size: 362 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

ishaaniwani/SpeechProcessor
Program that takes multiple wav files and processes them so that they can be recognized.
Language: Python - Size: 79.1 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

MaloOLIVIER/hungarian-net
Hungarian Network 🔬 — Generate synthetic data and train your deep-learning implementation of the Hungarian algorithm.
Language: Python - Size: 10.6 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

YHuaa/DFLNet
Code
Language: Python - Size: 81.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ZahraBenslimane/sound_source_localization_with_beamforming
Localization of a sound source using a microphone array and beamforming technics
Size: 0 Bytes - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

ly-zhu/ly-zhu.github.io
Projects webpage
Language: HTML - Size: 41.1 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dasdristanta13/2.5D-Visual-Sound
Visualising Sound
Language: Python - Size: 2.35 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0
