Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: speech-separation
maum-ai/voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Language: Python - Size: 1.14 MB - Last synced: 10 days ago - Pushed: 4 months ago - Stars: 1,035 - Forks: 227
chimechallenge/chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
Language: Python - Size: 2.51 MB - Last synced: 16 days ago - Pushed: 17 days ago - Stars: 13 - Forks: 2
aishoot/LSTM_PIT_Speech_Separation
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
Language: Jupyter Notebook - Size: 7.38 MB - Last synced: 11 days ago - Pushed: over 2 years ago - Stars: 302 - Forks: 90
Audio-WestlakeU/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Language: Python - Size: 892 KB - Last synced: 16 days ago - Pushed: 10 months ago - Stars: 508 - Forks: 148
kwatcharasupat/directional-sparse-filtering-tf
Python Implementation for Directional Sparse Filtering with Tensorflow/Keras
Language: Python - Size: 21.5 KB - Last synced: 24 days ago - Pushed: almost 3 years ago - Stars: 7 - Forks: 1
KyleZhang1118/Voice-Separation-and-Enhancement
A framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.
Language: MATLAB - Size: 35.5 MB - Last synced: 16 days ago - Pushed: over 2 years ago - Stars: 126 - Forks: 32
cyrta/awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Size: 13.7 KB - Last synced: 10 days ago - Pushed: over 4 years ago - Stars: 58 - Forks: 15
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language: Python - Size: 5.88 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 2,115 - Forks: 416
speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Language: HTML - Size: 46.7 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 356 - Forks: 28
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language: Python - Size: 84.5 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7,821 - Forks: 1,272
espnet/espnet
End-to-End Speech Processing Toolkit
Language: Python - Size: 920 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7,825 - Forks: 2,083
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Language: Python - Size: 72.4 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 388 - Forks: 70
posenhuang/deeplearningsourceseparation
Deep Recurrent Neural Networks for Source Separation
Language: MATLAB - Size: 500 MB - Last synced: 3 months ago - Pushed: almost 3 years ago - Stars: 363 - Forks: 136
JusperLee/Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
Size: 97.7 KB - Last synced: 3 months ago - Pushed: about 2 years ago - Stars: 693 - Forks: 132
double22a/speech_dataset
The dataset of Speech Recognition
Size: 62.5 KB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 333 - Forks: 66
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Size: 139 KB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 1,176 - Forks: 125
meokz/looking-to-listen
Deep neural network (DNN) for noise reduction, removal of background music, and speech separation
Language: Python - Size: 33 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 157 - Forks: 19
funcwj/setk
Tools for Speech Enhancement integrated with Kaldi
Language: Python - Size: 36.3 MB - Last synced: 3 months ago - Pushed: 11 months ago - Stars: 387 - Forks: 91
JusperLee/Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch
Language: Python - Size: 94.7 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 376 - Forks: 66
funcwj/aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
Language: Python - Size: 108 MB - Last synced: 3 months ago - Pushed: 11 months ago - Stars: 127 - Forks: 27
kaituoxu/Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
Language: Python - Size: 1.23 MB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 621 - Forks: 149
AppleHolic/source_separation
Deep learning based speech source separation using Pytorch
Language: Jupyter Notebook - Size: 4.11 MB - Last synced: 3 months ago - Pushed: over 3 years ago - Stars: 307 - Forks: 45
gemengtju/Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Language: MATLAB - Size: 74.6 MB - Last synced: 3 months ago - Pushed: over 3 years ago - Stars: 403 - Forks: 93
mcw519/PureSound
Make the sound you hear pure and clean by deep learning.
Language: Python - Size: 137 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 6 - Forks: 0
eesungkim/Speech_Enhancement_DNN_NMF
Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF
Language: Python - Size: 18.5 MB - Last synced: 3 months ago - Pushed: about 5 years ago - Stars: 165 - Forks: 58
funcwj/deep-clustering
deep clustering method for single-channel speech separation
Language: Python - Size: 23.4 KB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 109 - Forks: 35
JusperLee/Deep-Clustering-for-Speech-Separation
Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation
Language: Python - Size: 94.7 KB - Last synced: 3 months ago - Pushed: almost 4 years ago - Stars: 115 - Forks: 25
anton-jeran/MULTI-AUDIODEC
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
Language: Python - Size: 3.25 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 31 - Forks: 5
e13000/directional_sparse_filtering
Directional sparse filtering for blind speech separation
Language: MATLAB - Size: 16.1 MB - Last synced: 24 days ago - Pushed: almost 3 years ago - Stars: 7 - Forks: 3
seanwood/gcc-nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
Language: Python - Size: 43.2 MB - Last synced: 3 months ago - Pushed: about 5 years ago - Stars: 303 - Forks: 132
JusperLee/Calculate-SNR-SDR
Script to calculate SNR and SDR using python
Language: Python - Size: 8.79 KB - Last synced: 3 months ago - Pushed: almost 4 years ago - Stars: 82 - Forks: 25
anicolson/DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Language: MATLAB - Size: 497 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 463 - Forks: 119
JusperLee/Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
Language: Python - Size: 75.2 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 336 - Forks: 68
JusperLee/Looking-to-Listen-at-the-Cocktail-Party
Executable code based on Google articles
Language: Python - Size: 81.5 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 149 - Forks: 43
etzinis/sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
Language: Jupyter Notebook - Size: 21 MB - Last synced: 7 months ago - Pushed: 11 months ago - Stars: 266 - Forks: 30
tky823/DNN-based_source_separation
A PyTorch implementation of DNN-based source separation.
Language: Python - Size: 293 MB - Last synced: 7 months ago - Pushed: about 2 years ago - Stars: 242 - Forks: 46
NikhilC2209/AVSpeech_Sep
Thesis project for Speech Separation using Deep Learning
Language: Jupyter Notebook - Size: 32 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
funcwj/uPIT-for-speech-separation
Speech separation with utterance-level PIT experiments
Language: Python - Size: 38.1 KB - Last synced: 3 months ago - Pushed: almost 6 years ago - Stars: 98 - Forks: 39
kaituoxu/TasNet
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
Language: Python - Size: 1.15 MB - Last synced: 8 months ago - Pushed: over 5 years ago - Stars: 96 - Forks: 30
jacoxu/ASAM
This is the code&dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]
Language: Python - Size: 49.1 MB - Last synced: 8 months ago - Pushed: about 6 years ago - Stars: 54 - Forks: 20
dangvansam/pyannote-onnx Fork of pyannote/pyannote-audio
PyAnnote with ONNX model
Language: Jupyter Notebook - Size: 273 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
funcwj/conv-tasnet
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https://github.com/funcwj/aps)
Language: Python - Size: 181 MB - Last synced: 7 months ago - Pushed: 11 months ago - Stars: 198 - Forks: 60
anicolson/bidirectional_2018
A Deep Learning Approach to Ideal Binary Mask Estimation
Size: 8.8 MB - Last synced: 9 months ago - Pushed: almost 5 years ago - Stars: 9 - Forks: 3
hangtingchen/Beam-Guided-TasNet
Beam-guided TasNet
Language: Python - Size: 23.3 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 33 - Forks: 7
hmartelb/avlit
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model" (AVLIT)
Language: Python - Size: 422 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 8 - Forks: 1
RishiKakade/Speech-Separating-Hearing-Aid
Language: JavaScript - Size: 10.7 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
ooshyun/Speech-Enhancement-Pytorch
Pytorch Models for Speech Enhancement
Language: Python - Size: 3.34 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
JusperLee/Deep-Encoder-Decoder-Conv-TasNet
A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "
Language: Python - Size: 3.91 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 34 - Forks: 9
ZhaZhaFon/demo-confusion
This is a demo for our paper 'Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches'
Size: 35.5 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 4 - Forks: 6
xuchenglin28/speech_separation
Constrained Permutation Invariant Training, Speech Separation
Language: Python - Size: 1.13 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 33 - Forks: 9
JusperLee/DANet-For-Speech-Separation
Pytorch implement of DANet For Speech Separation
Language: Python - Size: 17.6 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 18 - Forks: 4
JusperLee/UtterancePIT-Speech-Separation
According to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.
Language: Python - Size: 34.2 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 59 - Forks: 10
funcwj/voice-filter
A unofficial Pytorch implementation of Google's VoiceFilter
Language: Python - Size: 4.67 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 80 - Forks: 21
haoxiangsnr/SpEx
Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".
Language: Python - Size: 18.6 KB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 26 - Forks: 8
Totoketchup/Adaptive-MultiSpeaker-Separation
Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem
Language: Jupyter Notebook - Size: 18 MB - Last synced: over 1 year ago - Pushed: almost 6 years ago - Stars: 45 - Forks: 18
ZhaZhaFon/demo-samom
This is a demo for our paper 'Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction'.
Size: 4.08 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 1
HeliosX7/voice-filter
Unofficial Tensorflow/Keras implementation of Google AI VoiceFilter
Language: Jupyter Notebook - Size: 3.69 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 9 - Forks: 3
ZhaZhaFon/repo_asteroid Fork of asteroid-team/asteroid
语音前端仓库 || a modified version of Asteroid toolkit for Speech Front-end
Language: Python - Size: 5.68 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
ZhaZhaFon/demo-speakerseparation
This is a demo for my bachelor thesis 'Speaker Separation and Machine Auditory Perception for Dialogue Scene'.
Language: Shell - Size: 2.91 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0
mborsdorf/TargetLanguageExtraction
Size: 21.5 KB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0
ZitengWang/uPIT-for-speech-separation Fork of funcwj/uPIT-for-speech-separation
target speaker separation using a short adaptation utterance
Language: Python - Size: 36.1 KB - Last synced: 11 months ago - Pushed: over 5 years ago - Stars: 1 - Forks: 1
Orelbenr/acoustic-fencing
Acoustic Fence Using Multi-Microphone Speaker Separation
Language: Python - Size: 8.16 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
lukereichold/visual-speech-separation
Flask app to demo multimodal deep learning speech separation in videos via TensorFlow Serving
Language: Python - Size: 20.8 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0
SouppuoS/Multi-round-record
a simple implement for multi-round recordings
Language: Python - Size: 4.88 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0
dacson/Demo-of-Speech-Separation
single channel speech separation for music vocal and accompany separate、voice reduce noise
Size: 151 MB - Last synced: over 1 year ago - Pushed: almost 5 years ago - Stars: 8 - Forks: 5
shun60s/Blind-Speech-Separation
U-Netによる音楽と音声のミックス信号(モノラル)からの音声の分離
Language: Python - Size: 28.3 MB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 3 - Forks: 2