An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: speech-separation

modelscope/ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Language: Python - Size: 276 MB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 2,971 - Forks: 239

asteroid-team/asteroid

The PyTorch-based audio source separation toolkit for researchers

Language: Python - Size: 5.88 MB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 2,403 - Forks: 436

espnet/espnet

End-to-End Speech Processing Toolkit

Language: Python - Size: 1.15 GB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 9,202 - Forks: 2,278

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Language: Python - Size: 98.2 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 9,952 - Forks: 1,504

speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Language: HTML - Size: 46.8 MB - Last synced at: 15 days ago - Pushed at: 26 days ago - Stars: 368 - Forks: 30

microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Language: Python - Size: 72.4 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 463 - Forks: 74

posenhuang/deeplearningsourceseparation

Deep Recurrent Neural Networks for Source Separation

Language: MATLAB - Size: 500 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 369 - Forks: 133

tky823/DNN-based_source_separation

A PyTorch implementation of DNN-based source separation.

Language: Python - Size: 293 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 301 - Forks: 51

sayemomer/Speech-Separations-with-variable-number-of-sources

Audio source separation model with a Whisper ECAPA-TDNN counter and pre‑trained speechbrain/sepformer-libri3mix and speechbrain/sepformer-wsj02mix for speech separation, implemented with SpeechBrain.

Language: Jupyter Notebook - Size: 4.31 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

JusperLee/Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

Size: 48.8 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 779 - Forks: 137

maum-ai/voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Language: Python - Size: 1.13 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 1,137 - Forks: 228

chentuochao/Spatial-Speech-Translation

The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"

Language: Python - Size: 23.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 30 - Forks: 0

KyleZhang1118/Voice-Separation-and-Enhancement

A framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.

Language: MATLAB - Size: 35.5 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 157 - Forks: 35

cyrta/awesome-speech-enhancement

A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.

Size: 13.7 KB - Last synced at: about 2 months ago - Pushed at: almost 6 years ago - Stars: 67 - Forks: 15

seanwood/gcc-nmf

Real-time GCC-NMF Blind Speech Separation and Enhancement

Language: Python - Size: 43.2 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 319 - Forks: 134

double22a/speech_dataset

The dataset of Speech Recognition

Size: 74.2 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 413 - Forks: 77

kaituoxu/Conv-TasNet

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Language: Python - Size: 1.23 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 697 - Forks: 156

gemengtju/Tutorial_Separation

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

Language: MATLAB - Size: 74.6 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 459 - Forks: 95

eesungkim/Speech_Enhancement_DNN_NMF

Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF

Language: Python - Size: 18.5 MB - Last synced at: 3 months ago - Pushed at: about 6 years ago - Stars: 184 - Forks: 61

coqui-ai/open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Size: 139 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1,318 - Forks: 142

funcwj/setk

Tools for Speech Enhancement integrated with Kaldi

Language: Python - Size: 36.3 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 410 - Forks: 91

JusperLee/Dual-Path-RNN-Pytorch

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch

Language: Python - Size: 94.7 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 434 - Forks: 66

anton-jeran/MULTI-AUDIODEC

This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.

Language: Python - Size: 7.41 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 46 - Forks: 6

funcwj/uPIT-for-speech-separation

Speech separation with utterance-level PIT experiments

Language: Python - Size: 38.1 KB - Last synced at: 3 months ago - Pushed at: almost 7 years ago - Stars: 104 - Forks: 39

JusperLee/Calculate-SNR-SDR

Script to calculate SNR and SDR using python

Language: Python - Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 90 - Forks: 26

JusperLee/Deep-Clustering-for-Speech-Separation

Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation

Language: Python - Size: 94.7 KB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 131 - Forks: 24

funcwj/aps

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

Language: Python - Size: 108 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 142 - Forks: 28

AppleHolic/source_separation

Deep learning based speech source separation using Pytorch

Language: Jupyter Notebook - Size: 4.11 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 316 - Forks: 46

meokz/looking-to-listen

Deep neural network (DNN) for noise reduction, removal of background music, and speech separation

Language: Python - Size: 33 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 172 - Forks: 19

khanld/Dynamic-Mixing

Dynamic Mixing For Speech Processing (mix-on-the-fly)

Language: Python - Size: 12.5 MB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 17 - Forks: 2

aishoot/LSTM_PIT_Speech_Separation

Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.

Language: Jupyter Notebook - Size: 7.38 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 308 - Forks: 90

chentuochao/Sound_Bubble

Project for speech bubble

Language: Python - Size: 12.9 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 2

Audio-WestlakeU/FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Language: Python - Size: 892 KB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 552 - Forks: 156

funcwj/deep-clustering

deep clustering method for single-channel speech separation

Language: Python - Size: 23.4 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 109 - Forks: 34

chimechallenge/chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

Language: Python - Size: 2.63 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 21 - Forks: 3

mcw519/PureSound

Make the sound you hear pure and clean by deep learning.

Language: Python - Size: 138 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 8 - Forks: 0

mborsdorf/GlobalPhoneMS_Scripts

Language: MATLAB - Size: 5.44 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

mborsdorf/UniversalSpeakerExtraction

Language: Python - Size: 9.42 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 14 - Forks: 4

mborsdorf/TargetLanguageExtraction

Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

kwatcharasupat/directional-sparse-filtering-tf

Python Implementation for Directional Sparse Filtering with Tensorflow/Keras

Language: Python - Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 1

e13000/directional_sparse_filtering

Directional sparse filtering for blind speech separation

Language: MATLAB - Size: 16.1 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 3

anicolson/DeepXi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

Language: MATLAB - Size: 497 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 463 - Forks: 119

JusperLee/Conv-TasNet

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement

Language: Python - Size: 75.2 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 336 - Forks: 68

JusperLee/Looking-to-Listen-at-the-Cocktail-Party

Executable code based on Google articles

Language: Python - Size: 81.5 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 149 - Forks: 43

etzinis/sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

Language: Jupyter Notebook - Size: 21 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 266 - Forks: 30

NikhilC2209/AVSpeech_Sep

Thesis project for Speech Separation using Deep Learning

Language: Jupyter Notebook - Size: 32 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kaituoxu/TasNet

A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.

Language: Python - Size: 1.15 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 96 - Forks: 30

jacoxu/ASAM

This is the code&dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]

Language: Python - Size: 49.1 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 54 - Forks: 20

dangvansam/pyannote-onnx Fork of pyannote/pyannote-audio

PyAnnote with ONNX model

Language: Jupyter Notebook - Size: 273 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

funcwj/conv-tasnet

A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https://github.com/funcwj/aps)

Language: Python - Size: 181 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 198 - Forks: 60

anicolson/bidirectional_2018

A Deep Learning Approach to Ideal Binary Mask Estimation

Size: 8.8 MB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 9 - Forks: 3

hangtingchen/Beam-Guided-TasNet

Beam-guided TasNet

Language: Python - Size: 23.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 33 - Forks: 7

hmartelb/avlit

Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model" (AVLIT)

Language: Python - Size: 422 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 1

RishiKakade/Speech-Separating-Hearing-Aid

Language: JavaScript - Size: 10.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ooshyun/Speech-Enhancement-Pytorch

Pytorch Models for Speech Enhancement

Language: Python - Size: 3.34 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

JusperLee/Deep-Encoder-Decoder-Conv-TasNet

A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "

Language: Python - Size: 3.91 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 34 - Forks: 9

ZhaZhaFon/demo-confusion

This is a demo for our paper 'Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches'

Size: 35.5 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 6

xuchenglin28/speech_separation

Constrained Permutation Invariant Training, Speech Separation

Language: Python - Size: 1.13 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 33 - Forks: 9

JusperLee/DANet-For-Speech-Separation

Pytorch implement of DANet For Speech Separation

Language: Python - Size: 17.6 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 18 - Forks: 4

JusperLee/UtterancePIT-Speech-Separation

According to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.

Language: Python - Size: 34.2 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 59 - Forks: 10

funcwj/voice-filter

A unofficial Pytorch implementation of Google's VoiceFilter

Language: Python - Size: 4.67 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 80 - Forks: 21

haoxiangsnr/SpEx

Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".

Language: Python - Size: 18.6 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 26 - Forks: 8

Totoketchup/Adaptive-MultiSpeaker-Separation

Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem

Language: Jupyter Notebook - Size: 18 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 45 - Forks: 18

ZhaZhaFon/demo-samom

This is a demo for our paper 'Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction'.

Size: 4.08 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

HeliosX7/voice-filter

Unofficial Tensorflow/Keras implementation of Google AI VoiceFilter

Language: Jupyter Notebook - Size: 3.69 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 3

ZhaZhaFon/repo_asteroid Fork of asteroid-team/asteroid

语音前端仓库 || a modified version of Asteroid toolkit for Speech Front-end

Language: Python - Size: 5.68 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ZhaZhaFon/demo-speakerseparation

This is a demo for my bachelor thesis 'Speaker Separation and Machine Auditory Perception for Dialogue Scene'.

Language: Shell - Size: 2.91 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

ZitengWang/uPIT-for-speech-separation Fork of funcwj/uPIT-for-speech-separation

target speaker separation using a short adaptation utterance

Language: Python - Size: 36.1 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

Orelbenr/acoustic-fencing

Acoustic Fence Using Multi-Microphone Speaker Separation

Language: Python - Size: 8.16 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

lukereichold/visual-speech-separation

Flask app to demo multimodal deep learning speech separation in videos via TensorFlow Serving

Language: Python - Size: 20.8 MB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

SouppuoS/Multi-round-record

a simple implement for multi-round recordings

Language: Python - Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

dacson/Demo-of-Speech-Separation

single channel speech separation for music vocal and accompany separate、voice reduce noise

Size: 151 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 8 - Forks: 5

shun60s/Blind-Speech-Separation

U-Netによる音楽と音声のミックス信号(モノラル)からの音声の分離

Language: Python - Size: 28.3 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 2

Related Keywords
speech-separation 73 speech-enhancement 24 pytorch 23 deep-learning 20 source-separation 14 speech-processing 13 speech 12 audio 8 audio-separation 8 speech-recognition 8 speaker-extraction 5 pit 4 tensorflow 4 speech-to-text 4 beamforming 4 python 4 tasnet 4 auditory-attention 4 matlab 4 speech-analysis 3 noise-reduction 3 speech-front-end 3 demo 3 diarization 3 speaker-verification 3 permutation-invariant-training 3 conv-tasnet 3 end-to-end 3 kaldi 3 speaker-diarization 3 speech-synthesis 3 speech-translation 3 text-to-speech 3 multi-speaker 3 audio-processing 3 speech-dataset 2 multilingual 2 target-speaker-extraction 2 blind-source-separation 2 speaker-separation 2 paper 2 voice-separation 2 speaker-recognition 2 spatial-audio 2 multi-channel 2 denoising 2 deep-xi 2 keras 2 pytorch-implementation 2 automatic-speech-recognition 2 deep-neural-networks 2 tts 2 resnet 2 signal-processing 2 voice-recognition 2 deeplearning 2 spoken-language-understanding 2 voice-denoise 2 voice-conversion 2 asr 2 chainer 2 speechrecognition 2 speech-emotion-recognition 2 speech-diarization 2 speech-denoising 2 multi-head-attention 1 mmse-lsa 1 mmse 1 noise-estimation 1 residual-networks 1 minimum-mean-square-error 1 music-separation 1 robust-asr 1 tcn 1 cnn-architecture 1 cocktail-party 1 facenet 1 librosa 1 auditory-selection 1 cocktail-party-effect 1 resblstm-ibm 1 dnn 1 denoise 1 cnn 1 speech-corpus 1 speech-database 1 corpus 1 dsp 1 icassp 1 icassp-2017 1 icassp-2021 1 unsupervised-learning 1 bss 1 bss-algorithms 1 a-priori-snr-estimator 1 attention 1 tensorflow-serving 1 deepmmse 1 deepxi 1 multisensory 1