Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: speech-separation

maum-ai/voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Language: Python - Size: 1.14 MB - Last synced: 10 days ago - Pushed: 4 months ago - Stars: 1,035 - Forks: 227

chimechallenge/chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

Language: Python - Size: 2.51 MB - Last synced: 16 days ago - Pushed: 17 days ago - Stars: 13 - Forks: 2

aishoot/LSTM_PIT_Speech_Separation

Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.

Language: Jupyter Notebook - Size: 7.38 MB - Last synced: 11 days ago - Pushed: over 2 years ago - Stars: 302 - Forks: 90

Audio-WestlakeU/FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Language: Python - Size: 892 KB - Last synced: 16 days ago - Pushed: 10 months ago - Stars: 508 - Forks: 148

kwatcharasupat/directional-sparse-filtering-tf

Python Implementation for Directional Sparse Filtering with Tensorflow/Keras

Language: Python - Size: 21.5 KB - Last synced: 24 days ago - Pushed: almost 3 years ago - Stars: 7 - Forks: 1

KyleZhang1118/Voice-Separation-and-Enhancement

A framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.

Language: MATLAB - Size: 35.5 MB - Last synced: 16 days ago - Pushed: over 2 years ago - Stars: 126 - Forks: 32

cyrta/awesome-speech-enhancement

A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.

Size: 13.7 KB - Last synced: 10 days ago - Pushed: over 4 years ago - Stars: 58 - Forks: 15

asteroid-team/asteroid

The PyTorch-based audio source separation toolkit for researchers

Language: Python - Size: 5.88 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 2,115 - Forks: 416

speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Language: HTML - Size: 46.7 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 356 - Forks: 28

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Language: Python - Size: 84.5 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7,821 - Forks: 1,272

espnet/espnet

End-to-End Speech Processing Toolkit

Language: Python - Size: 920 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7,825 - Forks: 2,083

microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Language: Python - Size: 72.4 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 388 - Forks: 70

posenhuang/deeplearningsourceseparation

Deep Recurrent Neural Networks for Source Separation

Language: MATLAB - Size: 500 MB - Last synced: 3 months ago - Pushed: almost 3 years ago - Stars: 363 - Forks: 136

JusperLee/Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

Size: 97.7 KB - Last synced: 3 months ago - Pushed: about 2 years ago - Stars: 693 - Forks: 132

double22a/speech_dataset

The dataset of Speech Recognition

Size: 62.5 KB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 333 - Forks: 66

coqui-ai/open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Size: 139 KB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 1,176 - Forks: 125

meokz/looking-to-listen

Deep neural network (DNN) for noise reduction, removal of background music, and speech separation

Language: Python - Size: 33 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 157 - Forks: 19

funcwj/setk

Tools for Speech Enhancement integrated with Kaldi

Language: Python - Size: 36.3 MB - Last synced: 3 months ago - Pushed: 11 months ago - Stars: 387 - Forks: 91

JusperLee/Dual-Path-RNN-Pytorch

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch

Language: Python - Size: 94.7 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 376 - Forks: 66

funcwj/aps

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

Language: Python - Size: 108 MB - Last synced: 3 months ago - Pushed: 11 months ago - Stars: 127 - Forks: 27

kaituoxu/Conv-TasNet

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Language: Python - Size: 1.23 MB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 621 - Forks: 149

AppleHolic/source_separation

Deep learning based speech source separation using Pytorch

Language: Jupyter Notebook - Size: 4.11 MB - Last synced: 3 months ago - Pushed: over 3 years ago - Stars: 307 - Forks: 45

gemengtju/Tutorial_Separation

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

Language: MATLAB - Size: 74.6 MB - Last synced: 3 months ago - Pushed: over 3 years ago - Stars: 403 - Forks: 93

mcw519/PureSound

Make the sound you hear pure and clean by deep learning.

Language: Python - Size: 137 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 6 - Forks: 0

eesungkim/Speech_Enhancement_DNN_NMF

Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF

Language: Python - Size: 18.5 MB - Last synced: 3 months ago - Pushed: about 5 years ago - Stars: 165 - Forks: 58

funcwj/deep-clustering

deep clustering method for single-channel speech separation

Language: Python - Size: 23.4 KB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 109 - Forks: 35

JusperLee/Deep-Clustering-for-Speech-Separation

Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation

Language: Python - Size: 94.7 KB - Last synced: 3 months ago - Pushed: almost 4 years ago - Stars: 115 - Forks: 25

anton-jeran/MULTI-AUDIODEC

This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.

Language: Python - Size: 3.25 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 31 - Forks: 5

e13000/directional_sparse_filtering

Directional sparse filtering for blind speech separation

Language: MATLAB - Size: 16.1 MB - Last synced: 24 days ago - Pushed: almost 3 years ago - Stars: 7 - Forks: 3

seanwood/gcc-nmf

Real-time GCC-NMF Blind Speech Separation and Enhancement

Language: Python - Size: 43.2 MB - Last synced: 3 months ago - Pushed: about 5 years ago - Stars: 303 - Forks: 132

JusperLee/Calculate-SNR-SDR

Script to calculate SNR and SDR using python

Language: Python - Size: 8.79 KB - Last synced: 3 months ago - Pushed: almost 4 years ago - Stars: 82 - Forks: 25

anicolson/DeepXi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

Language: MATLAB - Size: 497 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 463 - Forks: 119

JusperLee/Conv-TasNet

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement

Language: Python - Size: 75.2 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 336 - Forks: 68

JusperLee/Looking-to-Listen-at-the-Cocktail-Party

Executable code based on Google articles

Language: Python - Size: 81.5 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 149 - Forks: 43

etzinis/sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

Language: Jupyter Notebook - Size: 21 MB - Last synced: 7 months ago - Pushed: 11 months ago - Stars: 266 - Forks: 30

tky823/DNN-based_source_separation

A PyTorch implementation of DNN-based source separation.

Language: Python - Size: 293 MB - Last synced: 7 months ago - Pushed: about 2 years ago - Stars: 242 - Forks: 46

NikhilC2209/AVSpeech_Sep

Thesis project for Speech Separation using Deep Learning

Language: Jupyter Notebook - Size: 32 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

funcwj/uPIT-for-speech-separation

Speech separation with utterance-level PIT experiments

Language: Python - Size: 38.1 KB - Last synced: 3 months ago - Pushed: almost 6 years ago - Stars: 98 - Forks: 39

kaituoxu/TasNet

A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.

Language: Python - Size: 1.15 MB - Last synced: 8 months ago - Pushed: over 5 years ago - Stars: 96 - Forks: 30

jacoxu/ASAM

This is the code&dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]

Language: Python - Size: 49.1 MB - Last synced: 8 months ago - Pushed: about 6 years ago - Stars: 54 - Forks: 20

dangvansam/pyannote-onnx Fork of pyannote/pyannote-audio

PyAnnote with ONNX model

Language: Jupyter Notebook - Size: 273 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

funcwj/conv-tasnet

A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https://github.com/funcwj/aps)

Language: Python - Size: 181 MB - Last synced: 7 months ago - Pushed: 11 months ago - Stars: 198 - Forks: 60

anicolson/bidirectional_2018

A Deep Learning Approach to Ideal Binary Mask Estimation

Size: 8.8 MB - Last synced: 9 months ago - Pushed: almost 5 years ago - Stars: 9 - Forks: 3

hangtingchen/Beam-Guided-TasNet

Beam-guided TasNet

Language: Python - Size: 23.3 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 33 - Forks: 7

hmartelb/avlit

Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model" (AVLIT)

Language: Python - Size: 422 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 8 - Forks: 1

RishiKakade/Speech-Separating-Hearing-Aid

Language: JavaScript - Size: 10.7 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

ooshyun/Speech-Enhancement-Pytorch

Pytorch Models for Speech Enhancement

Language: Python - Size: 3.34 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

JusperLee/Deep-Encoder-Decoder-Conv-TasNet

A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "

Language: Python - Size: 3.91 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 34 - Forks: 9

ZhaZhaFon/demo-confusion

This is a demo for our paper 'Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches'

Size: 35.5 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 4 - Forks: 6

xuchenglin28/speech_separation

Constrained Permutation Invariant Training, Speech Separation

Language: Python - Size: 1.13 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 33 - Forks: 9

JusperLee/DANet-For-Speech-Separation

Pytorch implement of DANet For Speech Separation

Language: Python - Size: 17.6 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 18 - Forks: 4

JusperLee/UtterancePIT-Speech-Separation

According to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.

Language: Python - Size: 34.2 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 59 - Forks: 10

funcwj/voice-filter

A unofficial Pytorch implementation of Google's VoiceFilter

Language: Python - Size: 4.67 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 80 - Forks: 21

haoxiangsnr/SpEx

Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".

Language: Python - Size: 18.6 KB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 26 - Forks: 8

Totoketchup/Adaptive-MultiSpeaker-Separation

Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem

Language: Jupyter Notebook - Size: 18 MB - Last synced: over 1 year ago - Pushed: almost 6 years ago - Stars: 45 - Forks: 18

ZhaZhaFon/demo-samom

This is a demo for our paper 'Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction'.

Size: 4.08 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 1

HeliosX7/voice-filter

Unofficial Tensorflow/Keras implementation of Google AI VoiceFilter

Language: Jupyter Notebook - Size: 3.69 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 9 - Forks: 3

ZhaZhaFon/repo_asteroid Fork of asteroid-team/asteroid

语音前端仓库 || a modified version of Asteroid toolkit for Speech Front-end

Language: Python - Size: 5.68 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

ZhaZhaFon/demo-speakerseparation

This is a demo for my bachelor thesis 'Speaker Separation and Machine Auditory Perception for Dialogue Scene'.

Language: Shell - Size: 2.91 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

mborsdorf/TargetLanguageExtraction

Size: 21.5 KB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0

ZitengWang/uPIT-for-speech-separation Fork of funcwj/uPIT-for-speech-separation

target speaker separation using a short adaptation utterance

Language: Python - Size: 36.1 KB - Last synced: 11 months ago - Pushed: over 5 years ago - Stars: 1 - Forks: 1

Orelbenr/acoustic-fencing

Acoustic Fence Using Multi-Microphone Speaker Separation

Language: Python - Size: 8.16 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

lukereichold/visual-speech-separation

Flask app to demo multimodal deep learning speech separation in videos via TensorFlow Serving

Language: Python - Size: 20.8 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

SouppuoS/Multi-round-record

a simple implement for multi-round recordings

Language: Python - Size: 4.88 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

dacson/Demo-of-Speech-Separation

single channel speech separation for music vocal and accompany separate、voice reduce noise

Size: 151 MB - Last synced: over 1 year ago - Pushed: almost 5 years ago - Stars: 8 - Forks: 5

shun60s/Blind-Speech-Separation

U-Netによる音楽と音声のミックス信号(モノラル)からの音声の分離

Language: Python - Size: 28.3 MB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 3 - Forks: 2

Related Keywords
speech-separation 66 pytorch 22 speech-enhancement 22 deep-learning 17 source-separation 13 speech-processing 12 speech 10 speech-recognition 8 audio-separation 8 audio 7 pit 4 tasnet 4 speech-to-text 4 beamforming 4 tensorflow 4 speaker-verification 3 speech-front-end 3 speech-analysis 3 speaker-extraction 3 speaker-diarization 3 end-to-end 3 kaldi 3 demo 3 speech-synthesis 3 matlab 3 diarization 3 noise-reduction 3 conv-tasnet 3 multi-speaker 3 permutation-invariant-training 3 deep-xi 2 keras 2 speaker-separation 2 speech-emotion-recognition 2 speechrecognition 2 asr 2 audio-processing 2 resnet 2 auditory-attention 2 spoken-language-understanding 2 target-speaker-extraction 2 voice-recognition 2 chainer 2 tts 2 pytorch-implementation 2 text-to-speech 2 speech-translation 2 voice-conversion 2 speech-diarization 2 deep-neural-networks 2 voice-separation 2 denoising 2 paper 2 blind-source-separation 2 signal-processing 2 multi-channel 2 voice-denoise 2 speech-denoising 2 deeplearning 2 speaker-recognition 2 automatic-speech-recognition 2 cnn 1 auditory-selection 1 cocktail-party-effect 1 voicefilter 1 cocktail-party-problem 1 audio-segmentation 1 audio-split 1 audio-splitter 1 onnx 1 pyannote 1 speech-activity-detection 1 vad 1 voice-ac 1 brnn 1 estimator 1 gain 1 computer-vision 1 minimum-mean-square-error 1 mmse 1 mmse-lsa 1 mhanet 1 multi-head-attention 1 noise-estimation 1 deepxi 1 residual-networks 1 deepmmse 1 dnn 1 robust-asr 1 tcn 1 denoise 1 music-separation 1 cnn-architecture 1 cocktail-party 1 facenet 1 librosa 1 3d-convolutional-network 1 speech-dataset 1 speech-database 1 speech-corpus 1