GitHub topics: multi-speaker
mikebrady/shairport-sync Fork of abrasive/shairport
AirPlay and AirPlay 2 audio player
Language: C - Size: 11.2 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 7,623 - Forks: 586

r9y9/deepvoice3_pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Language: Python - Size: 6.78 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 1,976 - Forks: 487

netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python - Size: 3.67 MB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 7,862 - Forks: 675

anton-jeran/MULTI-AUDIODEC
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
Language: Python - Size: 7.41 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 46 - Forks: 6

keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
Language: Python - Size: 143 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 325 - Forks: 42

keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Language: Python - Size: 3.45 MB - Last synced at: 11 days ago - Pushed at: almost 3 years ago - Stars: 146 - Forks: 19

aishoot/LSTM_PIT_Speech_Separation
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
Language: Jupyter Notebook - Size: 7.38 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 308 - Forks: 90

ranchlai/mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
Language: Python - Size: 85.4 MB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 446 - Forks: 106

nikitashvarts/CocktailPartySpeakerRecognition
An Algorithm for Speaker Recognition in a Multi-Speaker Environment
Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 1

ZoraizQ/urdu-speech-recognition
Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs using the PRUS dataset.
Language: Shell - Size: 1.16 GB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

keonlee9420/Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Language: Python - Size: 130 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 31 - Forks: 8

hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker
Multi-Speaker FastSpeech2 applicable to Korean. Description about train and synthesize in detail.
Language: Python - Size: 5.32 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 1

parisimaa/multi_speaker
Language: MATLAB - Size: 7.81 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Totoketchup/Adaptive-MultiSpeaker-Separation
Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem
Language: Jupyter Notebook - Size: 18 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 45 - Forks: 18
