Topic: "vocoder"
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python - Size: 162 MB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 39,507 - Forks: 5,002

PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language: Python - Size: 69.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 11,799 - Forks: 1,903

mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Language: Jupyter Notebook - Size: 120 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 9,792 - Forks: 1,291

open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python - Size: 126 MB - Last synced at: 4 days ago - Pushed at: 15 days ago - Stars: 8,975 - Forks: 702

fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language: Python - Size: 10.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8,382 - Forks: 1,185

TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Language: Python - Size: 130 MB - Last synced at: 18 days ago - Pushed at: 10 months ago - Stars: 3,914 - Forks: 812

jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language: Python - Size: 605 KB - Last synced at: 16 days ago - Pushed at: 9 months ago - Stars: 2,103 - Forks: 525

kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Language: Jupyter Notebook - Size: 34.8 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 1,601 - Forks: 345

mmorise/World
A high-quality speech analysis, manipulation and synthesis system
Language: C++ - Size: 878 KB - Last synced at: 15 days ago - Pushed at: 2 months ago - Stars: 1,222 - Forks: 256

haoheliu/voicefixer
General Speech Restoration
Language: Python - Size: 3.76 MB - Last synced at: 17 days ago - Pushed at: 2 months ago - Stars: 1,121 - Forks: 133

gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language: Python - Size: 17.3 MB - Last synced at: 23 days ago - Pushed at: 9 months ago - Stars: 909 - Forks: 107

lmnt-com/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Language: Python - Size: 20.5 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 823 - Forks: 116

Rongjiehuang/FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
Language: Python - Size: 2.98 MB - Last synced at: 22 days ago - Pushed at: 10 months ago - Stars: 410 - Forks: 61

ivanvovk/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
Language: Jupyter Notebook - Size: 18.1 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 384 - Forks: 52

rishikksh20/VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Language: Python - Size: 187 KB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 318 - Forks: 60

szechyjs/mbelib
P25 Phase 1 and ProVoice vocoder
Language: C++ - Size: 480 KB - Last synced at: 21 days ago - Pushed at: over 4 years ago - Stars: 285 - Forks: 119

lmnt-com/wavegrad
A fast, high-quality neural vocoder.
Language: Python - Size: 18.6 KB - Last synced at: 22 days ago - Pushed at: almost 2 years ago - Stars: 279 - Forks: 48

maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
Language: Python - Size: 22.7 MB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 272 - Forks: 46

sh123/codec2_talkie
Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)
Language: Java - Size: 27.5 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 243 - Forks: 40

rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Language: Python - Size: 865 KB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 208 - Forks: 45

maum-ai/phaseaug
ICASSP 2023 Accepted
Language: Python - Size: 47.8 MB - Last synced at: 9 days ago - Pushed at: 12 months ago - Stars: 189 - Forks: 14

descriptinc/cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
Language: Python - Size: 90.2 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 187 - Forks: 30

k2kobayashi/crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Language: Python - Size: 12.4 MB - Last synced at: 20 days ago - Pushed at: 9 months ago - Stars: 171 - Forks: 31

HidekiKawahara/legacy_STRAIGHT
A vocoder framework which had been widely used in research community since 1999.
Language: Matlab - Size: 19.8 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 164 - Forks: 41

yl4579/HiFTNet
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Language: Python - Size: 55.1 MB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 159 - Forks: 13

hhguo/MSMC-TTS
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
Language: Python - Size: 1.15 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 14

xcmyz/FastVocoder
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
Language: Python - Size: 7.34 MB - Last synced at: 25 days ago - Pushed at: almost 4 years ago - Stars: 154 - Forks: 19

NTT123/vietTTS
Vietnamese Text to Speech library
Language: Python - Size: 11.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 153 - Forks: 70

ncsoft/avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
Language: Python - Size: 17.6 KB - Last synced at: 18 days ago - Pushed at: about 2 years ago - Stars: 150 - Forks: 19

Rongjiehuang/Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Language: Python - Size: 34.8 MB - Last synced at: 22 days ago - Pushed at: almost 3 years ago - Stars: 138 - Forks: 20

geneing/WaveRNN-Pytorch Fork of G-Wang/WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
Language: Python - Size: 3.48 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 134 - Forks: 37

jurihock/stftPitchShift
STFT based real-time pitch and timbre shifting in C++ and Python
Language: C - Size: 2.21 MB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 133 - Forks: 17

X-LANCE/UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
Language: Python - Size: 1000 KB - Last synced at: 24 days ago - Pushed at: 11 months ago - Stars: 129 - Forks: 16

iamycy/golf
A DDSP-based neural voice synthesiser.
Language: Jupyter Notebook - Size: 634 MB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 116 - Forks: 9

rishikksh20/Avocodo-pytorch
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Language: Python - Size: 821 KB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 114 - Forks: 15

rishikksh20/Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Language: Python - Size: 600 KB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 99 - Forks: 32

syang1993/FFTNet
A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder
Language: Python - Size: 17.8 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 92 - Forks: 20

magnetophon/VoiceOfFaust
Turn your voice into a synthesizer!
Language: Faust - Size: 137 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 91 - Forks: 2

erogol/FFTNet
FFTNet vocoder implementation
Language: Jupyter Notebook - Size: 756 KB - Last synced at: 6 months ago - Pushed at: over 6 years ago - Stars: 81 - Forks: 8

tuan3w/cnn_vocoder 📦
A fast cnn-based vocoder
Language: Python - Size: 3.91 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 78 - Forks: 13

CSTR-Edinburgh/magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
Language: Python - Size: 18.5 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 77 - Forks: 31

BogiHsu/WG-WaveNet
Real-Time High-Fidelity Speech Synthesis without GPU
Language: Python - Size: 25.7 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 72 - Forks: 13

zzw922cn/LPC_for_TTS
Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.
Language: Python - Size: 652 KB - Last synced at: about 16 hours ago - Pushed at: about 4 years ago - Stars: 69 - Forks: 10

rishikksh20/UnivNet-pytorch
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
Language: Python - Size: 596 KB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 68 - Forks: 10

philsyn/DiffWave-Vocoder
Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.
Language: Python - Size: 81.1 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 66 - Forks: 4

azraelkuan/FFTNet
FFTNet: a Real-Time Speaker-Dependent Neural Vocoder
Language: Python - Size: 68.4 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 63 - Forks: 10

yistLin/universal-vocoder
A PyTorch implementation of the universal neural vocoder
Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 62 - Forks: 9

kaiidams/soundstream-pytorch
Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint
Language: Python - Size: 8.79 KB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 59 - Forks: 10

rishikksh20/melgan
MelGAN implementation with Multi-Band and Full Band supports...
Language: Jupyter Notebook - Size: 725 KB - Last synced at: 12 months ago - Pushed at: over 4 years ago - Stars: 59 - Forks: 15

ttop32/coqui_tts_korea
Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS
Language: Jupyter Notebook - Size: 2.79 MB - Last synced at: 22 days ago - Pushed at: about 3 years ago - Stars: 57 - Forks: 17

vtuber-plan/NSF-HiFiGAN
Vocoder NSF-HiFiGAN (Moved into deepaudio)
Language: Python - Size: 41 KB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 51 - Forks: 2

rishikksh20/iSTFT-Avocodo-pytorch
Ultrafast GAN based Vocoder for Text to Speech
Language: Python - Size: 1.05 MB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 51 - Forks: 7

hcy71o/AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
Language: Python - Size: 1.35 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 48 - Forks: 9

DongyaoZhu/Real-Time-Accent-Conversion
Real Time Foreign Accent Conversion
Language: Python - Size: 1.03 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 46 - Forks: 8

revsic/tf-diffwave
Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis
Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: 27 days ago - Pushed at: over 4 years ago - Stars: 40 - Forks: 10

solalala-12/Singing-Voice-Conversion
2019/04~2019/09 투빅스 Singing Voice Conversion
Language: Jupyter Notebook - Size: 175 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 39 - Forks: 10

yoyolicoris/pytorch_FFTNet
A pytorch implementation of FFTNet.
Language: Python - Size: 547 KB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 37 - Forks: 4

jurihock/stftPitchShiftPlugin
Official JUCE plugin for stftPitchShift
Language: C++ - Size: 659 KB - Last synced at: 25 days ago - Pushed at: 10 months ago - Stars: 36 - Forks: 3

rishikksh20/multiband-hifigan Fork of jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language: Python - Size: 625 KB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 35 - Forks: 3

YuzukiTsuru/World.JS
World.JS is a JavaScript Wrapper for World Vocoder Powered by Emscripten
Language: C++ - Size: 1.55 MB - Last synced at: 18 days ago - Pushed at: almost 3 years ago - Stars: 33 - Forks: 4

vtuber-plan/hifi-gan
An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.
Language: Python - Size: 109 KB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 31 - Forks: 5

azraelkuan/repgan
RepVgg + HiFiGAN
Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 31 - Forks: 6

ljuvela/GlottDNN
GlottDNN vocoder and tools for training DNN excitation models
Language: C++ - Size: 3.34 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 30 - Forks: 5

Rongjiehuang/Multiband-WaveRNN
An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio samples in https://rongjiehuang.github.io/Multiband-WaveRNN/
Language: Python - Size: 19.2 MB - Last synced at: 22 days ago - Pushed at: about 4 years ago - Stars: 29 - Forks: 5

azraelkuan/tensorflow_wavenet_vocoder
wavenet vocoder using tensorflow
Language: Python - Size: 3.13 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 27 - Forks: 11

ryhorv/tf-flowavenet
Tensorflow implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Language: Jupyter Notebook - Size: 6.12 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 25 - Forks: 3

BakerBunker/FreeV
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
Language: Python - Size: 44.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 23 - Forks: 3

jurihock/voyx
Standalone real time dynamic vocal harmonizer
Language: C++ - Size: 1.02 MB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 3

kokeshing/WaveNet-tf2
WaveNet with TensorFlow 2.0
Language: Python - Size: 22.5 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 23 - Forks: 6

fuziki/WorldInApple
Swift wrapper for vocoder World(https://github.com/mmorise/World)
Language: Swift - Size: 27.3 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 2

yoyolicoris/wavenet-like-vocoder
Basic wavenet and fftnet vocoder model.
Language: Python - Size: 46.9 KB - Last synced at: 3 days ago - Pushed at: about 3 years ago - Stars: 19 - Forks: 2

mmorise/NoiseGenerators
Noise generators for vocoder
Language: C++ - Size: 11.7 KB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 19 - Forks: 2

TUD-STKS/VocalTractLab-dev
VocalTractLab development repo
Language: C++ - Size: 14.3 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 15 - Forks: 3

mipuc/hts-engine-world
Language: C - Size: 646 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 14 - Forks: 8

geneing/WaveRNN Fork of fatchord/WaveRNN
Pytorch implementation of Deepmind's WaveRNN model
Language: Jupyter Notebook - Size: 144 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 13 - Forks: 1

kwarc93/audio-multieffect
Guitar multi effect running on STM32F746-DISCO board
Language: C - Size: 7.84 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 12 - Forks: 3

vtuber-plan/iSTFTNet
iSTFTNet Vocoder PyTorch Implement
Language: Python - Size: 35.2 KB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

ricardokleinklein/deepMultiSpeech
Deep Multi-Speech model
Language: Python - Size: 281 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 11 - Forks: 5

TariqAHassan/HiFiHybrid
Hifi-like Vocoder implemented in PyTorch
Language: Python - Size: 3.21 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 3

Bycob/harmonizer
Jacob Collier-like harmonizer, because I'm jealous and I want a choir for myself too
Language: C - Size: 2.87 MB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 0

egorsmkv/radtts-hifigan
RADTTS + HiFiGAN vocoder
Language: Python - Size: 62.5 KB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 9 - Forks: 2

revsic/speechset
Numpy-librosa implementation of Speech dataset pipeline
Language: Python - Size: 63.5 KB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 6

manhph2211/ViTTS
In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system :smile: In general, I used Portaspeech as an acoustic model and iSTFTNet as vocoder...
Language: Python - Size: 768 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 0

will-rice/diffwave
TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis. (WIP)
Language: Python - Size: 26.4 KB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 0

Tom-McDermott/gr-ThumbDV
AMBE Vocoder using NWDR USB AMBE stick
Language: Python - Size: 13.7 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 1

34j/neural-source-filter
Python package for NSF and NSF-HiFi-GAN (unofficial)
Language: Python - Size: 602 KB - Last synced at: 13 days ago - Pushed at: 20 days ago - Stars: 6 - Forks: 0

NTT123/hifigan-tpu
Train HiFi-GAN on TPU
Language: Python - Size: 641 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 0

AlexIII/g729a-python Fork of ploverlake/g729a
G.729А audio codec for python 3
Language: C - Size: 2.07 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 1

takenori-y/pylstraight
An unofficial Python reimplementation of the legacy-STRAIGHT
Language: Python - Size: 4.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 1

yas-sim/csm_voice_encode_synthesis_python
Expermental code for CSM voice synthesis + CSM data generation
Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

ucdrstdenis/Iris
Iris: A Phase Vocoder ♪♫♬
Language: MATLAB - Size: 117 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

egorsmkv/radtts-uk
🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model
Size: 25.4 KB - Last synced at: 30 days ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 1

Rongjiehuang/SingGAN Fork of SingGAN/SingGAN.github.io
Project page for SingGAN (ACM-MM' 2022): Generative Adversarial Network For High-Fidelity Singing Voice Generation
Size: 78.5 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

NTT123/wavernn-16bit
The (unofficial) vanilla version of WaveRNN
Language: Python - Size: 47.9 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

gosha20777/LPCNet Fork of xiph/LPCNet
Efficient neural speech synthesis
Language: C - Size: 264 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

monocasual/vocoder
Probably one of the best text-to-speech online apps in the world (if your browser supports it).
Language: HTML - Size: 43 KB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 4

egorsmkv/radtts-istftnet
RADTTS + iSTFTNet vocoder
Language: Python - Size: 60.5 KB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

mangushev/aligntts
Implementation of ALIGNTTS: EFFICIENT FEED-FORWARD TEXT-TO-SPEECH SYSTEMWITHOUT EXPLICIT ALIGNMENT (arXiv:2003.01950v1 [eess.AS] 4 Mar 2020)
Language: Python - Size: 21.1 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 3

elephantmipt/MelGAN
MelGAN with catalyst framework
Language: Python - Size: 2.54 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

OpenT2S/inferStreamHiFiGAN
StreamHiFiGAN offers a HiFiGAN vocoder model optimized for streaming inference, providing real-time audio synthesis capabilities.
Language: Python - Size: 4.47 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0
