Topic: "vocoder"

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python - Size: 162 MB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 40,112 - Forks: 5,134

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language: Python - Size: 69.2 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 11,899 - Forks: 1,912

mozilla/TTS

Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language: Jupyter Notebook - Size: 120 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 9,835 - Forks: 1,296

open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python - Size: 126 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 9,063 - Forks: 713

fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

Language: Python - Size: 10.5 MB - Last synced at: 6 days ago - Pushed at: 9 days ago - Stars: 8,415 - Forks: 1,192

TensorSpeech/TensorFlowTTS

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Language: Python - Size: 130 MB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 3,924 - Forks: 810

jik876/hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python - Size: 605 KB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 2,136 - Forks: 530

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language: Jupyter Notebook - Size: 34.8 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 1,604 - Forks: 345

mmorise/World

A high-quality speech analysis, manipulation and synthesis system

Language: C++ - Size: 878 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1,233 - Forks: 257

haoheliu/voicefixer

General Speech Restoration

Language: Python - Size: 3.76 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1,149 - Forks: 139

gemelo-ai/vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language: Python - Size: 17.3 MB - Last synced at: 15 days ago - Pushed at: 10 months ago - Stars: 923 - Forks: 110

lmnt-com/diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Language: Python - Size: 20.5 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 831 - Forks: 116

Rongjiehuang/FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

Language: Python - Size: 2.98 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 410 - Forks: 61

ivanvovk/WaveGrad

Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.

Language: Jupyter Notebook - Size: 18.1 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 384 - Forks: 52

rishikksh20/VocGAN

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Language: Python - Size: 187 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 318 - Forks: 60

szechyjs/mbelib

P25 Phase 1 and ProVoice vocoder

Language: C++ - Size: 480 KB - Last synced at: 13 days ago - Pushed at: over 4 years ago - Stars: 292 - Forks: 123

lmnt-com/wavegrad

A fast, high-quality neural vocoder.

Language: Python - Size: 18.6 KB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 284 - Forks: 48

maum-ai/univnet

Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

Language: Python - Size: 22.7 MB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 271 - Forks: 46

sh123/codec2_talkie

Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)

Language: Java - Size: 27.5 MB - Last synced at: about 1 hour ago - Pushed at: about 1 month ago - Stars: 247 - Forks: 40

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Language: Python - Size: 865 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 208 - Forks: 45

maum-ai/phaseaug

ICASSP 2023 Accepted

Language: Python - Size: 47.8 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 189 - Forks: 14

descriptinc/cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

Language: Python - Size: 90.2 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 187 - Forks: 30

k2kobayashi/crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

Language: Python - Size: 12.4 MB - Last synced at: 5 days ago - Pushed at: 10 months ago - Stars: 171 - Forks: 31

HidekiKawahara/legacy_STRAIGHT

A vocoder framework that has been widely used in the research community since 1999.

Language: Matlab - Size: 19.8 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 164 - Forks: 41

yl4579/HiFTNet

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Language: Python - Size: 55.1 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 162 - Forks: 13

hhguo/MSMC-TTS

Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS

Language: Python - Size: 1.15 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 14

xcmyz/FastVocoder

Includes Basis-MelGAN, MelGAN, HifiGAN, and Multiband-HifiGAN; NHV may be added in the future.

Language: Python - Size: 7.34 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 154 - Forks: 19

NTT123/vietTTS

Vietnamese Text to Speech library

Language: Python - Size: 11.8 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 153 - Forks: 70

ncsoft/avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

Language: Python - Size: 17.6 KB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 152 - Forks: 19

jurihock/stftPitchShift

STFT based real-time pitch and timbre shifting in C++ and Python

Language: C - Size: 2.21 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 141 - Forks: 17

Rongjiehuang/Multi-Singer

PyTorch Implementation of Multi-Singer (ACM-MM'21)

Language: Python - Size: 34.8 MB - Last synced at: 5 days ago - Pushed at: about 3 years ago - Stars: 138 - Forks: 20

geneing/WaveRNN-Pytorch (fork of G-Wang/WaveRNN-Pytorch)

Fatcord's Alternative WaveRNN (Faster training)

Language: Python - Size: 3.48 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 134 - Forks: 37

X-LANCE/UniCATS-CTX-vec2wav

[AAAI 2024] Code for CTX-vec2wav in UniCATS

Language: Python - Size: 1000 KB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 129 - Forks: 16

iamycy/golf

A DDSP-based neural voice synthesiser.

Language: Jupyter Notebook - Size: 634 MB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 117 - Forks: 9

rishikksh20/Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Language: Python - Size: 821 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 114 - Forks: 15

magnetophon/VoiceOfFaust

Turn your voice into a synthesizer!

Language: Faust - Size: 137 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 112 - Forks: 2

rishikksh20/Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Language: Python - Size: 600 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 99 - Forks: 32

syang1993/FFTNet

A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

Language: Python - Size: 17.8 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 92 - Forks: 20

erogol/FFTNet

FFTNet vocoder implementation

Language: Jupyter Notebook - Size: 756 KB - Last synced at: 24 days ago - Pushed at: over 6 years ago - Stars: 81 - Forks: 8

tuan3w/cnn_vocoder 📦

A fast cnn-based vocoder

Language: Python - Size: 3.91 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 78 - Forks: 13

CSTR-Edinburgh/magphase

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Language: Python - Size: 18.5 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 77 - Forks: 31

BogiHsu/WG-WaveNet

Real-Time High-Fidelity Speech Synthesis without GPU

Language: Python - Size: 25.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 72 - Forks: 13

zzw922cn/LPC_for_TTS

Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.

Language: Python - Size: 652 KB - Last synced at: 28 days ago - Pushed at: about 4 years ago - Stars: 69 - Forks: 10

rishikksh20/UnivNet-pytorch

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Language: Python - Size: 596 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 68 - Forks: 10

philsyn/DiffWave-Vocoder

Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.

Language: Python - Size: 81.1 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 66 - Forks: 4

azraelkuan/FFTNet

FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

Language: Python - Size: 68.4 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 63 - Forks: 10

yistLin/universal-vocoder

A PyTorch implementation of the universal neural vocoder

Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 62 - Forks: 9

kaiidams/soundstream-pytorch

Unofficial PyTorch implementation of SoundStream with training code and a 16 kHz pretrained checkpoint

Language: Python - Size: 8.79 KB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 59 - Forks: 10

rishikksh20/melgan

MelGAN implementation with Multi-Band and Full-Band support...

Language: Jupyter Notebook - Size: 725 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 59 - Forks: 15

ttop32/coqui_tts_korea

Korean TTS using coqui TTS (glowtts and multiband melgan)

Language: Jupyter Notebook - Size: 2.79 MB - Last synced at: 25 days ago - Pushed at: over 3 years ago - Stars: 57 - Forks: 17

vtuber-plan/NSF-HiFiGAN

Vocoder NSF-HiFiGAN (Moved into deepaudio)

Language: Python - Size: 41 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 51 - Forks: 2

rishikksh20/iSTFT-Avocodo-pytorch

Ultrafast GAN based Vocoder for Text to Speech

Language: Python - Size: 1.05 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 51 - Forks: 7

hcy71o/AutoVocoder

Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing

Language: Python - Size: 1.35 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 48 - Forks: 9

DongyaoZhu/Real-Time-Accent-Conversion

Real Time Foreign Accent Conversion

Language: Python - Size: 1.03 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 46 - Forks: 8

revsic/tf-diffwave

Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis

Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: 19 days ago - Pushed at: over 4 years ago - Stars: 41 - Forks: 10

solalala-12/Singing-Voice-Conversion

2019/04~2019/09 투빅스 Singing Voice Conversion

Language: Jupyter Notebook - Size: 175 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 39 - Forks: 10

yoyolicoris/pytorch_FFTNet

A pytorch implementation of FFTNet.

Language: Python - Size: 547 KB - Last synced at: 4 days ago - Pushed at: over 6 years ago - Stars: 37 - Forks: 4

jurihock/stftPitchShiftPlugin

Official JUCE plugin for stftPitchShift

Language: C++ - Size: 659 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 36 - Forks: 3

rishikksh20/multiband-hifigan (fork of jik876/hifi-gan)

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python - Size: 625 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 35 - Forks: 3

YuzukiTsuru/World.JS

World.JS is a JavaScript Wrapper for World Vocoder Powered by Emscripten

Language: C++ - Size: 1.55 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 33 - Forks: 4

vtuber-plan/hifi-gan

A high-resolution implementation of the HiFi-GAN vocoder for voice conversion.

Language: Python - Size: 109 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 31 - Forks: 5

azraelkuan/repgan

RepVgg + HiFiGAN

Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 31 - Forks: 6

ljuvela/GlottDNN

GlottDNN vocoder and tools for training DNN excitation models

Language: C++ - Size: 3.34 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 30 - Forks: 5

Rongjiehuang/Multiband-WaveRNN

An unofficial implementation of the autoregressive vocoder Multiband-WaveRNN. Audio samples at https://rongjiehuang.github.io/Multiband-WaveRNN/

Language: Python - Size: 19.2 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 29 - Forks: 5

azraelkuan/tensorflow_wavenet_vocoder

wavenet vocoder using tensorflow

Language: Python - Size: 3.13 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 27 - Forks: 11

ryhorv/tf-flowavenet

Tensorflow implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Language: Jupyter Notebook - Size: 6.12 MB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 25 - Forks: 3

BakerBunker/FreeV

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Language: Python - Size: 44.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 23 - Forks: 3

jurihock/voyx

Standalone real time dynamic vocal harmonizer

Language: C++ - Size: 1.02 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 3

kokeshing/WaveNet-tf2

WaveNet with TensorFlow 2.0

Language: Python - Size: 22.5 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 23 - Forks: 6

fuziki/WorldInApple

Swift wrapper for the World vocoder (https://github.com/mmorise/World)

Language: Swift - Size: 27.3 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 2

yoyolicoris/wavenet-like-vocoder

Basic wavenet and fftnet vocoder model.

Language: Python - Size: 46.9 KB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 2

mmorise/NoiseGenerators

Noise generators for vocoder

Language: C++ - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 19 - Forks: 2

kwarc93/audio-multieffect

Guitar multi effect running on STM32F746-DISCO board

Language: C - Size: 7.87 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 17 - Forks: 5

TUD-STKS/VocalTractLab-dev

VocalTractLab development repo

Language: C++ - Size: 14.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 15 - Forks: 3

mipuc/hts-engine-world

Language: C - Size: 646 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 14 - Forks: 8

geneing/WaveRNN (fork of fatchord/WaveRNN)

Pytorch implementation of Deepmind's WaveRNN model

Language: Jupyter Notebook - Size: 144 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 13 - Forks: 1

vtuber-plan/iSTFTNet

iSTFTNet vocoder PyTorch implementation

Language: Python - Size: 35.2 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

ricardokleinklein/deepMultiSpeech

Deep Multi-Speech model

Language: Python - Size: 281 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 11 - Forks: 5

TariqAHassan/HiFiHybrid

Hifi-like Vocoder implemented in PyTorch

Language: Python - Size: 3.21 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 3

Bycob/harmonizer

Jacob Collier-like harmonizer, because I'm jealous and I want a choir for myself too

Language: C - Size: 2.87 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 0

egorsmkv/radtts-hifigan

RADTTS + HiFiGAN vocoder

Language: Python - Size: 62.5 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 9 - Forks: 2

revsic/speechset

Numpy-librosa implementation of Speech dataset pipeline

Language: Python - Size: 63.5 KB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 6

manhph2211/ViTTS

In this repo, I developed a step-by-step pipeline for a standard multi-speaker Text-to-Speech system. In general, I used Portaspeech as the acoustic model and iSTFTNet as the vocoder...

Language: Python - Size: 768 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 0

will-rice/diffwave

TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis. (WIP)

Language: Python - Size: 26.4 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 0

34j/neural-source-filter

Python package for NSF and NSF-HiFi-GAN (unofficial)

Language: Python - Size: 604 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 0

Tom-McDermott/gr-ThumbDV

AMBE Vocoder using NWDR USB AMBE stick

Language: Python - Size: 13.7 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 1

NTT123/hifigan-tpu

Train HiFi-GAN on TPU

Language: Python - Size: 641 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 0

AlexIII/g729a-python (fork of ploverlake/g729a)

G.729A audio codec for Python 3

Language: C - Size: 2.07 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 1

takenori-y/pylstraight

An unofficial Python reimplementation of the legacy-STRAIGHT

Language: Python - Size: 4.2 MB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 5 - Forks: 1

yas-sim/csm_voice_encode_synthesis_python

Experimental code for CSM voice synthesis + CSM data generation

Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

ucdrstdenis/Iris

Iris: A Phase Vocoder ♪♫♬

Language: MATLAB - Size: 117 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 0

egorsmkv/radtts-uk

🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model

Size: 25.4 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

Rongjiehuang/SingGAN (fork of SingGAN/SingGAN.github.io)

Project page for SingGAN (ACM-MM' 2022): Generative Adversarial Network For High-Fidelity Singing Voice Generation

Size: 78.5 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

NTT123/wavernn-16bit

The (unofficial) vanilla version of WaveRNN

Language: Python - Size: 47.9 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

gosha20777/LPCNet (fork of xiph/LPCNet)

Efficient neural speech synthesis

Language: C - Size: 264 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0