Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: fastspeech2

TensorSpeech/TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Language: Python - Size: 130 MB - Last synced: 24 days ago - Pushed: 6 months ago - Stars: 3,704 - Forks: 792

ranchlai/mandarin-tts

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets

Language: Python - Size: 85.4 MB - Last synced: 28 days ago - Pushed: almost 2 years ago - Stars: 446 - Forks: 106

rishikksh20/FastSpeech2

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

Language: Jupyter Notebook - Size: 11.6 MB - Last synced: 24 days ago - Pushed: almost 2 years ago - Stars: 211 - Forks: 52

open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python - Size: 10.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3,824 - Forks: 304

ssmlkl/MnTTS2

This is the experimental description of MnTTS2.

Language: Jupyter Notebook - Size: 39.6 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 7 - Forks: 2

rishikksh20/AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Language: Jupyter Notebook - Size: 4.05 MB - Last synced: 24 days ago - Pushed: over 2 years ago - Stars: 155 - Forks: 40

rishikksh20/LightSpeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Language: Python - Size: 3.23 MB - Last synced: 24 days ago - Pushed: over 2 years ago - Stars: 77 - Forks: 7

PaddlePaddle/Parakeet 📦

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)

Language: Python - Size: 9.32 MB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 599 - Forks: 82

majidAdibian77/ResGrad

Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech

Language: Python - Size: 2.23 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 11 - Forks: 4

utkarsh2299/Fastspeech2_HS

Created this repo as a part of the project "Speech Technologies in Indian languages". About Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quality of synthesis, as well as small foot print TTS integrated with disability aids and various other applications.

Language: Perl - Size: 1.03 GB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

ZDisket/TensorVox

Desktop application for neural speech synthesis written in C++

Language: C++ - Size: 15.5 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 197 - Forks: 18

keonlee9420/Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

Language: Python - Size: 3.45 MB - Last synced: 7 months ago - Pushed: almost 2 years ago - Stars: 136 - Forks: 19

keonlee9420/Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

Language: Python - Size: 143 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 288 - Forks: 39

lordzuko/FastSpeech2-jax

Implementation of FastSpeech2 in JAX

Size: 1000 Bytes - Last synced: 9 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

tuanh123789/AdaSpeech

An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"

Language: Python - Size: 50.4 MB - Last synced: 9 months ago - Pushed: almost 2 years ago - Stars: 80 - Forks: 23

ga642381/FastSpeech2

Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:

Language: Python - Size: 39.7 MB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 79 - Forks: 16

hwRG/End-to-End-TTS-Fine-Tune

Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.

Language: Python - Size: 33.4 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 17 - Forks: 7

lordzuko/SpeakingStyle

Aligning latent space of speaking style with human perception using a re-embedding strategy

Language: Jupyter Notebook - Size: 133 MB - Last synced: 9 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0

deepaudio/deepaudio-tts

Language: Python - Size: 362 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 10 - Forks: 2

alessandropec/data_driven_ai_voice_cloning

This repository contain the code of the main part of my master thesis degree at Politecnico di Torino in Data science & Engineering

Language: Python - Size: 268 MB - Last synced: 12 months ago - Pushed: about 1 year ago - Stars: 7 - Forks: 1

quackson/DG_HW

homework for deep generation. Combine FastSpeech2 with different vocoders ⭐REFERENCE (modify origin repos): https://github.com/ming024/FastSpeech2 https://github.com/NVIDIA/waveglow https://github.com/mindslab-ai/univnet https://github.com/jik876/hifi-gan

Language: Python - Size: 34.1 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker

Multi-Speaker FastSpeech2 applicable to Korean. Description about train and synthesize in detail.

Language: Python - Size: 5.32 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 7 - Forks: 1

xcmyz/FastSpeech2

The Implementation of FastSpeech2 Based on Pytorch.

Language: Python - Size: 4.03 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 53 - Forks: 9

nikolaStanojkovski/Talk_Through_Me

An Android application that acts as a speaking assistant for the hearing impaired people.

Language: Python - Size: 108 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

nikolaStanojkovski/Assistive_Bus_Helper

An Android application that allows visually impaired people to hear which bus lines are passing next to them.

Language: Python - Size: 172 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 0

AppleHolic/FastSpeech2

Refactored version of https://github.com/ming024/FastSpeech2

Language: Python - Size: 1.35 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 10 - Forks: 2

gagan3012/image2audio

Convert Image to audio using ViT, GPT and FastSpeech

Language: Python - Size: 32.2 KB - Last synced: 24 days ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

dathudeptrai/FastSpeech2

A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech

Size: 7.14 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 10 - Forks: 0