neural-tts | Topic | Ecosyste.ms: Repos

Topic: "neural-tts"

keonlee9420/DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Language: Python - Size: 121 MB - Last synced at: 16 days ago - Pushed at: about 3 years ago - Stars: 331 - Forks: 45

keonlee9420/PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Language: Python - Size: 129 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 331 - Forks: 36

keonlee9420/Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

Language: Python - Size: 143 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 325 - Forks: 42

keonlee9420/DiffSinger

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

Language: Python - Size: 133 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 233 - Forks: 30

KevinMIN95/StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Language: Python - Size: 1.35 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 212 - Forks: 38

keonlee9420/Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Language: Python - Size: 101 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 193 - Forks: 27

keonlee9420/Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Language: Python - Size: 99.3 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 189 - Forks: 45

keonlee9420/StyleSpeech

PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation

Language: Python - Size: 114 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 172 - Forks: 21

keonlee9420/Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

Language: Python - Size: 3.45 MB - Last synced at: 16 days ago - Pushed at: almost 3 years ago - Stars: 146 - Forks: 19

mush42/sonata

A cross-platform inference engine for neural TTS models.

Language: Rust - Size: 33.9 MB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 72 - Forks: 17

keonlee9420/VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Language: Python - Size: 122 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 72 - Forks: 14

keonlee9420/FastPitchFormant

PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Language: Python - Size: 101 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 69 - Forks: 13

keonlee9420/WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Language: Python - Size: 18 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 64 - Forks: 14

keonlee9420/Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Language: Python - Size: 110 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 48 - Forks: 14

keonlee9420/Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to improve the robustness and efficiency of the model.

Language: Python - Size: 130 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 31 - Forks: 8

Mobile-Artificial-Intelligence/babylon.cpp

Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. For phonemization, an ONNX Runtime port of the DeepPhonemizer model is used. For speech synthesis, VITS models are used. Piper models are compatible after a conversion script is run.

Language: Python - Size: 422 MB - Last synced at: 10 days ago - Pushed at: 8 months ago - Stars: 16 - Forks: 2
