Topic: "global-style-tokens"
syang1993/gst-tacotron
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
Language: Python - Size: 412 KB - Last synced at: 6 days ago - Pushed at: over 6 years ago - Stars: 367 - Forks: 110

keonlee9420/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Language: Python - Size: 101 MB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 193 - Forks: 27

hash2430/pitchtron
TTS for pitch-accented language. Korean dialect DB.
Language: Python - Size: 4.69 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 154 - Forks: 30

acetylSv/GST-tacotron
Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)
Language: Python - Size: 1.21 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 60 - Forks: 4

CODEJIN/Glow_TTS
An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.
Language: Python - Size: 2.89 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 35 - Forks: 13
