GitHub topics: tacotron
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python - Size: 162 MB - Last synced at: about 14 hours ago - Pushed at: 9 months ago - Stars: 39,755 - Forks: 5,056

MycroftAI/mimic-recording-studio
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
Language: JavaScript - Size: 5.94 MB - Last synced at: 1 day ago - Pushed at: about 2 years ago - Stars: 508 - Forks: 119

mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Language: Jupyter Notebook - Size: 120 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 9,792 - Forks: 1,291

Emotional-Text-to-Speech/dl-for-emo-tts
:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:
Language: Jupyter Notebook - Size: 5.26 MB - Last synced at: 5 days ago - Pushed at: 10 months ago - Stars: 447 - Forks: 44

google/tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Language: HTML - Size: 1.05 GB - Last synced at: 14 days ago - Pushed at: about 1 month ago - Stars: 540 - Forks: 83

syang1993/gst-tacotron
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
Language: Python - Size: 412 KB - Last synced at: 8 days ago - Pushed at: over 6 years ago - Stars: 367 - Forks: 110

keithito/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Language: Python - Size: 110 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 2,975 - Forks: 955

fatchord/WaveRNN
WaveRNN Vocoder + TTS
Language: Python - Size: 236 MB - Last synced at: 19 days ago - Pushed at: almost 3 years ago - Stars: 2,157 - Forks: 698

Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Language: Python - Size: 8.94 MB - Last synced at: 20 days ago - Pushed at: almost 2 years ago - Stars: 2,308 - Forks: 913

slegroux/nimrod
minimal deep learning framework
Language: Jupyter Notebook - Size: 119 MB - Last synced at: 10 days ago - Pushed at: 16 days ago - Stars: 2 - Forks: 0

falkyn7/text-toolkit
Advanced MCP server providing comprehensive text transformation and formatting tools. TextToolkit offers over 40 specialized utilities for case conversion, encoding/decoding, formatting, analysis, and text manipulation - all accessible directly within your AI assistant workflow.
Language: TypeScript - Size: 1.36 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Kyubyong/expressive_tacotron
Tensorflow Implementation of Expressive Tacotron
Language: Python - Size: 5.8 MB - Last synced at: 10 days ago - Pushed at: over 6 years ago - Stars: 196 - Forks: 34

r9y9/tacotron_pytorch
PyTorch implementation of Tacotron speech synthesis model.
Language: Jupyter Notebook - Size: 20.7 MB - Last synced at: 24 days ago - Pushed at: almost 6 years ago - Stars: 309 - Forks: 78

bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Language: Python - Size: 1.01 MB - Last synced at: 26 days ago - Pushed at: over 4 years ago - Stars: 114 - Forks: 25

spring-media/ForwardTacotron Fork of fatchord/WaveRNN
⏩ Generating speech in a single forward pass without any attention!
Language: Python - Size: 203 MB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 579 - Forks: 112

karim23657/Persian-tts-coqui
Persian/Farsi text to speech(TTS) training using coqui tts
Language: Jupyter Notebook - Size: 53.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 128 - Forks: 18

stefantaubert/en-tts
Command-line interface and Python library for synthesizing English texts into speech.
Language: Python - Size: 804 KB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

Kyubyong/tacotron_asr
Speech Recognition Using Tacotron
Language: Python - Size: 4.65 MB - Last synced at: 10 days ago - Pushed at: over 7 years ago - Stars: 163 - Forks: 39

vlomme/Multi-Tacotron-Voice-Cloning 📦
Phoneme multilingual(Russian-English) voice cloning based on
Language: Python - Size: 985 KB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 390 - Forks: 96

MysteryPancake/Discord-TTS
Text to speech Discord bot using FakeYou
Language: JavaScript - Size: 157 KB - Last synced at: 23 days ago - Pushed at: about 2 years ago - Stars: 39 - Forks: 43

soobinseo/Tacotron-pytorch
Pytorch implementation of Tacotron
Language: Python - Size: 1.02 MB - Last synced at: 6 months ago - Pushed at: over 6 years ago - Stars: 206 - Forks: 41

StarxSky/tacotron2-JP
Base on "tacotron2-jpanese" builded & change
Language: Jupyter Notebook - Size: 1.52 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

MycroftAI/mimic2 Fork of keithito/tacotron
Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
Language: Python - Size: 816 KB - Last synced at: 6 months ago - Pushed at: almost 4 years ago - Stars: 580 - Forks: 103

dongheehand/Tacotron-PyTorch
PyTorch implementation of Tacotron
Language: Python - Size: 1.15 MB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 1

stefantaubert/zh-tts
Web app, command-line interface and Python library for synthesizing Chinese texts into speech.
Language: Python - Size: 2.04 MB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 1

sooftware/tacotron2
Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
Language: Python - Size: 89.8 KB - Last synced at: 24 days ago - Pushed at: over 4 years ago - Stars: 19 - Forks: 3

ide8/tacotron2
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
Language: Jupyter Notebook - Size: 2.96 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 128 - Forks: 26

BogiHsu/Tacotron2-PyTorch
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
Language: Python - Size: 2.9 MB - Last synced at: 9 months ago - Pushed at: about 3 years ago - Stars: 142 - Forks: 38

Orca0917/Tacotron-pytorch
Unofficial implementation of Tacotron(2017) using PyTorch.
Language: Jupyter Notebook - Size: 6.16 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Adibian/persian_tacotron
Training Tacotron2 for Persian language as a Persian text-to-speech
Language: Jupyter Notebook - Size: 4.24 MB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 1

andi611/CS-Tacotron-Pytorch
Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.
Language: Python - Size: 155 MB - Last synced at: 20 days ago - Pushed at: about 6 years ago - Stars: 23 - Forks: 6

DanRuta/xVA-Synth
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
Language: JavaScript - Size: 1.14 GB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 580 - Forks: 54

ranchlai/mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
Language: Python - Size: 85.4 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 446 - Forks: 106

lygon55555/EDA
Emergency Disaster Alert (형남과학상 공모전)
Language: JavaScript - Size: 4.27 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

andi611/TTS-Tacotron-Pytorch
Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.
Language: Python - Size: 79.5 MB - Last synced at: 20 days ago - Pushed at: about 6 years ago - Stars: 29 - Forks: 10

hpbyte/myanmar-tts
Myanmar Text-to-Speech with End-to-End Speech Synthesis
Language: Python - Size: 98.6 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 3

Foxify52/RVG_tts
A retrieval based voice generation text to speech
Language: Python - Size: 233 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 2

everydaycodings/MimicMania
MimicMania is a web application that allows you to generate speech and clone voices using text-to-speech technology. With MimicMania, you can create custom voices in a variety of languages and use them for a range of applications, from voiceovers to chatbots.
Language: Python - Size: 1010 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 59 - Forks: 12

rishikksh20/gmvae_tacotron
Gaussian Mixture VAE Tacotron
Language: Python - Size: 56.6 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 51 - Forks: 12

rishikksh20/vae_tacotron2
VAE Tacotron 2, an alternative of GST Tacotron
Language: Python - Size: 63.5 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 85 - Forks: 29

adasegroup/OSM-one-shot-multispeaker
Framework for one-shot multispeaker system based on Deep Learning
Language: Python - Size: 46 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 19 - Forks: 4

gia-guar/JARVIS-ChatGPT
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.
Language: Python - Size: 107 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 280 - Forks: 67

NTT123/vietTTS
Vietnamese Text to Speech library
Language: Python - Size: 11.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 153 - Forks: 70

atomicoo/tacotron2-mandarin
Tensorflow implementation of Chinese/Mandarin TTS (Text-to-Speech) based on Tacotron-2 model.
Language: Python - Size: 8.47 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 122 - Forks: 45

atomicoo/FCH-TTS
A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型,适用于英语、普通话/中文、日语、韩语、俄语和藏语(当前已测试)。
Language: Python - Size: 59.5 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 209 - Forks: 41

KinglittleQ/GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Language: Python - Size: 28.9 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 334 - Forks: 73

kaituoxu/Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Language: Python - Size: 1.67 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 52 - Forks: 13

xiaozhah/tacotron2 Fork of NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language: Jupyter Notebook - Size: 7.17 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 1

Yangyangii/Tacotron-pytorch
Tacotron implementation with pytorch 1.0
Language: Python - Size: 8.4 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 10 - Forks: 4

Yangyangii/TPGST-Tacotron
Google's TPGST reimplementation.
Language: Python - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 33 - Forks: 8

ttaoREtw/Tacotron-pytorch
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Language: Python - Size: 66.2 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 106 - Forks: 23

yookyungkho/Hand-to-Hand Fork of Tobigs-team/Hand-to-Hand
[제 11회 투빅스 컨퍼런스] 청각장애인의 즐거운 외출을 위한 수어생성 모델
Language: Jupyter Notebook - Size: 83.3 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

many-pw/tacotron
golang implementation of tacotron
Language: Go - Size: 1.91 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

eros71-dev/mario-voice-dataset
A dataset for Mario's voice (Charles Martinet), from the Super Mario franchise. More info here: https://uberduck.ai/about
Size: 21.7 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 1

stefantaubert/tacotron2 Fork of NVIDIA/tacotron2
Original Tacotron 2 modified to support IPA training/synthesis and multiple speakers.
Language: Python - Size: 145 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

ishandutta2007/Text-to-Speech-Landscape
Size: 37.1 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 39 - Forks: 6

almodhfer/Arabic_Diacritization
Several deep learning models for restoring Arabic diacritics using Pytorch.
Language: Python - Size: 30.3 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 24 - Forks: 7

solalala-12/Tacotron_Deep-Voice
TTS Deep Learning
Language: Python - Size: 29.2 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

erogol/TTS_tf
WIP Tensorflow implementation of https://github.com/mozilla/TTS
Language: Python - Size: 101 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 15 - Forks: 2

CODEJIN/GST_Tacotron
Implementation of Global Style Token Tacotron in TensorFlow2
Language: Python - Size: 155 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 22 - Forks: 10

rahulkumarm/Tamil-end-to-end-speech-synthesis
Tamil Speech Synthesis based on Google's Tacotron model and keithito's tacotron implementation https://github.com/keithito/tacotron
Language: Python - Size: 120 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 2

acetylSv/GST-tacotron
Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)
Language: Python - Size: 1.21 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 60 - Forks: 4

BridgetteSong/ExpressiveTacotron
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.
Language: Python - Size: 59.6 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 66 - Forks: 11

keonlee9420/Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Language: Python - Size: 130 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 31 - Forks: 8

mehdihosseinimoghadam/Catalan-Text-to-Speech Fork of as-ideas/ForwardTacotron
Catalan Text to Speech
Language: Python - Size: 202 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

qaz9517532846/tacotron2_speech_synthesis
Deep Learning speech_synthesis homework3 at NTUT.
Language: Python - Size: 163 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

thuhcsi/tacotron
PyTorch implementation of Tacotron and Tacotron2
Language: Python - Size: 186 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 11

favorcat/Tacotron-Korean-Tensorflow2 Fork of chldkato/Tacotron-Korean-Tensorflow2
Tacotron-Korean-Tensorflow2 for ubuntu
Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

stefantaubert/ICSPCC-2022
Supplementary material for ICSPCC 2022 paper "A Comparison of Text Selection Algorithms for Sequence-to-Sequence Neural TTS".
Size: 16.7 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

taylorlu/ghostvlad-speaker
An tensorflow implementation of ghostvlad for speaker recognition
Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 13 - Forks: 9

dacson/Demo-of-Text-to-Speech-based-on-Deep-Learning
text to speech for mandarin,
Size: 6.06 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 1

lygon55555/soongSiri
Soongsil University CSE chatbot
Language: JavaScript - Size: 10.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

etosworld/etos-tts
Deep Learning TTS, Based on PyTorch Implementation of Tacotron: A Fully End-To-End Text-To-Speech Synthesis Model.
Language: Python - Size: 69.3 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

erogol/ddc-samples
🐸💬 Coqui TTS Double Decoder Consistency samples
Language: HTML - Size: 24.2 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 2

hcnoh/multi-speaker-tacotron-tensorflow
A TensorFlow implementation of Multi-Speaker Tacotron which was introduced on Deep Voice 2 paper by Baidu.
Language: Python - Size: 504 KB - Last synced at: about 2 months ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

KinglittleQ/Tacotron
An implementation of Tacotron with Pytorch0.4
Language: Python - Size: 159 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

Unisound/end-to-end_tts
Audio samples from an end-to-end speech synthesis model.
Language: HTML - Size: 24.1 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 3

harshanavkis/Hindi-TTS
Text to Speech system for Hindi language
Language: Python - Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 2

s3nh/pytorch-tacotron2
speech synthesis - common voice polish dataset.
Language: Python - Size: 166 KB - Last synced at: about 1 hour ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

RiccardoGrin/NVIDIA-tacotron2 Fork of NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference by NVIDIA
Language: Jupyter Notebook - Size: 2.41 MB - Last synced at: 9 months ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 2

MODU-FTNC/carpedm20-tacotron-tensorflow Fork of carpedm20/multi-speaker-tacotron-tensorflow
Multi-speaker Tacotron in TensorFlow.
Language: Python - Size: 18.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

DonggeunYu/Text2Speech
Text to Speech
Language: Python - Size: 100 MB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

kozistr/tacotron-tensorflow
A TensorFlow implementation of Google's Tacotron speech synthesis
Language: Python - Size: 866 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

Mogady/Tacotron-google-cloud
Language: Python - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

RiccardoGrin/tacotron2 Fork of A-Jacobson/tacotron2
pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf
Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 1
