An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: tacotron

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python - Size: 162 MB - Last synced at: about 14 hours ago - Pushed at: 9 months ago - Stars: 39,755 - Forks: 5,056

MycroftAI/mimic-recording-studio

Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2

Language: JavaScript - Size: 5.94 MB - Last synced at: 1 day ago - Pushed at: about 2 years ago - Stars: 508 - Forks: 119

mozilla/TTS

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language: Jupyter Notebook - Size: 120 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 9,792 - Forks: 1,291

Emotional-Text-to-Speech/dl-for-emo-tts

💻 🤖 A summary of our attempts at using deep learning approaches for emotional text to speech 🔈

Language: Jupyter Notebook - Size: 5.26 MB - Last synced at: 5 days ago - Pushed at: 10 months ago - Stars: 447 - Forks: 44

google/tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Language: HTML - Size: 1.05 GB - Last synced at: 14 days ago - Pushed at: about 1 month ago - Stars: 540 - Forks: 83

syang1993/gst-tacotron

A TensorFlow implementation of "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"

Language: Python - Size: 412 KB - Last synced at: 8 days ago - Pushed at: over 6 years ago - Stars: 367 - Forks: 110

keithito/tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Language: Python - Size: 110 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 2,975 - Forks: 955

fatchord/WaveRNN

WaveRNN Vocoder + TTS

Language: Python - Size: 236 MB - Last synced at: 19 days ago - Pushed at: almost 3 years ago - Stars: 2,157 - Forks: 698

Rayhane-mamah/Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

Language: Python - Size: 8.94 MB - Last synced at: 20 days ago - Pushed at: almost 2 years ago - Stars: 2,308 - Forks: 913

slegroux/nimrod

minimal deep learning framework

Language: Jupyter Notebook - Size: 119 MB - Last synced at: 10 days ago - Pushed at: 16 days ago - Stars: 2 - Forks: 0

falkyn7/text-toolkit

Advanced MCP server providing comprehensive text transformation and formatting tools. TextToolkit offers over 40 specialized utilities for case conversion, encoding/decoding, formatting, analysis, and text manipulation - all accessible directly within your AI assistant workflow.

Language: TypeScript - Size: 1.36 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Kyubyong/expressive_tacotron

Tensorflow Implementation of Expressive Tacotron

Language: Python - Size: 5.8 MB - Last synced at: 10 days ago - Pushed at: over 6 years ago - Stars: 196 - Forks: 34

r9y9/tacotron_pytorch

PyTorch implementation of Tacotron speech synthesis model.

Language: Jupyter Notebook - Size: 20.7 MB - Last synced at: 24 days ago - Pushed at: almost 6 years ago - Stars: 309 - Forks: 78

bshall/Tacotron

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Language: Python - Size: 1.01 MB - Last synced at: 26 days ago - Pushed at: over 4 years ago - Stars: 114 - Forks: 25

spring-media/ForwardTacotron (fork of fatchord/WaveRNN)

⏩ Generating speech in a single forward pass without any attention!

Language: Python - Size: 203 MB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 579 - Forks: 112

karim23657/Persian-tts-coqui

Persian/Farsi text-to-speech (TTS) training using Coqui TTS

Language: Jupyter Notebook - Size: 53.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 128 - Forks: 18

stefantaubert/en-tts

Command-line interface and Python library for synthesizing English texts into speech.

Language: Python - Size: 804 KB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

Kyubyong/tacotron_asr

Speech Recognition Using Tacotron

Language: Python - Size: 4.65 MB - Last synced at: 10 days ago - Pushed at: over 7 years ago - Stars: 163 - Forks: 39

vlomme/Multi-Tacotron-Voice-Cloning 📦

Phoneme multilingual(Russian-English) voice cloning based on

Language: Python - Size: 985 KB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 390 - Forks: 96

MysteryPancake/Discord-TTS

Text to speech Discord bot using FakeYou

Language: JavaScript - Size: 157 KB - Last synced at: 23 days ago - Pushed at: about 2 years ago - Stars: 39 - Forks: 43

soobinseo/Tacotron-pytorch

Pytorch implementation of Tacotron

Language: Python - Size: 1.02 MB - Last synced at: 6 months ago - Pushed at: over 6 years ago - Stars: 206 - Forks: 41

StarxSky/tacotron2-JP

Based on "tacotron2-japanese", rebuilt and modified

Language: Jupyter Notebook - Size: 1.52 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

MycroftAI/mimic2 (fork of keithito/tacotron)

Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.

Language: Python - Size: 816 KB - Last synced at: 6 months ago - Pushed at: almost 4 years ago - Stars: 580 - Forks: 103

dongheehand/Tacotron-PyTorch

PyTorch implementation of Tacotron

Language: Python - Size: 1.15 MB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 1

stefantaubert/zh-tts

Web app, command-line interface and Python library for synthesizing Chinese texts into speech.

Language: Python - Size: 2.04 MB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 1

sooftware/tacotron2

Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.

Language: Python - Size: 89.8 KB - Last synced at: 24 days ago - Pushed at: over 4 years ago - Stars: 19 - Forks: 3

ide8/tacotron2

Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow

Language: Jupyter Notebook - Size: 2.96 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 128 - Forks: 26

BogiHsu/Tacotron2-PyTorch

Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.

Language: Python - Size: 2.9 MB - Last synced at: 9 months ago - Pushed at: about 3 years ago - Stars: 142 - Forks: 38

Orca0917/Tacotron-pytorch

Unofficial implementation of Tacotron(2017) using PyTorch.

Language: Jupyter Notebook - Size: 6.16 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Adibian/persian_tacotron

Training Tacotron2 for Persian language as a Persian text-to-speech

Language: Jupyter Notebook - Size: 4.24 MB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 1

andi611/CS-Tacotron-Pytorch

Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.

Language: Python - Size: 155 MB - Last synced at: 20 days ago - Pushed at: about 6 years ago - Stars: 23 - Forks: 6

DanRuta/xVA-Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

Language: JavaScript - Size: 1.14 GB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 580 - Forks: 54

ranchlai/mandarin-tts

Chinese Mandarin text-to-speech (TTS) based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets

Language: Python - Size: 85.4 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 446 - Forks: 106

lygon55555/EDA

Emergency Disaster Alert (Hyungnam Science Award competition)

Language: JavaScript - Size: 4.27 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

andi611/TTS-Tacotron-Pytorch

Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.

Language: Python - Size: 79.5 MB - Last synced at: 20 days ago - Pushed at: about 6 years ago - Stars: 29 - Forks: 10

hpbyte/myanmar-tts

Myanmar Text-to-Speech with End-to-End Speech Synthesis

Language: Python - Size: 98.6 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 3

Foxify52/RVG_tts

A retrieval-based voice generation text-to-speech system

Language: Python - Size: 233 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 2

everydaycodings/MimicMania

MimicMania is a web application that allows you to generate speech and clone voices using text-to-speech technology. With MimicMania, you can create custom voices in a variety of languages and use them for a range of applications, from voiceovers to chatbots.

Language: Python - Size: 1010 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 59 - Forks: 12

rishikksh20/gmvae_tacotron

Gaussian Mixture VAE Tacotron

Language: Python - Size: 56.6 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 51 - Forks: 12

rishikksh20/vae_tacotron2

VAE Tacotron 2, an alternative to GST Tacotron

Language: Python - Size: 63.5 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 85 - Forks: 29

adasegroup/OSM-one-shot-multispeaker

Framework for one-shot multispeaker system based on Deep Learning

Language: Python - Size: 46 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 19 - Forks: 4

gia-guar/JARVIS-ChatGPT

A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.

Language: Python - Size: 107 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 280 - Forks: 67

NTT123/vietTTS

Vietnamese Text to Speech library

Language: Python - Size: 11.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 153 - Forks: 70

atomicoo/tacotron2-mandarin

Tensorflow implementation of Chinese/Mandarin TTS (Text-to-Speech) based on Tacotron-2 model.

Language: Python - Size: 8.47 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 122 - Forks: 45

atomicoo/FCH-TTS

A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (tested so far).

Language: Python - Size: 59.5 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 209 - Forks: 41

KinglittleQ/GST-Tacotron

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Language: Python - Size: 28.9 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 334 - Forks: 73

kaituoxu/Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Language: Python - Size: 1.67 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 52 - Forks: 13

xiaozhah/tacotron2 (fork of NVIDIA/tacotron2)

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Language: Jupyter Notebook - Size: 7.17 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 1

Yangyangii/Tacotron-pytorch

Tacotron implementation with pytorch 1.0

Language: Python - Size: 8.4 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 10 - Forks: 4

Yangyangii/TPGST-Tacotron

Google's TPGST reimplementation.

Language: Python - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 33 - Forks: 8

ttaoREtw/Tacotron-pytorch

A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model

Language: Python - Size: 66.2 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 106 - Forks: 23

yookyungkho/Hand-to-Hand (fork of Tobigs-team/Hand-to-Hand)

[11th Tobigs Conference] A sign language generation model to make outings more enjoyable for deaf people

Language: Jupyter Notebook - Size: 83.3 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

many-pw/tacotron

Go implementation of Tacotron

Language: Go - Size: 1.91 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

eros71-dev/mario-voice-dataset

A dataset for Mario's voice (Charles Martinet), from the Super Mario franchise. More info here: https://uberduck.ai/about

Size: 21.7 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 1

stefantaubert/tacotron2 (fork of NVIDIA/tacotron2)

Original Tacotron 2 modified to support IPA training/synthesis and multiple speakers.

Language: Python - Size: 145 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

ishandutta2007/Text-to-Speech-Landscape

Size: 37.1 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 39 - Forks: 6

almodhfer/Arabic_Diacritization

Several deep learning models for restoring Arabic diacritics using Pytorch.

Language: Python - Size: 30.3 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 24 - Forks: 7

solalala-12/Tacotron_Deep-Voice

TTS Deep Learning

Language: Python - Size: 29.2 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

erogol/TTS_tf

WIP Tensorflow implementation of https://github.com/mozilla/TTS

Language: Python - Size: 101 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 15 - Forks: 2

CODEJIN/GST_Tacotron

Implementation of Global Style Token Tacotron in TensorFlow2

Language: Python - Size: 155 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 22 - Forks: 10

rahulkumarm/Tamil-end-to-end-speech-synthesis

Tamil Speech Synthesis based on Google's Tacotron model and keithito's tacotron implementation https://github.com/keithito/tacotron

Language: Python - Size: 120 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 2

acetylSv/GST-tacotron

Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)

Language: Python - Size: 1.21 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 60 - Forks: 4

BridgetteSong/ExpressiveTacotron

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building a prosody encoder.

Language: Python - Size: 59.6 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 66 - Forks: 11

keonlee9420/Comprehensive-Tacotron2

PyTorch implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.

Language: Python - Size: 130 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 31 - Forks: 8

mehdihosseinimoghadam/Catalan-Text-to-Speech (fork of as-ideas/ForwardTacotron)

Catalan Text to Speech

Language: Python - Size: 202 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

qaz9517532846/tacotron2_speech_synthesis

Deep Learning speech_synthesis homework3 at NTUT.

Language: Python - Size: 163 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

thuhcsi/tacotron

PyTorch implementation of Tacotron and Tacotron2

Language: Python - Size: 186 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 11

favorcat/Tacotron-Korean-Tensorflow2 (fork of chldkato/Tacotron-Korean-Tensorflow2)

Tacotron-Korean-Tensorflow2 for ubuntu

Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

stefantaubert/ICSPCC-2022

Supplementary material for ICSPCC 2022 paper "A Comparison of Text Selection Algorithms for Sequence-to-Sequence Neural TTS".

Size: 16.7 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

taylorlu/ghostvlad-speaker

A TensorFlow implementation of GhostVLAD for speaker recognition

Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 13 - Forks: 9

dacson/Demo-of-Text-to-Speech-based-on-Deep-Learning

text to speech for mandarin,

Size: 6.06 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 1

lygon55555/soongSiri

Soongsil University CSE chatbot

Language: JavaScript - Size: 10.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

etosworld/etos-tts

Deep Learning TTS, Based on PyTorch Implementation of Tacotron: A Fully End-To-End Text-To-Speech Synthesis Model.

Language: Python - Size: 69.3 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

erogol/ddc-samples

🐸💬 Coqui TTS Double Decoder Consistency samples

Language: HTML - Size: 24.2 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 2

hcnoh/multi-speaker-tacotron-tensorflow

A TensorFlow implementation of Multi-Speaker Tacotron, which was introduced in the Deep Voice 2 paper by Baidu.

Language: Python - Size: 504 KB - Last synced at: about 2 months ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

KinglittleQ/Tacotron

An implementation of Tacotron with PyTorch 0.4

Language: Python - Size: 159 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

Unisound/end-to-end_tts

Audio samples from an end-to-end speech synthesis model.

Language: HTML - Size: 24.1 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 3

harshanavkis/Hindi-TTS

Text to Speech system for Hindi language

Language: Python - Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 2

s3nh/pytorch-tacotron2

Speech synthesis on the Common Voice Polish dataset.

Language: Python - Size: 166 KB - Last synced at: about 1 hour ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

RiccardoGrin/NVIDIA-tacotron2 (fork of NVIDIA/tacotron2)

Tacotron 2 - PyTorch implementation with faster-than-realtime inference by NVIDIA

Language: Jupyter Notebook - Size: 2.41 MB - Last synced at: 9 months ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 2

MODU-FTNC/carpedm20-tacotron-tensorflow (fork of carpedm20/multi-speaker-tacotron-tensorflow)

Multi-speaker Tacotron in TensorFlow.

Language: Python - Size: 18.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

DonggeunYu/Text2Speech

Text to Speech

Language: Python - Size: 100 MB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

kozistr/tacotron-tensorflow

A TensorFlow implementation of Google's Tacotron speech synthesis

Language: Python - Size: 866 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

Mogady/Tacotron-google-cloud

Language: Python - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

RiccardoGrin/tacotron2 (fork of A-Jacobson/tacotron2)

pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf

Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 1