Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: non-autoregressive
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language: Jupyter Notebook - Size: 62.5 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 404 - Forks: 53
henry-yeh/GLOP
[AAAI 2024] GLOP: Learning Global Partition and Local Construction for Solving Large-scale Routing Problems in Real-time
Language: Python - Size: 1.21 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 41 - Forks: 4
lucidrains/soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch
Language: Python - Size: 334 KB - Last synced: 21 days ago - Pushed: 22 days ago - Stars: 1,119 - Forks: 77
kan-bayashi/NonARSeq2SeqVC
Non-autoregressive sequence-to-sequence voice conversion
Size: 4.51 MB - Last synced: 24 days ago - Pushed: over 3 years ago - Stars: 6 - Forks: 0
keonlee9420/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Language: Python - Size: 129 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 330 - Forks: 34
HKUNLP/diffusion-of-thoughts
Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
Language: Python - Size: 4.05 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 10 - Forks: 0
aistairc/BERT-NAR-BERT
BERT-based pre-trained non-autoregressive sequence-to-sequence model
Language: Python - Size: 74.7 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
keonlee9420/Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Language: Python - Size: 101 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 237 - Forks: 38
keonlee9420/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Language: Python - Size: 101 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 162 - Forks: 25
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language: Python - Size: 121 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 271 - Forks: 44
keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Language: Python - Size: 3.45 MB - Last synced: 7 months ago - Pushed: almost 2 years ago - Stars: 136 - Forks: 19
hemingkx/SpecDec
Code for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)
Language: Python - Size: 7.22 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 16 - Forks: 0
keonlee9420/DailyTalk
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 (Oral)
Language: Python - Size: 102 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 155 - Forks: 13
keonlee9420/FastPitchFormant
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Language: Python - Size: 101 MB - Last synced: 7 months ago - Pushed: almost 3 years ago - Stars: 69 - Forks: 13
keonlee9420/DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Language: Python - Size: 133 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 206 - Forks: 27
keonlee9420/StyleSpeech
PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation
Language: Python - Size: 114 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 172 - Forks: 21
yzhangcs/ctc-copy
[EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".
Language: Python - Size: 50.8 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 9 - Forks: 1
keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
Language: Python - Size: 143 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 288 - Forks: 39
keonlee9420/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Language: Python - Size: 99.3 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 181 - Forks: 43
RistoAle97/ContinualNAT
M.Sc. thesis on Continual Learning for multilingual non-autoregressive Neural Machine Translation (NAT)
Language: Python - Size: 3.76 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 6 - Forks: 0
keonlee9420/WaveGrad2
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Language: Python - Size: 18 MB - Last synced: 7 months ago - Pushed: almost 3 years ago - Stars: 64 - Forks: 14
xcfcode/What-I-Have-Read
Paper lists, notes, and slides, with a focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
Size: 91.2 MB - Last synced: 7 months ago - Pushed: almost 2 years ago - Stars: 161 - Forks: 15
LARC-CMU-SMU/Enconter
Implementation of the EACL 2021 paper Enconter
Language: Jupyter Notebook - Size: 945 KB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0
HKUNLP/reparam-discrete-diffusion
Reparameterized Discrete Diffusion Models for Text Generation
Language: Python - Size: 6.73 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 28 - Forks: 1
bearcatt/LaBERT
A length-controllable and non-autoregressive image captioning model.
Language: Python - Size: 34.2 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 56 - Forks: 10
keonlee9420/Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Language: Python - Size: 110 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 48 - Forks: 14
keonlee9420/VAENAR-TTS
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
Language: Python - Size: 122 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 68 - Forks: 13
keonlee9420/Deep-Learning-TTS-Template
This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
Language: Python - Size: 106 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 13 - Forks: 0