GitHub topics: text-to-speech-dataset
MahtaFetrat/ManaTTS-Persian-Speech-Dataset
ManaTTS is the largest open Persian speech dataset with 100+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
Language: Jupyter Notebook - Size: 16.4 MB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 24 - Forks: 1

MahtaFetrat/GPTInformal-Persian-Speech-Dataset
A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
Size: 2.93 KB - Last synced at: 21 days ago - Pushed at: 7 months ago - Stars: 7 - Forks: 0

hetpandya/youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos
Language: Python - Size: 81.1 KB - Last synced at: 15 days ago - Pushed at: 11 months ago - Stars: 36 - Forks: 8
