Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: audio-datasets

jim-schwoebel/voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Size: 136 KB - Last synced: 9 days ago - Pushed: 2 months ago - Stars: 1,555 - Forks: 218

SuperKogito/SER-datasets

A collection of datasets for the purpose of emotion recognition/detection in speech.

Language: HTML - Size: 3.6 MB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 247 - Forks: 33

DagsHub/audio-datasets

open-source audio datasets

Size: 6.76 MB - Last synced: 27 days ago - Pushed: 9 months ago - Stars: 125 - Forks: 23

MorenoLaQuatra/audioset-download

This package aims at simplifying the download of the AudioSet dataset.

Language: Python - Size: 24.3 MB - Last synced: 17 days ago - Pushed: 8 months ago - Stars: 33 - Forks: 8

Audio-WestlakeU/audiossl

A library built for easier audio self-supervised training and downstream task evaluation.

Language: Python - Size: 13.1 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 65 - Forks: 7

ynop/audiomate

Python library for handling audio datasets.

Language: Python - Size: 9.07 MB - Last synced: 20 days ago - Pushed: 11 months ago - Stars: 130 - Forks: 25

AAnirudh07/CLEF-2022

This repository contains the resources our team used through the course of the CLEF competition.

Language: Jupyter Notebook - Size: 13.6 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

nuhmanpk/Webtrench

A powerful and easy-to-use web scraper for collecting data from the web. Supports scraping of images, text, videos, metadata, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code.

Language: Python - Size: 51.8 KB - Last synced: 4 days ago - Pushed: 6 months ago - Stars: 20 - Forks: 5

sovaai/sova-dataset

Size: 43 KB - Last synced: 27 days ago - Pushed: over 1 year ago - Stars: 110 - Forks: 7

hugolpz/LanguagesGallery

[v.1.0] Lingualibre Languages Gallery in VueJS.

Language: CSS - Size: 176 KB - Last synced: 26 days ago - Pushed: 3 months ago - Stars: 3 - Forks: 3

nafiuny/voice_conversion_dataset

top dataset for voice conversion models

Size: 5.86 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

silenterus/deepspeech-cleaner

Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework

Language: Python - Size: 389 KB - Last synced: 7 months ago - Pushed: 12 months ago - Stars: 47 - Forks: 7

Metiu-Metiu/Neural-Texture-Sound-synthesis---data-sets

Synthetic and real datasets of water-flow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.

Size: 1.5 GB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 3 - Forks: 0

devinschumacher/audio-datasets

Audio Datasets

Size: 6.67 MB - Last synced: 18 days ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

Rumeysakeskin/Speech-Datasets-for-ASR

Download speech datasets (English and non-English) for Automatic Speech Recognition

Language: Jupyter Notebook - Size: 2.56 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 7 - Forks: 0

freds0/katube

KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a list of YouTube playlists or YouTube channels, KATube generates a dataset of audio and text.

Language: Python - Size: 782 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 13 - Forks: 4

yodakohl/nyumaya_audio_testdata

Dataset for testing nyumaya audio recognition

Size: 33.7 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

Related Keywords
audio-datasets (17), datasets (5), audio (4), dataset (4), voice-datasets (4), speech-recognition (3), voice-dataset (3), data (3), audio-dataset (3), corpus-tools (2), dataset-creation (2), speech (2), dataset-filtering (2), dataset-manager (2), open-source (2), audioset (2), noise (2), voice-conversion (2), speech-to-text (2), python (2), machine-learning (2), speech-dataset (2), russian-datasets (1), opensource (1), opendata (1), sova-dataset (1), voice-data (1), languages-spoken (1), mic-noise (1), open-data (1), english-datasets (1), corpus (1), chinese-dataset (1), audio-data (1), text-datasets (1), scarper (1), image-data-generator (1), deep-learning (1), dataset-generation (1), marvin (1), voxforge-dataset (1), speech-synthesis (1), speech-processing (1), common-voice-dataset (1), asr (1), open-source-datasets (1), synthetic-dataset-generation (1), synthetic-dataset (1), real-dataset (1), data-augmentation (1), audio-segmentation (1), audio-dataset-for-machine-learning (1), multilanguage (1), mozilla (1), deepspeech (1), vc-dataset (1), vc (1), tts-dataset (1), tts (1), text-to-speech (1), pyth (1), lingualibre (1), audioset-download (1), hacktoberfest22 (1), hacktoberfest2022 (1), hacktoberfest-22 (1), hacktoberfest-2023 (1), hacktoberfest-2022 (1), hacktoberfest (1), codepeak2022 (1), codepeak (1), speech-emotion-recognition (1), multimodal-emotion-recognition (1), emotions-recognition (1), emotions (1), voice-synthesis (1), voice-recognition (1), voice (1), voice-control (1), voice-computing (1), voice-commands (1), voice-chat (1), voice-assistant (1), voice-activity-detection (1), data-science (1), data-collection (1), random-forest-regressor (1), multi-output-regression (1), dense-neural-network (1), cnn-classification (1), clef-aware (1), bird-clef (1), music (1), data-loader (1), voxceleb1 (1), urbansound8k (1), speech-commands (1), self-supervised-learning (1), pytorch-lightning (1), pytorch (1)