An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: audio-datasets

ynop/audiomate

Python library for handling audio datasets.

Language: Python - Size: 9.07 MB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 138 - Forks: 28

MorenoLaQuatra/audioset-download

This package aims at simplifying the download of the AudioSet dataset.

Language: Python - Size: 24.3 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 50 - Forks: 14

SuperKogito/SER-datasets

A collection of datasets for the purpose of emotion recognition/detection in speech.

Language: HTML - Size: 3.72 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 332 - Forks: 44

Audio-WestlakeU/RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Language: Python - Size: 62.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 119 - Forks: 13

hugolpz/LanguagesGallery

[v.1.0] Lingualibre Languages Gallery in VueJS.

Language: CSS - Size: 375 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 6 - Forks: 2

jim-schwoebel/voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Size: 136 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1,875 - Forks: 237

GLJS/audio-datasets

GitHub Repository for the Survey Paper on Audio-Language Datasets for Scenes and Events

Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 0

DagsHub/audio-datasets

open-source audio datasets

Size: 6.76 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 148 - Forks: 26

nuhmanpk/Webtrench

A powerful and easy-to-use web scraper for collecting data from the web. Supports scraping of images, text, videos, metadata, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code.

Language: Python - Size: 51.8 KB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 25 - Forks: 6

sovaai/sova-dataset

Size: 43 KB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 116 - Forks: 7

Audio-WestlakeU/audiossl

A library built for easier audio self-supervised training and downstream task evaluation.

Language: Python - Size: 13.1 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 86 - Forks: 9

nafiuny/voice_conversion_dataset

top dataset for voice conversion models

Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

silenterus/deepspeech-cleaner

Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework

Language: Python - Size: 389 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 47 - Forks: 7

Metiu-Metiu/Neural-Texture-Sound-synthesis---data-sets

Synthetic and real datasets of water-flow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.

Size: 1.5 GB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

devinschumacher/audio-datasets

Audio Datasets

Size: 6.67 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Rumeysakeskin/Speech-Datasets-for-ASR

Download speech datasets (English and non-English) for Automatic Speech Recognition

Language: Jupyter Notebook - Size: 2.56 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

freds0/katube

KATube is a tool that automates the creation of datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a list of YouTube playlists or YouTube channels, KATube generates a dataset of audio and text.

Language: Python - Size: 782 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 13 - Forks: 4

AAnirudh07/CLEF-2022

This repository contains the resources our team used through the course of the CLEF competition.

Language: Jupyter Notebook - Size: 13.6 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

yodakohl/nyumaya_audio_testdata

Dataset for testing nyumaya audio recognition

Size: 33.7 MB - Last synced at: 2 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Related Keywords
audio-datasets (19), datasets (5), dataset (4), voice-datasets (4), audio (4), audio-dataset (3), data (3), speech-recognition (3), voice-dataset (3), open-source (2), machine-learning (2), python (2), speech-dataset (2), speech-to-text (2), corpus-tools (2), voice-conversion (2), audioset (2), speech (2), noise (2), dataset-manager (2), dataset-filtering (2), dataset-creation (2), tts (1), text-to-speech (1), pyth (1), voxceleb1 (1), urbansound8k (1), tts-dataset (1), speech-commands (1), self-supervised-learning (1), pytorch-lightning (1), pytorch (1), nsynth (1), audio-self-supervised-learning (1), audio-representation (1), audio-pretraining (1), audio-classification (1), voice-data (1), sova-dataset (1), russian-datasets (1), opensource (1), opendata (1), mic-noise (1), marvin (1), random-forest-regressor (1), multi-output-regression (1), dense-neural-network (1), cnn-classification (1), clef-aware (1), bird-clef (1), voxforge-dataset (1), speech-synthesis (1), speech-processing (1), common-voice-dataset (1), asr (1), open-source-datasets (1), synthetic-dataset-generation (1), synthetic-dataset (1), real-dataset (1), data-augmentation (1), audio-segmentation (1), audio-dataset-for-machine-learning (1), multilanguage (1), mozilla (1), deepspeech (1), vc-dataset (1), vc (1), voice-computing (1), voice-commands (1), voice-chat (1), voice-assistant (1), voice-activity-detection (1), voice (1), lingualibre (1), languages-spoken (1), speech-enhancement (1), sound-source-localization (1), real-world-datasets (1), multi-channel (1), microphone-audio-capture (1), microphone-array-processing (1), doa-estimation (1), speech-emotion-recognition (1), multimodal-emotion-recognition (1), emotions-recognition (1), emotions (1), downloader (1), audioset-download (1), music (1), data-loader (1), open-data (1), english-datasets (1), corpus (1), chinese-dataset (1), audio-data (1), text-datasets (1), scarper (1), image-data-generator (1), deep-learning (1), dataset-generation (1)