Topic: "audio-datasets"
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Size: 136 KB - Last synced at: 30 days ago - Pushed at: 11 months ago - Stars: 1,875 - Forks: 237

SuperKogito/SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
Language: HTML - Size: 3.72 MB - Last synced at: 21 days ago - Pushed at: 7 months ago - Stars: 321 - Forks: 44

DagsHub/audio-datasets
open-source audio datasets
Size: 6.76 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 148 - Forks: 26

ynop/audiomate
Python library for handling audio datasets.
Language: Python - Size: 9.07 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 137 - Forks: 27

sovaai/sova-dataset
Size: 43 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 116 - Forks: 7

Audio-WestlakeU/RealMAN
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
Language: Python - Size: 62.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 98 - Forks: 11

Audio-WestlakeU/audiossl
A library built for easier audio self-supervised training, downstream tasks evaluation
Language: Python - Size: 13.1 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 86 - Forks: 9

MorenoLaQuatra/audioset-download
This package aims at simplifying the download of the AudioSet dataset.
Language: Python - Size: 24.3 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 48 - Forks: 14

silenterus/deepspeech-cleaner
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
Language: Python - Size: 389 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 47 - Forks: 7

nuhmanpk/Webtrench
A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code
Language: Python - Size: 51.8 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 25 - Forks: 6

freds0/katube
KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a list of YouTube playlists or YouTube channels, KATube will generate dataset with audios and texts.
Language: Python - Size: 782 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 13 - Forks: 4

Rumeysakeskin/Speech-Datasets-for-ASR
Download speech datasets (English and non-English) for Automatic Speech Recognition
Language: Jupyter Notebook - Size: 2.56 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

GLJS/audio-datasets
GitHub Repository for the Survey Paper on Audio-Language Datasets for Scenes and Events
Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 0

hugolpz/LanguagesGallery
[v.1.0] Lingualibre Languages Gallery in VueJS.
Language: CSS - Size: 375 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 6 - Forks: 2

Metiu-Metiu/Neural-Texture-Sound-synthesis---data-sets
Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.
Size: 1.5 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

nafiuny/voice_conversion_dataset
top dataset for voice conversion models
Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

devinschumacher/audio-datasets
Audio Datasets
Size: 6.67 MB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

AAnirudh07/CLEF-2022
This repository contains the resources our team used through the course of the CLEF competition.
Language: Jupyter Notebook - Size: 13.6 MB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

yodakohl/nyumaya_audio_testdata
Dataset for tesing nyumaya audio recognition
Size: 33.7 MB - Last synced at: 12 days ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
