GitHub topics: audio-datasets
ynop/audiomate
Python library for handling audio datasets.
Language: Python - Size: 9.07 MB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 138 - Forks: 28

MorenoLaQuatra/audioset-download
This package aims at simplifying the download of the AudioSet dataset.
Language: Python - Size: 24.3 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 50 - Forks: 14

SuperKogito/SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
Language: HTML - Size: 3.72 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 332 - Forks: 44

Audio-WestlakeU/RealMAN
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]
Language: Python - Size: 62.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 119 - Forks: 13

hugolpz/LanguagesGallery
[v.1.0] Lingualibre Languages Gallery in VueJS.
Language: CSS - Size: 375 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 6 - Forks: 2

jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Size: 136 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1,875 - Forks: 237

GLJS/audio-datasets
GitHub Repository for the Survey Paper on Audio-Language Datasets for Scenes and Events
Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 0

DagsHub/audio-datasets
open-source audio datasets
Size: 6.76 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 148 - Forks: 26

nuhmanpk/Webtrench
A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code
Language: Python - Size: 51.8 KB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 25 - Forks: 6

sovaai/sova-dataset
Size: 43 KB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 116 - Forks: 7

Audio-WestlakeU/audiossl
A library built for easier audio self-supervised training, downstream tasks evaluation
Language: Python - Size: 13.1 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 86 - Forks: 9

nafiuny/voice_conversion_dataset
top dataset for voice conversion models
Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

silenterus/deepspeech-cleaner
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
Language: Python - Size: 389 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 47 - Forks: 7

Metiu-Metiu/Neural-Texture-Sound-synthesis---data-sets
Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.
Size: 1.5 GB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

devinschumacher/audio-datasets
Audio Datasets
Size: 6.67 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Rumeysakeskin/Speech-Datasets-for-ASR
Download speech datasets (English and non-English) for Automatic Speech Recognition
Language: Jupyter Notebook - Size: 2.56 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

freds0/katube
KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a list of YouTube playlists or YouTube channels, KATube will generate dataset with audios and texts.
Language: Python - Size: 782 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 13 - Forks: 4

AAnirudh07/CLEF-2022
This repository contains the resources our team used through the course of the CLEF competition.
Language: Jupyter Notebook - Size: 13.6 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

yodakohl/nyumaya_audio_testdata
Dataset for tesing nyumaya audio recognition
Size: 33.7 MB - Last synced at: 2 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
