GitHub topics: audioset

Repositories

curlsloth/audioset-strong-download Fork of MorenoLaQuatra/audioset-download

This package aims at simplifying the download of the strong version of AudioSet dataset.

Language: Python - Size: 24.4 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 6 - Forks: 0

MorenoLaQuatra/audioset-download

This package aims at simplifying the download of the AudioSet dataset.

Language: Python - Size: 24.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 54 - Forks: 14

harritaylor/torchvggish 📦

Pytorch port of Google Research's VGGish model used for extracting audio features.

Language: Python - Size: 316 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 393 - Forks: 71

IvanBirkmaier/Audioset

This repository is built with a focus on practical ways to obtain and work with the audio data of audioset. You can use this repository to download and precprocess audioset wav files for running the recipies of Audio Spectogram Transformer (AST) and Masked Autoencoder that listen (Audio - MAE).

Language: Jupyter Notebook - Size: 1.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

luuil/Tensorflow-Audio-Classification

Audio classification with VGGish as feature extractor in TensorFlow

Language: Python - Size: 7.46 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 130 - Forks: 30

gojibjib/jibjib-model 📦

Machine learning model for bird songs recognition

Language: Python - Size: 6 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 43 - Forks: 7

jim-schwoebel/sound_event_detection

🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.

Language: Python - Size: 24 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 42 - Forks: 3

tky823/hyperaudioset

Hyperbolic embedding using AudioSet ontology.

Language: Python - Size: 7.29 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

gojibjib/jibjib-query 📦

Query service to serve the JibJib TensorFlow model

Language: Python - Size: 40 KB - Last synced at: about 2 months ago - Pushed at: almost 7 years ago - Stars: 4 - Forks: 1

jim-schwoebel/download_audioset

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Language: Python - Size: 154 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 101 - Forks: 22

AndreaCossu/ContinualLearning-SequentialProcessing

Continual Learning with Gated Incremental Memories for Sequential Data Processing. IJCNN 2020. Continual Learning with Recurrent Neural Networks (RNNs) inspired by Progressive network architecture.

Language: Python - Size: 193 KB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 15 - Forks: 4

rohitrango/objects-that-sound

Unofficial Implementation of Google Deepmind's paper `Objects that Sound`

Language: Python - Size: 57 MB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 83 - Forks: 16

Audio-WestlakeU/audiossl

A library built for easier audio self-supervised training, downstream tasks evaluation

Language: Python - Size: 13.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 86 - Forks: 9

gcolussi11/Biochallenge

Desenvolvimento e treino de modelos CNN e Random Forest para classificação de áudios ambientes

Language: Jupyter Notebook - Size: 15.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dlrudco/Fast-Audioset-Download

Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing

Language: Python - Size: 754 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 25 - Forks: 1

mx-mark/SPMNet

Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)

Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

paloukari/OrcaDetector

A VGGish-based DNN trained on the Watkins Marine Mammal Sound Database, with transfer learning from Audioset, to detect multiple marine mammal species.

Language: Jupyter Notebook - Size: 74.3 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 19 - Forks: 10

kyuyeonpooh/objects-that-sound

The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.

Language: Python - Size: 163 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 32 - Forks: 4

jim-schwoebel/audioset_models

📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).

Language: Python - Size: 13.9 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 29 - Forks: 11

nikola-j/audio_tag

Automatic wav tagging

Language: Python - Size: 10.5 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

ktonal/audioset-downloader

cli to download examples of a specific class from google's AudioSet

Language: Python - Size: 68.5 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

sjappig/mldn-capstone

AudioSet classification using RNN

Language: Jupyter Notebook - Size: 2.41 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 8 - Forks: 2

pritamqu/CrissCross

Official project page of CrissCross - AAAI 2023 (Oral)

Language: Python - Size: 19 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 1

zkmkarlsruhe/language-identification

Spoken Language Identification on Common Voice and AudioSet using Deep Learning

Language: Python - Size: 5.44 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 30 - Forks: 6

jonahanton/SSL_audio

Codebase for Imperial MSc AI Individual Project - Self-Supervised Learning for Audio Inference

Language: Python - Size: 59.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

bakhtos/GoogleAudioSetReformatted

Google's AudioSet consistently reformatted

Language: Python - Size: 78.8 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

bakhtos/GoogleAudioSetScripts

Scripts to process Google's Audioset

Language: Python - Size: 12.2 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

ml-illustrated/Pytorch-CoreML-Spectrogram

Repo accompanying the blog post "How to Deploy PyTorch Models with Core ML Conversion Issues"

Language: Swift - Size: 3.02 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 0

AppleHolic/audioset_augmentor

Sound augmentation using Large-scale audio dataset (Audioset)

Language: Python - Size: 24.7 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 42 - Forks: 8

ml-illustrated/Pytorch-CoreML-Sound-Classification

Repo accompanying the blog post "How to Deploy A State-of-the-art PyTorch Model to iOS via Core ML (Part 3)".

Language: Swift - Size: 19.7 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 1

black-mold/audioset_downloader

AudioSet downloader

Language: Python - Size: 2.17 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

nhattruongpham/torchvggish-gpu Fork of harritaylor/torchvggish

Re-Implementation of Google Research's VGGish model used for extracting audio features using Pytorch with GPU support.

Language: Python - Size: 2.99 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

usc-sail/mica-gender-from-audio

Gender prediction in movie audio

Language: TypeScript - Size: 11.5 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 3

Related Keywords

audioset 33 machine-learning 7 audio 6 vggish 6 tensorflow 5 deep-learning 5 audio-processing 4 sound-event-detection 4 pytorch 4 dataset 4 audioset-download 4 voice-computing 3 self-supervised-learning 3 python 2 coremltools 2 coreml 2 google-audioset 2 common-voice 2 google 2 audio-tagging 2 pafy 2 voice 2 youtube 2 sound 2 youtube-dl 2 audiosetdownload 2 downloader 2 audio-datasets 2 audio-embedding 2 audio-classification 2 sound-classification 2 pytorch-coreml 2 docker 2 cli 1 vgg 1 tx2 1 gpu 1 rest-api 1 keras 1 voice-recognition 1 voice-ml 1 synchronization 1 machinelearning-python 1 machine-learning-models 1 machine-learning-algorithms 1 download-file 1 download 1 vas 1 sound-localization 1 video-understanding 1 eccv2018 1 cross-modal-retrieval 1 audio-visual-learning 1 whoi 1 visual-audio 1 docker-container 1 visual-to-sound 1 dnn 1 voice-activity-detection 1 movie-data 1 gender-recognition-by-voice 1 female-speaking-time 1 gpu-support 1 pytorch-tutorial 1 speech 1 source-separation 1 augmentation 1 onnx-coreml 1 masked-autoencoder 1 byol 1 barlow-twins 1 zkm 1 spoken-language-identification 1 lid 1 language-identification 1 intelligent-museum 1 ucf101 1 representation-learning 1 kinetics400 1 kinetics-datasets 1 hmdb51 1 esc50 1 dcase 1 action-recognition 1 udacity-nanodegree 1 sklearn 1 capstone-project 1 python3 1 protobuf 1 machine-learning-api 1 grpc-python 1 grpc 1 flask 1 poincare-embeddings 1 hyperbolic-embeddings 1 voicebook 1 surveylex 1 object-detection-pipelines 1 object-detection-label 1 object-detection-accuracy 1