GitHub topics: audio-retrieval

Repositories

soham97/awesome-sound_event_detection

Reading list for research topics in Sound AI

Size: 145 KB - Last synced at: 22 days ago - Pushed at: 10 months ago - Stars: 180 - Forks: 8

Jonathan-Greif/QBV

This repository provides the code for "Improving Query-by-Vocal Imitation with Contrastive Learning and Audio Pretraining", presented at DCASE 2024. The paper addresses the challenge of audio retrieval using vocal imitations as queries, proposing a dual encoder architecture that leverages pretrained CNNs and an adapted NT-Xent loss for fine-tuning.

Language: Python - Size: 1.61 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

oncescuandreea/audio_egovlp

This is the official codebase used for obtaining the results in the ICASSP 2024 paper: A SOUND APPROACH: Using Large Language Models to generate audio descriptions for egocentric text-audio retrieval

Language: Python - Size: 13.7 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

microsoft/WavText5K

Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"

Language: Python - Size: 419 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 49 - Forks: 0

soham97/sound_ai_progress

Tracking states of the arts and recent results (bibliography) on sound tasks.

Size: 51.8 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 28 - Forks: 1

akoepke/audio-retrieval-benchmark

Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".

Language: Python - Size: 3.07 MB - Last synced at: 10 months ago - Pushed at: almost 3 years ago - Stars: 42 - Forks: 2

oncescuandreea/audio-retrieval

Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch

Language: Python - Size: 533 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 23 - Forks: 4

dvd125/Classification-of-musical-genres-and-music-retrival

Language: Jupyter Notebook - Size: 8.55 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

dariodellamura/Classification-of-musical-genres-and-music-retrieval

During the project for the DIGITAL SIGNAL IMAGE MANAGEMENT course I learned how to manage and process audio and image files. The aim of the project was the classification, through machine learning and deep learning models, of musical genres by extracting specific audio features from the "gtzan dataset" dataset files with which to train the models (SVM, Linear Regression, Decision tree , Random Forest, Neural Network). Mel spectograms were also extracted to train convolutional neural network models. In addition, the extracted audio features have been used to develop a model of music retrieval which given an audio track in input produces as output 5 audio tracks recommended meiante the use of cousine similarity.

Language: Jupyter Notebook - Size: 8.55 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Related Keywords

audio-retrieval 9 audio-processing 3 python 2 music-recognition 2 librosa 2 audio-classification 2 deep-learning 2 sound-event-detection 2 acoustic-scene-classification 2 audio-captioning 2 audio-generation 2 tensroflow 1 scikitlearn-machine-learning 1 neural-network 1 music 1 melspectrogram-classifications 1 melspectrogram 1 machine-learning 1 keras-tensorflow 1 classification 1 mel-spectrogram 1 cnn 1 icassp 1 text-based-retrieval 1 pytorch 1 interspeech 1 music-classification 1 text-audio 1 large-language-models 1 vocal-imitation 1 qbv 1 dual-encoder 1 contrastive-learning 1 zero-shot-learning 1 representation-learning 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos