GitHub topics: audio-classification

Repositories

VATHSAN08/Mental-Health-Sentiment-Analysis-using-Deep-Learning

Mental Health Sentiment Analysis using Deep Learning. This project leverages deep learning to classify mental health-related sentiments from text into seven categories: Anxiety, Bipolar, Depression, Normal, Personality Disorder, Stress, and Suicidal. By utilizing advanced NLP techniques, we aim to enhance understanding and support for mental well

Language: Jupyter Notebook - Size: 4.12 MB - Last synced at: about 1 hour ago - Pushed at: about 2 hours ago - Stars: 1 - Forks: 0

UDA-IIT-Mandi/Unsupervised-Domain-Adaptation-Learning

This repository contains implementations of unsupervised domain adaptation techniques using Gradient Reversal Layer (GRL) and PaSST feature extractors across various datasets. The code was collected and modified from various GitHub sources as a learning exercise and precursor to our main research project.

Language: Jupyter Notebook - Size: 8.19 MB - Last synced at: about 21 hours ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

microsoft/Semi-supervised-learning

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

Language: Python - Size: 1.65 MB - Last synced at: about 9 hours ago - Pushed at: 1 day ago - Stars: 1,482 - Forks: 197

LENGKH/Voice_Classifier

A Python app for classifying voice recordings using KNN and SVM models. Includes a graphical interface for training, evaluating, and classifying audio data with acoustic descriptors. Designed for audio analysis and machine learning experimentation.

Size: 1000 Bytes - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 4 - Forks: 0

RBGTOP/Music-Genre-Recognition

Music genre classification using deep learning

Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 10 - Forks: 0

sergio-sanz-rodriguez/torchsuite

A Comprehensive PyTorch library for Deep Learning Modeling

Language: Python - Size: 102 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

rreezN/AudioBots

Audio Explorers Electrical Challenge 1, "Sound scene classifier for hearing aids" created by Team AudioBots.

Language: Python - Size: 112 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

RetroCirce/HTS-Audio-Transformer

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Language: Python - Size: 896 KB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 413 - Forks: 68

peterprospl12/breathing-classification-v2

This repository focuses on the classification of breathing sounds using machine learning techniques. It includes training, validation, and test data for developing and evaluating models.

Language: Python - Size: 856 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 1

Ladbaby/InsRec

🎹 A Musical Instrument Recognition App Using Neural Networks.

Language: Python - Size: 302 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

IBM/MAX-Audio-Classifier

Identify sounds in short audio clips

Language: Python - Size: 38.2 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 155 - Forks: 53

VGD3626/English_Accent_Detection Fork of Divyang029/English_Accent_Detection

Audio classification using transfer learning-based approach

Language: Jupyter Notebook - Size: 6.52 MB - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

HumanSignal/label-studio-frontend 📦

Data labeling react app that is backend agnostic and can be embedded into your applications — distributed as an NPM package

Language: JavaScript - Size: 102 MB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 433 - Forks: 321

phurwicz/hover

🚤 Label data at scale. Fun and precision included.

Language: Python - Size: 294 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 327 - Forks: 19

ashishpatel26/Best-Audio-Classification-Resources-with-Deep-learning

List of articles related to deep learning applied to music

Language: TeX - Size: 5.2 MB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 94 - Forks: 11

alessiopittiglio/mm-argfallacy

Language: Python - Size: 26.4 KB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

MohammedAly22/Accent-Detective

Accent Detective is a Streamlit-based application that detects if spoken language is English and then classifies the speaker's English accent from audio or video files. It uses OpenAI's Whisper for transcription and a Hugging Face model for accent classification.

Language: Python - Size: 479 KB - Last synced at: 11 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

MTG/DCASE-models

Python library for rapid prototyping of environmental sound analysis systems

Language: Jupyter Notebook - Size: 133 MB - Last synced at: 2 days ago - Pushed at: about 3 years ago - Stars: 43 - Forks: 5

Omar10lfc/Audio-Classification-Using-CNN

This project shows a comprehensive pipeline for urban sound classification using a CNN. It covers all stages from data exploration to model inference, providing visualizations and explanations. The approach leverages Mel spectrograms and a robust CNN architecture, achieving strong performance on the UrbanSound8K dataset.

Language: Jupyter Notebook - Size: 6.86 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

natgluons/ChronoSense

Personalized Sleep Optimizer App, a machine learning project that analyzes sleep audio using librosa, PyTorch, and scikit-learn to detect disturbances and optimize sleep quality through personalized recommendations.

Language: Python - Size: 5.86 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 1 - Forks: 0

ArmDeveloperEcosystem/ml-audio-classifier-example-for-pico

ML Audio Classifier Example for Pico 🔊🔥🔔

Language: Jupyter Notebook - Size: 42 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 67 - Forks: 24

Pooh555/AI_vs_human_generated_content_models

Infomatrix 2025

Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

towhee-io/examples

Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

Language: Jupyter Notebook - Size: 289 MB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 491 - Forks: 118

awsaf49/sonics

[ICLR 2025] SONICS: Synthetic Or Not - Identifying Counterfeit Songs

Language: Python - Size: 1.75 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 15 - Forks: 3

faizaliyaqat/Speech-emotion-recognition

Speech Emotion Recognition using Wav2Vec 2.0 + Random Forest. Real-time emotion detection system built with Streamlit, trained on RAVDESS and SAVEE datasets using Wav2Vec 2.0 features and a Random Forest classifier. Includes SHAP explainability and audio waveform visualization.

Language: Python - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

cwx-worst-one/EAT

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Language: Python - Size: 6.51 MB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 153 - Forks: 8

kmohammedsu/bird_sound_neural_network

Bird species classification from audio using deep learning and spectrogram analysis

Language: Jupyter Notebook - Size: 9.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

paul92150/voice-emotion-recognition

Voice emotion recognition system using MFCC features and machine learning models.

Language: Python - Size: 20.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Westlake-AI/SemiReward

[ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning

Language: Python - Size: 1.13 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 66 - Forks: 2

ksanjeevan/crnn-audio-classification

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

Language: Python - Size: 3.48 MB - Last synced at: 29 days ago - Pushed at: about 4 years ago - Stars: 390 - Forks: 80

rfcx/tfk-audio

Tools for TensorFlow/Keras audio recognition workflows

Language: Python - Size: 13.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 1

MaxiDonkey/DelphiHuggingFace

The Hugging Face API wrapper for Delphi leverages cutting-edge models to deliver powerful features, including object detection, music generation, text classification, sentiment analysis, image segmentation, speech-to-text transcription, and text generation.

Language: Pascal - Size: 666 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 4

aqibsaeed/Urban-Sound-Classification

Urban sound classification using Deep Learning

Language: Jupyter Notebook - Size: 9.97 MB - Last synced at: 27 days ago - Pushed at: almost 3 years ago - Stars: 517 - Forks: 244

ashleysally00/soundguard-genai-agent

SoundGuard is a GenAI agent that detects emergency sounds, explains what it hears, and responds like a smart assistant — built with YAMNet, Gradio, Google Cloud, and deployed on Hugging Face Spaces.

Language: Jupyter Notebook - Size: 2.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

yeyupiaoling/AudioClassification-PaddlePaddle

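Audio classification implemented with PaddlePaddle, supporting a variety of models such as EcapaTdnn, PANNS, TDNN, Res2Net, and ResNetSE, along with multiple preprocessing methods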

Language: Python - Size: 541 KB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 94 - Forks: 16

YuanGongND/whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Language: Python - Size: 20.5 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 376 - Forks: 30

AimiliosKourpas/sound-signal-processing

A Python-based system for automatic word segmentation in speech using ML models like SVM, MLP, and RNN.

Language: Python - Size: 3.34 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ahmed222220/Music-Genre-Recognition

Music-genre-classification-using-deep-learning

Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ParitoshParmar/Piano-Skills-Assessment

Piano Skills Assessment [IEEE MMSP 2021]

Language: Python - Size: 854 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 17 - Forks: 2

andremsouza/vision-aed-swine-barn-weak-labels

Code for the paper "Deep Learning Solutions for Audio Event Detection in a Swine Barn Using Environmental Audio and Weak Labels".

Language: Python - Size: 1.36 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

vatsalmehta2001/speech-emotion-recognition

Deep learning system for emotion recognition from speech, achieving 50.5% accuracy on 8-class classification using transformer architecture and real-time analysis

Language: Python - Size: 1.53 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

Nickaine1/Music-Genre-Recognition

Music-genre-classification-using-deep-learning

Size: 3.91 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

koudounasalkis/voc2vec

This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.

Language: Python - Size: 19.5 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 16 - Forks: 0

mgoltzsche/essentia-container

Docker container to retrieve musical information from audio data using Essentia extractors

Language: Dockerfile - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1

idaishe/Music-Genre-Recognition

Music-genre-classification-using-deep-learning

Size: 2.93 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

SiavashShams/ssamba

[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

Language: Python - Size: 1.88 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 118 - Forks: 9

CybLX/CNN_UrbanSound8K

Full pipeline for urban sound classification using PyTorch and the UrbanSound8K dataset. Converts audio into MEL spectrograms, applies data augmentation, and trains a CNN to recognize sounds like horns, barks, and sirens.

Language: Python - Size: 354 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

drscotthawley/panotti

A multi-channel neural network audio classifier using Keras

Language: Python - Size: 1.39 MB - Last synced at: 30 days ago - Pushed at: almost 4 years ago - Stars: 269 - Forks: 69

gibbona1/neal

NEAL (Nature+Energy Audio Labeller) is an open-source interactive audio data annotation tool.

Language: R - Size: 502 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 16 - Forks: 1

Sreyan88/Synthio

Code for ICLR 2025 Paper: Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Language: Python - Size: 2.29 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

JohannesBuchner/spoken-command-recognition

A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition

Language: Python - Size: 63.5 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 69 - Forks: 31

pooya-mohammadi/audio-classification-pytorch

This project provides several approaches for training and fine-tuning an audio gender recognition model. The code can be used for any other audio classification task by changing the number of classes and the input dataset.

Language: Jupyter Notebook - Size: 871 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 41 - Forks: 4

LHPT2009/Music-Genre-Recognition

Music genre classification using deep learning

Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

CouncilDataProject/speakerbox

Speakerbox: Fine-tune Audio Transformers for speaker identification.

Language: Python - Size: 17.7 MB - Last synced at: 4 days ago - Pushed at: 7 months ago - Stars: 56 - Forks: 6

dxspeeder/Music-Genre-Recognition

Music genre classification using deep learning

Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

nikitakunz/Voice-based-gender-classification

Audio classification (my solution to a contest problem)

Language: Jupyter Notebook - Size: 412 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Tirovo/EmotionAI-voice

An AI-powered application for detecting human emotions

Language: Python - Size: 9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

DevinWSoTuff/Music-Genre-Recognition

Music genre classification using deep learning

Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

jonnor/ESC-CNN-microcontroller

Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks

Language: Jupyter Notebook - Size: 32.5 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 102 - Forks: 20

sainathadapa/kaggle-freesound-audio-tagging 📦

8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)

Language: Python - Size: 31.3 KB - Last synced at: about 10 hours ago - Pushed at: over 4 years ago - Stars: 114 - Forks: 25

lvntky/audion

Offline Audio Fingerprinting & Recognition

Language: Java - Size: 59.6 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

FilipTirnanic96/mfcc_extraction

Implementation of Mel-Frequency Cepstral Coefficients (MFCC) extraction

Language: Python - Size: 46.7 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 2

NonBreathableAir/audiblez

Generate audiobooks from e-books

Size: 1000 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

mgoltzsche/beets-container

An opinionated, containerized beets distribution

Language: Makefile - Size: 162 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 4 - Forks: 0

JDSherbert/Audio-File-Guide

Simple guide to audio files and what you should use where.

Size: 412 KB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

emuell/AFEC

Cross platform audio feature extraction and sound classification tool

Language: C++ - Size: 128 MB - Last synced at: about 23 hours ago - Pushed at: 12 months ago - Stars: 22 - Forks: 4

herbitovich/genre-classification

Trivial music genre classification

Language: Jupyter Notebook - Size: 323 KB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

wyy511511/Chinese-Phonetic-Dictionary-Dataset

Chinese Phonetic Dataset with Homophone Clustering

Language: HTML - Size: 22 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

GeorgiosIoannouCoder/vera

Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. 🔊

Language: Jupyter Notebook - Size: 11.4 MB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 5 - Forks: 0

Kardbord/hfapigo

Unofficial (Golang) Go bindings for the Hugging Face Inference API

Language: Go - Size: 3.35 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 62 - Forks: 5

JonaKoenemann/DSKIM_audio_atmosphere_classification

Classification of the atmosphere in a stadium based on audio files.

Language: Jupyter Notebook - Size: 3.89 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

Hridxyz/Music-Genre-Classification

A deep learning model to classify music audio into 10 genres using Convolutional Neural Networks (CNNs). Achieved over 97% training accuracy and 90% validation accuracy.

Language: Jupyter Notebook - Size: 20 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

ashaydave/Audio-Classifier-For-Unity

Neural-network-based audio event tagging using YAMNet to organize audio assets in Unity.

Language: Python - Size: 58.6 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Alexyskoutnev/accent-detector

Speech Accent Detector

Language: Jupyter Notebook - Size: 8.13 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

otonomee/streamstem

Implements ML audio separation algorithm on audio from YouTube or Spotify resulting in "stems" for download (e.g. vocals, drums, bass) in MP3, WAV or FLAC.

Language: Python - Size: 186 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 27 - Forks: 3

Ruben165/speech-audio-processing-project

Speech-Audio Processing Projects

Language: Jupyter Notebook - Size: 9.02 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

shivendrra/ava

building AVA from ex-machina; a lightweight multi-modal system from scratch, just for learning & experimentation

Language: Python - Size: 12 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

sweat0198/audio_classification_CNN_ESC-50

Audio classification model which uses CNN to train ESC-50 dataset.

Language: Jupyter Notebook - Size: 37 MB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 2

AlvaroVasquezAI/Voice_Classifier

Language: Python - Size: 62.9 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

pavlosdais/Music-Genre-Recognition

Music genre classification using deep learning

Language: Jupyter Notebook - Size: 1.98 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

zh320/audio-classification-pytorch

Simplified PyTorch implementation of audio classification that supports multi-GPU training and validation, automatic mixed precision training, knowledge distillation, etc.

Language: Python - Size: 24.4 KB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

hibatillah/deep-learning

Text Sentiment Analysis and Audio Classification

Language: TypeScript - Size: 164 MB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

aliceheiman/watkins-marine-sound-model

Analyze and categorize marine mammal audio recordings using deep learning.

Language: Jupyter Notebook - Size: 81.8 MB - Last synced at: 2 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

asif-hanif/palm

[EMNLP 2024] Official code repository for the paper "PALM: Few-Shot Prompt Learning for Audio Language Models".

Language: Python - Size: 17.8 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 21 - Forks: 0

johnmartinsson/differentiable-mel-spectrogram

The official implementation of DMEL, the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer in neural networks".

Language: Python - Size: 2.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 18 - Forks: 0

UmarIgan/Machine-Learning

A set of jupyter notebooks

Language: Jupyter Notebook - Size: 16.7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 23 - Forks: 8

SameerKamani/Emotion-detection-from-audio-in-urdu

This project detects emotions from Urdu speech using deep learning. It focuses on classifying emotions like anger, happiness, sadness, and neutrality, using models like Wav2Vec2.0. The aim is to advance sentiment analysis for underrepresented languages like Urdu, with applications in mental health, customer service, and user experience.

Language: Jupyter Notebook - Size: 5.51 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0