GitHub topics: audio-classification
VATHSAN08/Mental-Health-Sentiment-Analysis-using-Deep-Learning
# Mental Health Sentiment Analysis using Deep LearningThis project leverages deep learning to classify mental health-related sentiments from text into seven categories: Anxiety, Bipolar, Depression, Normal, Personality Disorder, Stress, and Suicidal. By utilizing advanced NLP techniques, we aim to enhance understanding and support for mental well
Language: Jupyter Notebook - Size: 4.12 MB - Last synced at: about 1 hour ago - Pushed at: about 2 hours ago - Stars: 1 - Forks: 0

UDA-IIT-Mandi/Unsupervised-Domain-Adaptation-Learning
This repository contains implementations of unsupervised domain adaptation techniques using Gradient Reversal Layer (GRL) and PaSST feature extractors across various datasets. The code was collected and modified from various GitHub sources as a learning exercise and precursor to our main research project.
Language: Jupyter Notebook - Size: 8.19 MB - Last synced at: about 21 hours ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

microsoft/Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
Language: Python - Size: 1.65 MB - Last synced at: about 9 hours ago - Pushed at: 1 day ago - Stars: 1,482 - Forks: 197

LENGKH/Voice_Classifier
A Python app for classifying voice recordings using KNN and SVM models. Includes a graphical interface for training, evaluating, and classifying audio data with acoustic descriptors. Designed for audio analysis and machine learning experimentation.
Size: 1000 Bytes - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 4 - Forks: 0

RBGTOP/Music-Genre-Recognition
Music genre classification using deep learning
Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 10 - Forks: 0

sergio-sanz-rodriguez/torchsuite
A Comprehensive Pytorch library for Deep Learning Modeling
Language: Python - Size: 102 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

rreezN/AudioBots
Audio Explorers Electrical Challenge 1, "Sound scene classifier for hearing aids" created by Team AudioBots.
Language: Python - Size: 112 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

RetroCirce/HTS-Audio-Transformer
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
Language: Python - Size: 896 KB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 413 - Forks: 68

peterprospl12/breathing-classification-v2
This repository focuses on the classification of breathing sounds using machine learning techniques. It includes training, validation, and test data for developing and evaluating models.
Language: Python - Size: 856 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 1

Ladbaby/InsRec
🎹 A Musical Instrument Recognition App Using Neural Networks.
Language: Python - Size: 302 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

IBM/MAX-Audio-Classifier
Identify sounds in short audio clips
Language: Python - Size: 38.2 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 155 - Forks: 53

VGD3626/English_Accent_Detection Fork of Divyang029/English_Accent_Detection
Audio classification using transfer learning-based approach
Language: Jupyter Notebook - Size: 6.52 MB - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

HumanSignal/label-studio-frontend 📦
Data labeling react app that is backend agnostic and can be embedded into your applications — distributed as an NPM package
Language: JavaScript - Size: 102 MB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 433 - Forks: 321

phurwicz/hover
:speedboat: Label data at scale. Fun and precision included.
Language: Python - Size: 294 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 327 - Forks: 19

ashishpatel26/Best-Audio-Classification-Resources-with-Deep-learning
List of articles related to deep learning applied to music
Language: TeX - Size: 5.2 MB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 94 - Forks: 11

alessiopittiglio/mm-argfallacy
Language: Python - Size: 26.4 KB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

MohammedAly22/Accent-Detective
Accent Detective is a Streamlit-based application that detects if spoken language is English and then classifies the speaker's English accent from audio or video files. It uses OpenAI's Whisper for transcription and a Hugging Face model for accent classification.
Language: Python - Size: 479 KB - Last synced at: 11 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

MTG/DCASE-models
Python library for rapid prototyping of environmental sound analysis systems
Language: Jupyter Notebook - Size: 133 MB - Last synced at: 2 days ago - Pushed at: about 3 years ago - Stars: 43 - Forks: 5

Omar10lfc/Audio-Classification-Using-CNN
This Project Shows a comprehensive pipeline for urban sound classification using CNN. It covers all stages from data exploration to Model Inferance, providing visualizations, and explanations. The approach leverages Mel spectrograms and a robust CNN architecture, achieving strong performance on the UrbanSound8K dataset.
Language: Jupyter Notebook - Size: 6.86 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

natgluons/ChronoSense
Personalized Sleep Optimizer App, a machine learning project that analyzes sleep audio using librosa, PyTorch, and scikit-learn to detect disturbances and optimize sleep quality through personalized recommendations.
Language: Python - Size: 5.86 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 1 - Forks: 0

ArmDeveloperEcosystem/ml-audio-classifier-example-for-pico
ML Audio Classifier Example for Pico 🔊🔥🔔
Language: Jupyter Notebook - Size: 42 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 67 - Forks: 24

Pooh555/AI_vs_human_generated_content_models
Infomatrix 2025
Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

towhee-io/examples
Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.
Language: Jupyter Notebook - Size: 289 MB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 491 - Forks: 118

awsaf49/sonics
[ICLR 2025] SONICS: Synthetic Or Not - Identifying Counterfeit Songs
Language: Python - Size: 1.75 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 15 - Forks: 3

faizaliyaqat/Speech-emotion-recognition
Speech Emotion Recognition using Wav2Vec 2.0 + Random Forest Real-time emotion detection system built with Streamlit, trained on RAVDESS and SAVEE datasets using Wav2Vec 2.0 features and a Random Forest classifier. Includes SHAP explainability and audio waveform visualization.
Language: Python - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

cwx-worst-one/EAT
[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
Language: Python - Size: 6.51 MB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 153 - Forks: 8

kmohammedsu/bird_sound_neural_network
Bird species classification from audio using deep learning and spectrogram analysis
Language: Jupyter Notebook - Size: 9.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

paul92150/voice-emotion-recognition
Voice emotion recognition system using MFCC features and machine learning models.
Language: Python - Size: 20.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Westlake-AI/SemiReward
[ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning
Language: Python - Size: 1.13 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 66 - Forks: 2

ksanjeevan/crnn-audio-classification
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
Language: Python - Size: 3.48 MB - Last synced at: 29 days ago - Pushed at: about 4 years ago - Stars: 390 - Forks: 80

rfcx/tfk-audio
Tools for TensorFlow/Keras audio recognition workflows
Language: Python - Size: 13.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 1

MaxiDonkey/DelphiHuggingFace
The Hugging Face API wrapper for Delphi leverages cutting-edge models to deliver powerful features, including object detection, music generation, text classification, sentiment analysis, image segmentation, speech-to-text transcription, and text generation.
Language: Pascal - Size: 666 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 4

aqibsaeed/Urban-Sound-Classification
Urban sound classification using Deep Learning
Language: Jupyter Notebook - Size: 9.97 MB - Last synced at: 27 days ago - Pushed at: almost 3 years ago - Stars: 517 - Forks: 244

ashleysally00/soundguard-genai-agent
SoundGuard is a GenAI agent that detects emergency sounds, explains what it hears, and responds like a smart assistant — built with YAMNet, Gradio, Google Cloud, and deployed on Hugging Face Spaces.
Language: Jupyter Notebook - Size: 2.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

yeyupiaoling/AudioClassification-PaddlePaddle
基于PaddlePaddle实现的音频分类,支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型,还有多种预处理方法
Language: Python - Size: 541 KB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 94 - Forks: 16

YuanGongND/whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
Language: Python - Size: 20.5 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 376 - Forks: 30

AimiliosKourpas/sound-signal-processing
A Python-based system for automatic word segmentation in speech using ML models like SVM, MLP, and RNN.
Language: Python - Size: 3.34 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ahmed222220/Music-Genre-Recognition
Music-genre-classification-using-deep-learning
Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ParitoshParmar/Piano-Skills-Assessment
Piano Skills Assessment [IEEE MMSP 2021]
Language: Python - Size: 854 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 17 - Forks: 2

andremsouza/vision-aed-swine-barn-weak-labels
Code for the paper "Deep Learning Solutions for Audio Event Detection in a Swine Barn Using Environmental Audio and Weak Labels".
Language: Python - Size: 1.36 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

vatsalmehta2001/speech-emotion-recognition
Deep learning system for emotion recognition from speech, achieving 50.5% accuracy on 8-class classification using transformer architecture and real-time analysis
Language: Python - Size: 1.53 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

Nickaine1/Music-Genre-Recognition
Music-genre-classification-using-deep-learning
Size: 3.91 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

koudounasalkis/voc2vec
This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.
Language: Python - Size: 19.5 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 16 - Forks: 0

mgoltzsche/essentia-container
Docker container to retrieve musical information from audio data using Essentia extractors
Language: Dockerfile - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1

idaishe/Music-Genre-Recognition
Music-genre-classification-using-deep-learning
Size: 2.93 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

SiavashShams/ssamba
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
Language: Python - Size: 1.88 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 118 - Forks: 9

CybLX/CNN_UrbanSound8K
Full pipeline for urban sound classification using PyTorch and the UrbanSound8K dataset. Converts audio into MEL spectrograms, applies data augmentation, and trains a CNN to recognize sounds like horns, barks, and sirens.
Language: Python - Size: 354 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

drscotthawley/panotti
A multi-channel neural network audio classifier using Keras
Language: Python - Size: 1.39 MB - Last synced at: 30 days ago - Pushed at: almost 4 years ago - Stars: 269 - Forks: 69

gibbona1/neal
NEAL (Nature+Energy Audio Labeller) is an open-source interactive audio data annotation tool.
Language: R - Size: 502 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 16 - Forks: 1

Sreyan88/Synthio
Code for ICLR 2025 Paper: Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Language: Python - Size: 2.29 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

JohannesBuchner/spoken-command-recognition
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
Language: Python - Size: 63.5 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 69 - Forks: 31

pooya-mohammadi/audio-classification-pytorch
In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any other audio classification task by simply changing the number of classes and the input dataset.
Language: Jupyter Notebook - Size: 871 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 41 - Forks: 4

LHPT2009/Music-Genre-Recognition
Music genre classification using deep learning
Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

CouncilDataProject/speakerbox
Speakerbox: Fine-tune Audio Transformers for speaker identification.
Language: Python - Size: 17.7 MB - Last synced at: 4 days ago - Pushed at: 7 months ago - Stars: 56 - Forks: 6

dxspeeder/Music-Genre-Recognition
Music genre classification using deep learning
Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

nikitakunz/Voice-based-gender-classification
Audio classification (моё решение задачи с контеста)
Language: Jupyter Notebook - Size: 412 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Tirovo/EmotionAI-voice
An AI-powered application for detecting human emotions
Language: Python - Size: 9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

DevinWSoTuff/Music-Genre-Recognition
Music genre classification using deep learning
Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

jonnor/ESC-CNN-microcontroller
Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks
Language: Jupyter Notebook - Size: 32.5 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 102 - Forks: 20

sainathadapa/kaggle-freesound-audio-tagging 📦
8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)
Language: Python - Size: 31.3 KB - Last synced at: about 10 hours ago - Pushed at: over 4 years ago - Stars: 114 - Forks: 25

lvntky/audion
Offline Audio Fingerprinting & Recognition
Language: Java - Size: 59.6 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

FilipTirnanic96/mfcc_extraction
Implementation of Mel-Frequency Cepstral Coefficients (MFCC) extraction
Language: Python - Size: 46.7 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 2

NonBreathableAir/audiblez
Generate audiobooks from e-books
Size: 1000 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

mgoltzsche/beets-container
An opinionated, containerized beets distribution
Language: Makefile - Size: 162 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 4 - Forks: 0

JDSherbert/Audio-File-Guide
Simple guide to audio files and what you should use where.
Size: 412 KB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

emuell/AFEC
Cross platform audio feature extraction and sound classification tool
Language: C++ - Size: 128 MB - Last synced at: about 23 hours ago - Pushed at: 12 months ago - Stars: 22 - Forks: 4

herbitovich/genre-classification
Trivial music genre classification
Language: Jupyter Notebook - Size: 323 KB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

wyy511511/Chinese-Phonetic-Dictionary-Dataset
Chinese Phonetic Dataset with Homophone Clustering
Language: HTML - Size: 22 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

GeorgiosIoannouCoder/vera
Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. 🔊
Language: Jupyter Notebook - Size: 11.4 MB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 5 - Forks: 0

Kardbord/hfapigo
Unofficial (Golang) Go bindings for the Hugging Face Inference API
Language: Go - Size: 3.35 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 62 - Forks: 5

JonaKoenemann/DSKIM_audio_atmosphere_classification
Classification of the atmosphere in a stadium based on audio files.
Language: Jupyter Notebook - Size: 3.89 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

Hridxyz/Music-Genre-Classification
A deep learning model to classify music audio into 10 genres using Convolutional Neural Networks (CNNs). Achieved over 97% training accuracy and 90% validation accuracy.
Language: Jupyter Notebook - Size: 20 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

ashaydave/Audio-Classifier-For-Unity
Neural network based audio event tagging using YAMnet to organize audio assets in Unity.
Language: Python - Size: 58.6 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Alexyskoutnev/accent-detector
Speech Accent Detector
Language: Jupyter Notebook - Size: 8.13 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

otonomee/streamstem
Implements ML audio separation algorithm on audio from YouTube or Spotify resulting in "stems" for download (e.g. vocals, drums, bass) in MP3, WAV or FLAC.
Language: Python - Size: 186 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 27 - Forks: 3

Ruben165/speech-audio-processing-project
Speech-Audio Processing Projects
Language: Jupyter Notebook - Size: 9.02 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

shivendrra/ava
building AVA from ex-machina; a lightweight multi-modal system from scratch, just for learning & experimentation
Language: Python - Size: 12 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

sweat0198/audio_classification_CNN_ESC-50
Audio classification model which uses CNN to train ESC-50 dataset.
Language: Jupyter Notebook - Size: 37 MB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 2

AlvaroVasquezAI/Voice_Classifier
A Python app for classifying voice recordings using KNN and SVM models. Includes a graphical interface for training, evaluating, and classifying audio data with acoustic descriptors. Designed for audio analysis and machine learning experimentation.
Language: Python - Size: 62.9 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

pavlosdais/Music-Genre-Recognition
Music genre classification using deep learning
Language: Jupyter Notebook - Size: 1.98 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

zh320/audio-classification-pytorch
Simplified PyTorch implementation of audio classification, support multi-gpu training and validating, automatic mixed precision training, knowledge distillation etc.
Language: Python - Size: 24.4 KB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

hibatillah/deep-learning
Text Sentiment Analysis and Audio Classification
Language: TypeScript - Size: 164 MB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

aliceheiman/watkins-marine-sound-model
Analyze and categorize marine mammal audio recordings using deep learning.
Language: Jupyter Notebook - Size: 81.8 MB - Last synced at: 2 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

asif-hanif/palm
[EMNLP 2024] Official code repository of paper titled "PALM: Few-Shot Prompt Learning for Audio Language Models" accepted in EMNLP 2024 conference.
Language: Python - Size: 17.8 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 21 - Forks: 0

johnmartinsson/differentiable-mel-spectrogram
The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer in neural networks".
Language: Python - Size: 2.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 18 - Forks: 0

UmarIgan/Machine-Learning
A set of jupyter notebooks
Language: Jupyter Notebook - Size: 16.7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 23 - Forks: 8

SameerKamani/Emotion-detection-from-audio-in-urdu
This project detects emotions from Urdu speech using deep learning. It focuses on classifying emotions like anger, happiness, sadness, and neutrality, using models like Wav2Vec2.0. The aim is to advance sentiment analysis for underrepresented languages like Urdu, with applications in mental health, customer service, and user experience.
Language: Jupyter Notebook - Size: 5.51 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Labbeti/SSLH
Deep Semi-Supervised Learning with Holistic methods for audio classification.
Language: Jupyter Notebook - Size: 3.02 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 10 - Forks: 1

braydenoneal/neural-audio-classification
Audio classification using a neural network
Language: Python - Size: 1.52 MB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

WAT-ai/buble-ATS
Audio Temporal Segmentation & Sentiment Analysis
Language: Python - Size: 20.1 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

kaistmm/Audio-Mamba-AuM
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
Language: Python - Size: 10.7 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 108 - Forks: 13

nbrochec/realtimeIPTrecognition
Real-Time Instrumental Playing Techniques Recognition
Language: Python - Size: 637 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 4 - Forks: 0

farthornas/labear
Real-time sound monitoring using audio classification with deep learning
Language: Jupyter Notebook - Size: 44.7 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

gitmehrdad/FACE
Urban Sound Annotation and Classification
Language: Jupyter Notebook - Size: 55.7 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 9 - Forks: 4

stephanielees/BirdSoundClassification
Sound classification for classifying five birds
Language: Jupyter Notebook - Size: 983 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

chen0040/mxnet-audio
Implementation of music genre classification, audio-to-vec, song recommender, and music search in mxnet
Language: Python - Size: 15.7 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 55 - Forks: 15

mdfirman/CityNet
A neural network classifier for urban soundscapes
Language: Jupyter Notebook - Size: 54.2 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 5

ml13571/audio-classifier
Classification model to detect water, alarm and other sounds, including training, inference and dataset
Language: Python - Size: 6.31 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

GeorgiosIoannouCoder/vera-deployed-v2
Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. This is the 2nd deployed version of VERA. 🔊
Language: Jupyter Notebook - Size: 11 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

anyantudre/Audio-Transformers-Hugging-Face
Explore the application of transformers to audio data in this course. Learn to tackle tasks like speech recognition, audio classification, and text-to-speech generation using cutting-edge transformer models.
Language: Jupyter Notebook - Size: 7.12 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0
