GitHub topics: librosa
Super-Badmen-Viper/NineSong
NineSong aims to provide cloud-native, AI-extended solutions for data sharing in ToB and ToC businesses. It manages file metadata and metadata-derived business attributes, and targets application scenarios including, but not limited to, music, movies, notes, documents, photo albums, and e-book readers.
Language: Go - Size: 11.2 MB - Last synced at: about 17 hours ago - Pushed at: about 19 hours ago - Stars: 12 - Forks: 2

nhuyiuem/skill-sync
SkillSync is a platform designed for effective team collaboration and skill management. It enables organizations to track skills and tasks while ensuring secure access and real-time updates. 🛠️💻
Language: JavaScript - Size: 53.7 KB - Last synced at: about 22 hours ago - Pushed at: about 23 hours ago - Stars: 0 - Forks: 0

nishatPY/ADHD_recognition
ADHD_Recognition with personal voice
Language: Python - Size: 15.5 MB - Last synced at: about 23 hours ago - Pushed at: about 24 hours ago - Stars: 1 - Forks: 0

natgluons/ChronoSense
Personalized Sleep Optimizer App, a machine learning project that analyzes sleep audio using librosa, PyTorch, and scikit-learn to detect disturbances and optimize sleep quality through personalized recommendations.
Language: Python - Size: 5.86 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

librosa/librosa
Python library for audio and music analysis
Language: Python - Size: 33.5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 7,637 - Forks: 984

ankurbhatia24/MULTIMODAL-EMOTION-RECOGNITION
Human Emotion Understanding using multimodal dataset.
Language: Jupyter Notebook - Size: 5.72 MB - Last synced at: 1 day ago - Pushed at: almost 5 years ago - Stars: 98 - Forks: 24

jocoandonob/audio-processing
Audio processing script with AWS SageMaker.
Language: Python - Size: 1.15 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

Super-Badmen-Viper/NSMusicS
NSMusicS NineSong cloud-native music server and full-platform client; supports Navidrome, Jellyfin, and Emby
Language: TypeScript - Size: 722 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,967 - Forks: 90

KAIST-MACLab/PyTSMod
An open-source Python library for audio time-scale modification.
Language: Python - Size: 255 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 210 - Forks: 27

benkhelifamohamedtaher/speech-emotion-recognition
Deep learning system for emotion recognition from speech, achieving 50.5% accuracy on 8-class classification using transformer architecture and real-time analysis
Language: Python - Size: 1.56 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

aliyzd95/Emotion-Recognition-In-Persian-Speech-Using-Deep-Neural-Networks
This project aims to perform Emotion Recognition in Speech using Deep Neural Networks (DNNs)
Language: Python - Size: 29.3 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

ranimeshehata/Speech-Emotion-Recognition Fork of habibatarek26/Speech-Emotion-Recognition
Implementing a Speech Emotion Recognition (SER) system using deep learning. It extracts audio features from the CREMA-D dataset and trains both 1D and 2D Convolutional Neural Networks (CNNs) to classify emotions from speech.
Language: Jupyter Notebook - Size: 122 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

x4nth055/emotion-recognition-using-speech
Building and training a speech emotion recognizer that predicts human emotions using Python, scikit-learn, and Keras
Language: Python - Size: 944 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 631 - Forks: 242

Kazuhito00/Audio-Processing-Node-Editor
A node editor-based audio processing application intended for verifying and comparing processing methods
Language: Python - Size: 7.62 MB - Last synced at: 12 days ago - Pushed at: 21 days ago - Stars: 9 - Forks: 0

Demfier/multimodal-speech-emotion-recognition
Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)
Language: Jupyter Notebook - Size: 11.9 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 415 - Forks: 88

mokshhhhh/AudioCaptchaRecognizer
A conversational AI and speech synthesis project in which we develop and use a model to identify the audio CAPTCHAs often seen in websites' human verification.
Language: Python - Size: 13.6 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

paul92150/voice-emotion-recognition
Voice emotion recognition system using MFCC features and machine learning models.
Language: Python - Size: 20.5 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

sujalk777/Signal_systems_lab
This repository contains the assignments for the Signal Systems Laboratory course offered at IIT Jammu Autumn 24
Language: Python - Size: 1.12 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

DrStef/Deep-Learning-and-Digital-Signal-Processing-for-Environmental-Sound-Classification
Automatic environmental sound classification (ESC) based on ESC-50 dataset (and ESC-10 subset)
Language: Jupyter Notebook - Size: 18.7 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 9 - Forks: 0

Phoenix-95107/ADHD_recognition
ADHD_Recognition with personal voice
Language: Python - Size: 15.5 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 11 - Forks: 0

tiagoft/audio_to_midi
(monophonic) audio to midi converter using Python and librosa
Language: Python - Size: 65.4 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 106 - Forks: 13

fetzu/ImpressionMovieMaker
Better than Windows Movie Maker. Worse than an AAR.
Language: Python - Size: 271 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 3 - Forks: 1

sh4shv4t/SonicSig
SonicSig – A Flask web app that fuses classic audio fingerprinting with YAMNet embeddings for lightning-fast, high-accuracy song recognition. Modular code handles spectrogram peak hashing, deep-learning feature extraction, and secure file uploads, all wrapped in a clean UI and built for easy extension or cloud deployment.
Language: Python - Size: 13.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

yeyupiaoling/AudioClassification-PaddlePaddle
Audio classification implemented with PaddlePaddle, supporting models such as EcapaTdnn, PANNS, TDNN, Res2Net, and ResNetSE, along with multiple preprocessing methods
Language: Python - Size: 541 KB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 94 - Forks: 16

marcogdepinto/emotion-classification-from-audio-files
Understanding emotions from audio files using neural networks and multiple datasets.
Language: Python - Size: 646 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 417 - Forks: 137

melvinczyk/Personal-Website
This is my personal website, filled with many features that I think are cool.
Language: HTML - Size: 82.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ZionC27/Speech-Emotion-Recognition
Speech Emotion Recognition (SER) using Deep neural networks CNN and RNN
Language: Jupyter Notebook - Size: 31.3 KB - Last synced at: 22 days ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 1

gabiteodoru/liveaudio
real-time pitch tracking and audio processing for python
Language: Python - Size: 338 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

NeuroByte-Consulting/Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs
Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MFCCs)
Language: Jupyter Notebook - Size: 15.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0

musa11971/manhuw
Recognizing and identifying Quran reciters from audio recordings.
Language: Python - Size: 224 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

GianlucaPaolocci/Sound-classification-on-Raspberry-Pi-with-Tensorflow
This project presents a simple method for training an MLP neural network on audio signals. The trained model can be exported to a Raspberry Pi (model 2 or later suggested) to classify audio signals recorded with a USB microphone
Language: Python - Size: 385 KB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 98 - Forks: 30

nannib/audiodf
This program detects whether an audio message is a deepfake or genuine
Language: Python - Size: 52.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

xi/infinity-player
infinite jukebox clone using librosa
Language: Python - Size: 48.8 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 16 - Forks: 4

thekartikeyamishra/VoiceCloner
The Voice Cloner is a Python-based project that leverages Tacotron 2 and WaveGlow models for text-to-speech (TTS) synthesis and basic voice cloning. This project supports 22 official Indian languages, including Sanskrit, making it versatile for multilingual text input.
Language: Python - Size: 11.7 KB - Last synced at: about 23 hours ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

bits-bytes-nn/sound-anomaly-detection-with-autoencoders
MIMII Sound Anomaly Detection with AutoEncoders
Language: Jupyter Notebook - Size: 24.9 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 38 - Forks: 12

Sudemc/firstVoiceProject
🎵 Music Instrument Separation and Visualization Project. This project separates a music track into instrument and vocal components using Spleeter and Librosa. It also visualizes the spectral and temporal analysis of the audio signals.
Language: Python - Size: 2.13 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Apex077/Voice_Emotion_Recognition_App
A basic Python script that keeps users' voices, processes them using Librosa, and recognizes emotions using TF-Keras
Language: Jupyter Notebook - Size: 2.98 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

sanatren/signal_processing_and_speech_recognition
Practice exercises related to speech recognition and PyTorch for audio.
Language: Python - Size: 130 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

spotify/realbook
Easier audio-based machine learning with TensorFlow.
Language: Python - Size: 83 KB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 120 - Forks: 7

melvinczyk/Bird-classifier
A deep learning Alabama bird chirp classifier
Language: Python - Size: 39.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

YashviGarg/heart-rate-detection
The Heart Rate Monitoring System using Human Speech is a patent-pending deep learning project that analyzes speech signals to classify heart rates as normal or abnormal. This B.Tech final year project (Aug-Dec 2021) achieves 79% accuracy and 0.89 precision by leveraging advanced audio processing techniques.
Language: Python - Size: 38.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

librosa/data
Example (audio) data for use with librosa
Language: Python - Size: 14.4 MB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 9 - Forks: 1

Ztrimus/speech-emotion-recognition
Predicting various emotions in human speech signals by detecting the speech components affected by emotion.
Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 46 - Forks: 28

ryoha000/librosapp
A C++ implementation of stft, melspectrogram and mel_to_stft
Language: C++ - Size: 1.13 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 0

RijoSLal/Mickey
Mickey is an ML web app that captures emotions in music using LSTM and GRU-based neural networks built with TensorFlow. It features a FastAPI backend with Jinja templates for the frontend, and uses Librosa for audio processing. The system analyzes music to classify emotions, making it a useful tool for mood-based music recommendations.
Language: HTML - Size: 1.1 GB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

GeorgiosIoannouCoder/vera
Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. 🔊
Language: Jupyter Notebook - Size: 11.4 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

VasilijeJukic01/GTZAN-DeepAudio
Exploring audio features for classification and recommendation with the GTZAN using XGBoost, YAMNet, and KNN.
Language: Jupyter Notebook - Size: 7.15 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Franky1/speech-emotion-webapp Fork of CyberMaryVer/speech-emotion-webapp
Streamlit app forked for debugging purposes
Language: Python - Size: 118 MB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 2

nikiblauer/Music-Retrieval
This project is an implementation of the audio identification algorithm from the Original Shazam Paper (Wang, 2003). It uses audio fingerprinting and hash-based matching to enable efficient and accurate audio retrieval.
Language: Jupyter Notebook - Size: 12 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Edopramudya/Audio-Emotion-Analysis-using-STFT-Librosa.pyin
This project implements audio analysis using the Short-Time Fourier Transform (STFT) and Librosa.pyin to extract voice features such as fundamental frequency (F0) and sound intensity. The model can be used in various applications, including speech recognition, emotion detection, and music analysis.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

dannylee1020/music-genre-classification
music genre classification using 2D CNN, 1D CNN - LSTM and Librosa
Language: Python - Size: 33.7 MB - Last synced at: 7 days ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

oren-cohen/whatsmybitrate
Whatsmybitrate analyzes audio files for quality metrics such as bit rate, frequency, and codec type in bulk. It also generates spectrograms for visual representation of the audio spectrum. It supports a variety of audio formats, including MP3, FLAC, WAV, AAC, M4A, and more.
Language: Python - Size: 94.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 2

rohankrgupta/Orca-call-Classifier-Machine-learning
Advanced ML project: an orca call classifier using mel-spectrograms as audio representations to detect killer whales
Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 48 - Forks: 6

eslam69/Music-Recognizer
shazam-like app
Language: Python - Size: 167 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 1

scherroman/mugen
A command-line music video generator based on rhythm
Language: Python - Size: 23.3 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 229 - Forks: 41

AdityaDutt/Bird-Song-Classification
Classify bird species based on their songs using Siamese networks and 1D dilated convolutions.
Language: Python - Size: 2.83 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 17 - Forks: 8

akash-rajak/Volume-Suggester
Python Script to suggest the volume at which the music audio file needs to be played for better experience and feeling.
Language: Python - Size: 169 MB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1

hallowshaw/Speech-Emotion-Recognition-with-MFCC
A project to classify emotions like happiness, sadness, and anger from speech using MFCCs, machine learning models, and visualizations for audio features and model performance.
Language: Jupyter Notebook - Size: 969 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Vasugi2003/Fusion-AI---MultiModal-Persuvasiveness-Prediction
Developed a system to predict persuasiveness using multi-modal data (text, images, audio). Utilized BERT for text embeddings, ResNet for image features, and Librosa for audio analysis. Fused data from all modalities for enhanced prediction accuracy.
Language: Jupyter Notebook - Size: 770 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

AAC-Open-Source-Pool/Audio-Tamper-Detection
Detecting audio tampering using MFCC features and deep learning with TensorFlow/Keras for classification of authentic vs tampered audio
Language: Jupyter Notebook - Size: 73.2 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

machinelearningzuu/Data-Engineering-Process-of-Audio-Data
This repository covers the feature engineering process for audio signals in both the time domain and the frequency domain. It also contains Jupyter notebook implementations that use Python and librosa
Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

alihassanml/Speech-Recognition-System
This project implements a speech recognition system using the LibriSpeech dataset and the `librosa` library for feature extraction, alongside a deep learning model built with TensorFlow/Keras.
Language: Jupyter Notebook - Size: 466 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

SwapnilKumbhar/DSP-Project
Digital Signal Processing mini project: Autotune
Language: Python - Size: 9.77 KB - Last synced at: 18 days ago - Pushed at: over 7 years ago - Stars: 19 - Forks: 8

leafdesk/meeting-coach-fastapi
⚡ Okestro Meeting Coach FastAPI Server (Hansung Univ. Pre-Capstone Design)
Language: Python - Size: 249 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

adzialocha/tomomibot
Artificial intelligence bot for live voice improvisation
Language: Python - Size: 452 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 31 - Forks: 4

SappirBo/Audio-DSP-Playground
Educational DSP Audio Processor
Language: Python - Size: 84.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

albincorreya/ChromaCoverId
Methods to compute various chroma audio features and audio similarity measures particularly for the task of cover song identification
Language: Jupyter Notebook - Size: 13.3 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 25 - Forks: 10

ShehbazAlam/Bird-Voice-Classifier
A machine learning model integrated into a web app that classifies bird species based on their sounds
Language: JavaScript - Size: 44.7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

NiranjanChaudhari0929/Prediction-of-Insect-species-using-Acoustic-features
Prediction model built to predict the insect species using the acoustic data gathered.
Language: Python - Size: 6.84 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

sugarcane-mk/finetuning_wav2vec2
This repo provides a step-by-step process, from scratch, for fine-tuning Facebook's wav2vec2-large model using transformers
Language: Jupyter Notebook - Size: 42 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

GeorgiosIoannouCoder/vera-deployed-v2
Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. This is the 2nd deployed version of VERA. 🔊
Language: Jupyter Notebook - Size: 11 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Priyanshg0211/GARUD-ML
The Gunshot Detection and Localization System is designed to enhance the safety of military personnel by accurately detecting and localizing gunshots in real-time. This system utilizes a circular microphone array to capture audio, combined with advanced processing techniques for reliable detection and classification of gunshot sounds.
Language: Python - Size: 1.05 GB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

zashin-AI/project
Speech-Recognition STT Project
Language: Jupyter Notebook - Size: 26.8 MB - Last synced at: 15 days ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 0

NajdBinrabah/Deep-Learning-with-TensorFlow-and-Keras
This project explores emotion recognition in audio data, focusing on feature extraction techniques while also comparing the performance of LSTM and 1D CNN models.
Language: Jupyter Notebook - Size: 855 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

korseby/py3tag
Write tags to audio files (mp3, flac, and m4a are supported) based on their filenames
Language: Python - Size: 14.6 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

anuragmmer/bpm-analyser
Designed to offer an analysis of BPM (beats per minute) by examining the segments of an audio file. The script provides detailed insights, including overall BPM, modal BPM, standard deviation, and much more.
Language: Python - Size: 66.4 KB - Last synced at: 7 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

danyalimran93/Music-Genre-Classification
Classifying English music (.mp3) files using Music Information Retrieval (MIR), Digital/Audio Signal Processing (DSP), and Machine Learning (ML) strategies
Language: HTML - Size: 5.46 MB - Last synced at: 5 months ago - Pushed at: about 8 years ago - Stars: 33 - Forks: 13

rupeshs/audio-regen
Language: Python - Size: 1.95 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

matlab-deep-learning/Use-a-Python-Speech-Command-Recognition-System-to-MATLAB
Use a Python speech command recognition system in MATLAB
Language: MATLAB - Size: 1.32 MB - Last synced at: 23 days ago - Pushed at: 8 months ago - Stars: 6 - Forks: 4

GeorgiosIoannouCoder/vera-deployed
Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. This is the deployed version of Vera. 🔊
Language: Jupyter Notebook - Size: 8.96 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

SaadARazzaq/Speech-to-Text-Transformer
ASR with Facebook's Wav2Vec2 model for accurate 🎙️ to 📝 conversion.
Language: Jupyter Notebook - Size: 11.2 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

palak-463/TablaTaalRecognitionSystem
Software built using Python which makes use of CNN and FNN to detect the Taals of the Tabla, an Indian classical music instrument. 🎛️
Language: Python - Size: 3.6 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

MeidanGR/SpeechEmotionRecognition_Realtime
Speech Emotion Recognition (SER) in real time, using Long Short-Term Memory (LSTM) deep neural networks (DNNs).
Language: Jupyter Notebook - Size: 16.4 MB - Last synced at: 9 months ago - Pushed at: about 3 years ago - Stars: 87 - Forks: 19

terranivium/speech-emotion-recognition
Speech emotion recognition with PyTorch
Language: Jupyter Notebook - Size: 29.6 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 6 - Forks: 2

seanghay/soundcheck
A multi-processing audio check
Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

brayvid/engine-detection
Flatiron School Data Science Bootcamp Phase 4 Project
Language: Jupyter Notebook - Size: 2.01 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

radadiavasu/PyAudioVisualizer
Whole Audio Visualization in Python with multiple diagrams in streamlit.
Language: Python - Size: 15.4 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

grvnsh/ai-ektara-isolation-model
This Python script implements a neural network model for detecting the presence of an ektara in audio recordings.
Language: Python - Size: 10.7 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

matlab-deep-learning/Convert-librosa-Audio-Feature-Extraction-To-MATLAB
Convert librosa Audio Feature Extraction To MATLAB
Language: MATLAB - Size: 1.78 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

Arfa-Ahsan/AI-Project
An advanced system that uses Computer Vision and Audio Processing to automatically track and study wildlife, aiding in research, conservation, and security.
Size: 3.91 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

choppystick/chord_detection
Python implementation of chord-detection algorithms
Language: Python - Size: 6.46 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ykgautam09/Emotion_Detection_audio
This project uses the well-known CREMA-D dataset to classify human emotion using deep learning.
Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

bhattsameer/Eyeshield
Data Transmission Between two devices using Sound
Language: Python - Size: 301 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 6 - Forks: 4

khushijtrivedi/speech
The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.
Language: CSS - Size: 1.07 GB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

Kavayk29/Audio-classification-using-Python-Library
An audio classification project that uses Python libraries such as librosa to produce visual representations of audio files and NumPy to build data arrays for manipulation, then extracts features to train and test a CNN model.
Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

kookmin-sw/capstone-2022-15
IN4U - Interview practice web service
Language: Python - Size: 155 MB - Last synced at: 25 days ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 3

dipch/Audio-Classification-using-ML-DNN-LSTM
Includes visualization and feature extraction using Librosa, dataset creation from raw audio files, feature selection, model training (using different ML and DL methods), and hyperparameter tuning.
Language: Jupyter Notebook - Size: 6.42 MB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

dipch/Audio-Feature-Extraction-Librosa
This notebook demonstrates visualization and analysis of music and audio files using the Librosa python library.
Language: Jupyter Notebook - Size: 4 MB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

SAGARTR/Deep-Audio-Classifier-using-Machine-Learning
Languages used: Python. Developed and implemented a deep audio classifier using CNNs and LSTMs to accurately categorize diverse audio signals, achieving high accuracy and robustness. Utilized Python and TensorFlow for model development and training, incorporating data augmentation techniques to enhance performance
Language: Jupyter Notebook - Size: 104 KB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mehdihosseinimoghadam/Signal-Processing
Signal Processing with Python and Librosa
Language: Jupyter Notebook - Size: 46.6 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2
