An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: librosa

Super-Badmen-Viper/NineSong

NineSong aims to provide cloud-native, AI-extended data-sharing solutions for various ToB and ToC businesses. It manages file metadata and metadata-derived business attributes, and applies to scenarios including but not limited to music, movies, notes, documents, photo albums, and e-book readers.

Language: Go - Size: 11.2 MB - Last synced at: about 17 hours ago - Pushed at: about 19 hours ago - Stars: 12 - Forks: 2

nhuyiuem/skill-sync

SkillSync is a platform designed for effective team collaboration and skill management. It enables organizations to track skills and tasks while ensuring secure access and real-time updates. 🛠️💻

Language: JavaScript - Size: 53.7 KB - Last synced at: about 22 hours ago - Pushed at: about 23 hours ago - Stars: 0 - Forks: 0

nishatPY/ADHD_recognition

ADHD_Recognition with personal voice

Language: Python - Size: 15.5 MB - Last synced at: about 23 hours ago - Pushed at: about 24 hours ago - Stars: 1 - Forks: 0

natgluons/ChronoSense

Personalized Sleep Optimizer App, a machine learning project that analyzes sleep audio using librosa, PyTorch, and scikit-learn to detect disturbances and optimize sleep quality through personalized recommendations.

Language: Python - Size: 5.86 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

librosa/librosa

Python library for audio and music analysis

Language: Python - Size: 33.5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 7,637 - Forks: 984

ankurbhatia24/MULTIMODAL-EMOTION-RECOGNITION

Human Emotion Understanding using multimodal dataset.

Language: Jupyter Notebook - Size: 5.72 MB - Last synced at: 1 day ago - Pushed at: almost 5 years ago - Stars: 98 - Forks: 24

jocoandonob/audio-processing

Audio Processing Script with AWS SageMaker.

Language: Python - Size: 1.15 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

Super-Badmen-Viper/NSMusicS

NSMusicS NineSong Cloud-Native Music Server / Full-platform Client; supports navidrome, jellyfin, emby

Language: TypeScript - Size: 722 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,967 - Forks: 90

KAIST-MACLab/PyTSMod

An open-source Python library for audio time-scale modification.

Language: Python - Size: 255 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 210 - Forks: 27

benkhelifamohamedtaher/speech-emotion-recognition

Deep learning system for emotion recognition from speech, achieving 50.5% accuracy on 8-class classification using transformer architecture and real-time analysis

Language: Python - Size: 1.56 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

aliyzd95/Emotion-Recognition-In-Persian-Speech-Using-Deep-Neural-Networks

This project aims to perform Emotion Recognition in Speech using Deep Neural Networks (DNNs)

Language: Python - Size: 29.3 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

ranimeshehata/Speech-Emotion-Recognition (fork of habibatarek26/Speech-Emotion-Recognition)

Implementing a Speech Emotion Recognition (SER) system using deep learning. It extracts audio features from the CREMA-D dataset and trains both 1D and 2D Convolutional Neural Networks (CNNs) to classify emotions from speech.

Language: Jupyter Notebook - Size: 122 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

x4nth055/emotion-recognition-using-speech

Building and training a Speech Emotion Recognizer that predicts human emotions using Python, scikit-learn and Keras

Language: Python - Size: 944 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 631 - Forks: 242

Kazuhito00/Audio-Processing-Node-Editor

A node editor-based audio processing application intended for use in processing verification and comparison studies

Language: Python - Size: 7.62 MB - Last synced at: 12 days ago - Pushed at: 21 days ago - Stars: 9 - Forks: 0

Demfier/multimodal-speech-emotion-recognition

Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)

Language: Jupyter Notebook - Size: 11.9 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 415 - Forks: 88

mokshhhhh/AudioCaptchaRecognizer

A conversational AI / speech synthesis project in which we develop and use a model to identify the audio captchas often used in websites' human verification.

Language: Python - Size: 13.6 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

paul92150/voice-emotion-recognition

Voice emotion recognition system using MFCC features and machine learning models.

Language: Python - Size: 20.5 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

sujalk777/Signal_systems_lab

This repository contains the assignments for the Signal Systems Laboratory course offered at IIT Jammu Autumn 24

Language: Python - Size: 1.12 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

DrStef/Deep-Learning-and-Digital-Signal-Processing-for-Environmental-Sound-Classification

Automatic environmental sound classification (ESC) based on ESC-50 dataset (and ESC-10 subset)

Language: Jupyter Notebook - Size: 18.7 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 9 - Forks: 0

Phoenix-95107/ADHD_recognition

ADHD_Recognition with personal voice

Language: Python - Size: 15.5 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 11 - Forks: 0

tiagoft/audio_to_midi

(monophonic) audio to midi converter using Python and librosa

Language: Python - Size: 65.4 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 106 - Forks: 13

fetzu/ImpressionMovieMaker

Better than Windows Movie Maker. Worse than an AAR.

Language: Python - Size: 271 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 3 - Forks: 1

sh4shv4t/SonicSig

SonicSig – A Flask web app that fuses classic audio fingerprinting with YAMNet embeddings for lightning-fast, high-accuracy song recognition. Modular code handles spectrogram peak hashing, deep-learning feature extraction, and secure file uploads, all wrapped in a clean UI and built for easy extension or cloud deployment.

Language: Python - Size: 13.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

yeyupiaoling/AudioClassification-PaddlePaddle

Audio classification implemented with PaddlePaddle, supporting models such as EcapaTdnn, PANNS, TDNN, Res2Net, and ResNetSE, along with multiple preprocessing methods

Language: Python - Size: 541 KB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 94 - Forks: 16

marcogdepinto/emotion-classification-from-audio-files

Understanding emotions from audio files using neural networks and multiple datasets.

Language: Python - Size: 646 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 417 - Forks: 137

melvinczyk/Personal-Website

This is my personal website, filled with many features that I think are cool.

Language: HTML - Size: 82.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ZionC27/Speech-Emotion-Recognition

Speech Emotion Recognition (SER) using Deep neural networks CNN and RNN

Language: Jupyter Notebook - Size: 31.3 KB - Last synced at: 22 days ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 1

gabiteodoru/liveaudio

real-time pitch tracking and audio processing for python

Language: Python - Size: 338 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

NeuroByte-Consulting/Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs

Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MFCCs)

Language: Jupyter Notebook - Size: 15.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0

musa11971/manhuw

Recognizing and identifying Quran reciters from audio recordings.

Language: Python - Size: 224 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

GianlucaPaolocci/Sound-classification-on-Raspberry-Pi-with-Tensorflow

This project presents a simple method to train an MLP neural network on audio signals. The trained model can be exported to a Raspberry Pi (2 or later suggested) to classify audio signals recorded with a USB microphone

Language: Python - Size: 385 KB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 98 - Forks: 30

nannib/audiodf

This program can detect whether an audio message is a deepfake or genuine

Language: Python - Size: 52.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

xi/infinity-player

infinite jukebox clone using librosa

Language: Python - Size: 48.8 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 16 - Forks: 4

thekartikeyamishra/VoiceCloner

The Voice Cloner is a Python-based project that leverages Tacotron 2 and WaveGlow models for text-to-speech (TTS) synthesis and basic voice cloning. This project supports 22 official Indian languages, including Sanskrit, making it versatile for multilingual text input.

Language: Python - Size: 11.7 KB - Last synced at: about 23 hours ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

bits-bytes-nn/sound-anomaly-detection-with-autoencoders

MIMII Sound Anomaly Detection with AutoEncoders

Language: Jupyter Notebook - Size: 24.9 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 38 - Forks: 12

Sudemc/firstVoiceProject

🎵 Music Instrument Separation and Visualization Project. This project separates a piece of music into its instrument and vocal components using Spleeter and Librosa. It also visualizes the spectral and temporal analysis of the audio signals.

Language: Python - Size: 2.13 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Apex077/Voice_Emotion_Recognition_App

A basic Python script that stores users' voice recordings, processes them using Librosa, and recognizes emotions using TF-Keras

Language: Jupyter Notebook - Size: 2.98 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

sanatren/signal_processing_and_speech_recognition

Practice exercises related to speech recognition and PyTorch for audio.

Language: Python - Size: 130 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

spotify/realbook

Easier audio-based machine learning with TensorFlow.

Language: Python - Size: 83 KB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 120 - Forks: 7

melvinczyk/Bird-classifier

A deep learning Alabama bird chirp classifier

Language: Python - Size: 39.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

YashviGarg/heart-rate-detection

The Heart Rate Monitoring System using Human Speech is a patent-pending deep learning project that analyzes speech signals to classify heart rates as normal or abnormal. This B.Tech final year project (Aug-Dec 2021) achieves 79% accuracy and 0.89 precision by leveraging advanced audio processing techniques.

Language: Python - Size: 38.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

librosa/data

Example (audio) data for use with librosa

Language: Python - Size: 14.4 MB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 9 - Forks: 1

Ztrimus/speech-emotion-recognition

Predicting various emotions in human speech signals by detecting the speech components affected by human emotion.

Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 46 - Forks: 28

ryoha000/librosapp

A C++ implementation of stft, melspectrogram and mel_to_stft

Language: C++ - Size: 1.13 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 0

RijoSLal/Mickey

Mickey is an ML web app that captures emotions in music using LSTM and GRU-based neural networks built with TensorFlow. It features a FastAPI backend with Jinja templates for the frontend, and uses Librosa for audio processing. The system analyzes music to classify emotions, making it a powerful tool for mood-based music recommendations.

Language: HTML - Size: 1.1 GB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

GeorgiosIoannouCoder/vera

Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. 🔊

Language: Jupyter Notebook - Size: 11.4 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

VasilijeJukic01/GTZAN-DeepAudio

Exploring audio features for classification and recommendation with the GTZAN using XGBoost, YAMNet, and KNN.

Language: Jupyter Notebook - Size: 7.15 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Franky1/speech-emotion-webapp (fork of CyberMaryVer/speech-emotion-webapp)

Streamlit app forked for debugging purposes

Language: Python - Size: 118 MB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 2

nikiblauer/Music-Retrieval

This project is an implementation of the audio identification algorithm from the Original Shazam Paper (Wang, 2003). It uses audio fingerprinting and hash-based matching to enable efficient and accurate audio retrieval.

Language: Jupyter Notebook - Size: 12 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Edopramudya/Audio-Emotion-Analysis-using-STFT-Librosa.pyin

This project is an implementation of audio analysis using the Short-Time Fourier Transform (STFT) and Librosa.pyin to extract voice features such as fundamental frequency (F0) and sound intensity. The model can be used in various applications, including speech recognition, emotion detection, and music analysis.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

dannylee1020/music-genre-classification

music genre classification using 2D CNN, 1D CNN - LSTM and Librosa

Language: Python - Size: 33.7 MB - Last synced at: 7 days ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

oren-cohen/whatsmybitrate

Whatsmybitrate analyzes audio files for quality metrics such as bit rate, frequency, and codec type in bulk. It also generates spectrograms for visual representation of the audio spectrum. It supports a variety of audio formats, including MP3, FLAC, WAV, AAC, M4A, and more.

Language: Python - Size: 94.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 2

rohankrgupta/Orca-call-Classifier-Machine-learning

Advanced ML Project : An Orca Call classifier using mel-spectrograms as audio representations to detect Killer whales

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 48 - Forks: 6

eslam69/Music-Recognizer

shazam-like app

Language: Python - Size: 167 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 1

scherroman/mugen

A command-line music video generator based on rhythm

Language: Python - Size: 23.3 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 229 - Forks: 41

AdityaDutt/Bird-Song-Classification

Classify bird species based on their songs using Siamese Networks and 1D dilated convolutions.

Language: Python - Size: 2.83 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 17 - Forks: 8

akash-rajak/Volume-Suggester

Python Script to suggest the volume at which the music audio file needs to be played for better experience and feeling.

Language: Python - Size: 169 MB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1

hallowshaw/Speech-Emotion-Recognition-with-MFCC

A project to classify emotions like happiness, sadness, and anger from speech using MFCCs, machine learning models, and visualizations for audio features and model performance.

Language: Jupyter Notebook - Size: 969 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Vasugi2003/Fusion-AI---MultiModal-Persuvasiveness-Prediction

Developed a system to predict persuasiveness using multi-modal data (text, images, audio). Utilized BERT for text embeddings, ResNet for image features, and Librosa for audio analysis. Fused data from all modalities for enhanced prediction accuracy.

Language: Jupyter Notebook - Size: 770 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

AAC-Open-Source-Pool/Audio-Tamper-Detection

Detecting audio tampering using MFCC features and deep learning with TensorFlow/Keras for classification of authentic vs tampered audio

Language: Jupyter Notebook - Size: 73.2 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

machinelearningzuu/Data-Engineering-Process-of-Audio-Data

This repository covers the feature engineering process for audio signals in both the time domain and the frequency domain. In addition, it contains Jupyter notebook implementations that use Python and librosa

Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

alihassanml/Speech-Recognition-System

This project implements a speech recognition system using the LibriSpeech dataset and the `librosa` library for feature extraction, alongside a deep learning model built with TensorFlow/Keras.

Language: Jupyter Notebook - Size: 466 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

SwapnilKumbhar/DSP-Project

Digital Signal Processing mini project: Autotune

Language: Python - Size: 9.77 KB - Last synced at: 18 days ago - Pushed at: over 7 years ago - Stars: 19 - Forks: 8

leafdesk/meeting-coach-fastapi

⚡ Okestro Meeting Coach FastAPI Server (Hansung Univ. Pre-Capstone Design)

Language: Python - Size: 249 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

adzialocha/tomomibot

Artificial intelligence bot for live voice improvisation

Language: Python - Size: 452 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 31 - Forks: 4

SappirBo/Audio-DSP-Playground

Educational DSP Audio Processor

Language: Python - Size: 84.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

albincorreya/ChromaCoverId

Methods to compute various chroma audio features and audio similarity measures particularly for the task of cover song identification

Language: Jupyter Notebook - Size: 13.3 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 25 - Forks: 10

ShehbazAlam/Bird-Voice-Classifier

A Machine Learning Model integrated in a web app that classifies bird species based on their sound

Language: JavaScript - Size: 44.7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

NiranjanChaudhari0929/Prediction-of-Insect-species-using-Acoustic-features

Prediction model built to predict the insect species using the acoustic data gathered.

Language: Python - Size: 6.84 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

sugarcane-mk/finetuning_wav2vec2

This repo provides a step-by-step process, from scratch, to fine-tune Facebook's wav2vec2-large model using transformers

Language: Jupyter Notebook - Size: 42 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

GeorgiosIoannouCoder/vera-deployed-v2

Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. This is the 2nd deployed version of VERA. 🔊

Language: Jupyter Notebook - Size: 11 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Priyanshg0211/GARUD-ML

The Gunshot Detection and Localization System is designed to enhance the safety of military personnel by accurately detecting and localizing gunshots in real-time. This system utilizes a circular microphone array to capture audio, combined with advanced processing techniques for reliable detection and classification of gunshot sounds.

Language: Python - Size: 1.05 GB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

zashin-AI/project

Speech-Recognition STT Project

Language: Jupyter Notebook - Size: 26.8 MB - Last synced at: 15 days ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 0

NajdBinrabah/Deep-Learning-with-TensorFlow-and-Keras

This project explores emotion recognition in audio data, focusing on feature extraction techniques while also comparing the performance of LSTM and 1D CNN models.

Language: Jupyter Notebook - Size: 855 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

korseby/py3tag

Write tags to audio files (mp3, flac, and m4a are supported) based on their filenames

Language: Python - Size: 14.6 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

anuragmmer/bpm-analyser

Designed to offer an analysis of BPM (Beats Per Minute) by examining the segments of an audio file. The script provides detailed insights, including overall BPM, modal BPM, standard deviation, and much more.

Language: Python - Size: 66.4 KB - Last synced at: 7 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

danyalimran93/Music-Genre-Classification

Classifying English Music (.mp3) files using Music Information Retrieval (MIR), Digital/Audio Signal Processing (DSP) and Machine Learning (ML) Strategies

Language: HTML - Size: 5.46 MB - Last synced at: 5 months ago - Pushed at: about 8 years ago - Stars: 33 - Forks: 13

rupeshs/audio-regen

Language: Python - Size: 1.95 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

matlab-deep-learning/Use-a-Python-Speech-Command-Recognition-System-to-MATLAB

Use a Python speech command recognition system in MATLAB

Language: MATLAB - Size: 1.32 MB - Last synced at: 23 days ago - Pushed at: 8 months ago - Stars: 6 - Forks: 4

GeorgiosIoannouCoder/vera-deployed

Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. This is the deployed version of Vera. 🔊

Language: Jupyter Notebook - Size: 8.96 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

SaadARazzaq/Speech-to-Text-Transformer

ASR with Facebook's Wav2Vec2 model for accurate 🎙️ to 📝 conversion.

Language: Jupyter Notebook - Size: 11.2 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

palak-463/TablaTaalRecognitionSystem

Software built using Python which makes use of CNN and FNN to detect the Taals of the Tabla, an Indian classical music instrument. 🎛️

Language: Python - Size: 3.6 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

MeidanGR/SpeechEmotionRecognition_Realtime

Speech Emotion Recognition (SER) in real-time, using Deep Neural Networks (DNN) with Long Short-Term Memory (LSTM).

Language: Jupyter Notebook - Size: 16.4 MB - Last synced at: 9 months ago - Pushed at: about 3 years ago - Stars: 87 - Forks: 19

terranivium/speech-emotion-recognition

Speech emotion recognition with PyTorch

Language: Jupyter Notebook - Size: 29.6 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 6 - Forks: 2

seanghay/soundcheck

A multi-processing audio check

Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

brayvid/engine-detection

Flatiron School Data Science Bootcamp Phase 4 Project

Language: Jupyter Notebook - Size: 2.01 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

radadiavasu/PyAudioVisualizer

Whole Audio Visualization in Python with multiple diagrams in streamlit.

Language: Python - Size: 15.4 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

grvnsh/ai-ektara-isolation-model

This Python script implements a neural network model for detecting the presence of an ektara in audio recordings.

Language: Python - Size: 10.7 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

matlab-deep-learning/Convert-librosa-Audio-Feature-Extraction-To-MATLAB

Convert librosa Audio Feature Extraction To MATLAB

Language: MATLAB - Size: 1.78 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

Arfa-Ahsan/AI-Project

An advanced system that uses Computer Vision and Audio Processing to automatically track and study wildlife, aiding in research, conservation, and security.

Size: 3.91 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

choppystick/chord_detection

Python implementation of chord-detection algorithms

Language: Python - Size: 6.46 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ykgautam09/Emotion_Detection_audio

This project uses the well-known CREMA-D dataset to classify human emotion using deep learning.

Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

bhattsameer/Eyeshield

Data Transmission Between two devices using Sound

Language: Python - Size: 301 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 6 - Forks: 4

khushijtrivedi/speech

The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.

Language: CSS - Size: 1.07 GB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

Kavayk29/Audio-classification-using-Python-Library

This is an audio classification project that uses Python libraries such as librosa to create visual representations of the audio files and numpy to build data arrays for manipulation, then extracts features to train and test a CNN model for classification.

Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

kookmin-sw/capstone-2022-15

IN4U - an interview practice web service

Language: Python - Size: 155 MB - Last synced at: 25 days ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 3

dipch/Audio-Classification-using-ML-DNN-LSTM

Includes visualization and feature extraction using Librosa, dataset creation from raw audio files, feature selection, model training (using different ML and DL methods), and hyperparameter tuning.

Language: Jupyter Notebook - Size: 6.42 MB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

dipch/Audio-Feature-Extraction-Librosa

This notebook demonstrates visualization and analysis of music and audio files using the Librosa python library.

Language: Jupyter Notebook - Size: 4 MB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

SAGARTR/Deep-Audio-Classifier-using-Machine-Learning

Languages used: Python. Developed and implemented a deep audio classifier using CNNs and LSTMs to accurately categorize diverse audio signals, achieving high accuracy and robustness. Utilized Python and TensorFlow for model development and training, incorporating data augmentation techniques to enhance performance.

Language: Jupyter Notebook - Size: 104 KB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mehdihosseinimoghadam/Signal-Processing

Signal Processing with Python and Librosa

Language: Jupyter Notebook - Size: 46.6 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

Related Keywords
librosa (367), python (124), deep-learning (76), tensorflow (74), machine-learning (69), audio-processing (68), keras (57), python3 (43), audio (41), cnn (38), numpy (35), music (31), speech-recognition (26), audio-classification (26), audio-analysis (25), pytorch (25), pandas (24), matplotlib (24), mfcc (22), scikit-learn (20), flask (20), convolutional-neural-networks (19), speech-emotion-recognition (19), neural-network (19), classification (19), jupyter-notebook (18), lstm (17), emotion-recognition (17), keras-tensorflow (16), spectrogram (14), scipy (14), mfcc-features (13), feature-extraction (13), signal-processing (13), data-science (13), mel-spectrogram (12), sound-classification (12), music-information-retrieval (12), sklearn (12), deep-neural-networks (11), speech-processing (11), opencv (11), neural-networks (10), speech-to-text (10), cnn-keras (10), mlp-classifier (10), seaborn (10), emotion-detection (10), django (9), voice (9), streamlit (9), stft (8), speech (8), cnn-classification (7), wav (7), audio-visualizer (7), rnn (7), pyqt5 (7), digital-signal-processing (7), sound (7), torchaudio (6), voice-recognition (6), dsp (6), ffmpeg (6), emotion (6), deeplearning (6), melspectrogram (6), pyaudio (6), fourier-transform (6), cnn-model (6), urban-sound-classification (6), visualization (6), artificial-intelligence (5), ai (5), opensmile (5), tensorflow2 (5), nlp (5), moviepy (5), sound-processing (5), svm (5), fft (5), dataset (4), mel-spectrograms (4), transformer (4), music-classification (4), kaggle-dataset (4), raspberry-pi (4), spotify (4), transformers (4), torch (4), audio-signal-processing (4), mlp (4), genre-classification (4), classifier (4), fastapi (4), recurrent-neural-networks (4), artificial-neural-networks (4), crema-d (4), html-css-javascript (4), ravdess-dataset (4)