GitHub topics: mfcc
8g6-new/c_spectrogram
A high performance spectrogram with STFT Mel and MFCC support in pure C
Language: C - Size: 190 MB - Last synced at: about 5 hours ago - Pushed at: about 15 hours ago - Stars: 4 - Forks: 0

NeuroByte-Consulting/Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs
Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MFCCs)
Language: Jupyter Notebook - Size: 15.6 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 9 - Forks: 0

stefantaubert/mel-cepstral-distance
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the method proposed by Robert F. Kubichek in "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment".
Language: Python - Size: 59.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 52 - Forks: 10

Nickaine1/Music-Genre-Recognition
Music-genre-classification-using-deep-learning
Size: 3.91 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

RBGTOP/Music-Genre-Recognition
Music genre classification using deep learning
Size: 1.95 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 8 - Forks: 0

FragIt/fragit-main
FragIt main repository
Language: Python - Size: 529 KB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 26 - Forks: 12

idaishe/Music-Genre-Recognition
Music-genre-classification-using-deep-learning
Size: 2.93 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

ddbourgin/numpy-ml
Machine learning, in numpy
Language: Python - Size: 10 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 16,044 - Forks: 3,796

aubio/aubio
a library for audio and music analysis
Language: C - Size: 11.3 MB - Last synced at: 12 days ago - Pushed at: 9 months ago - Stars: 3,412 - Forks: 390

sp-nitech/diffsptk
A differentiable version of SPTK
Language: Python - Size: 1.59 MB - Last synced at: 7 days ago - Pushed at: 11 days ago - Stars: 180 - Forks: 15

SuperKogito/spafe
:sound: spafe: Simplified Python Audio Features Extraction
Language: Python - Size: 20.7 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 469 - Forks: 79

x4nth055/emotion-recognition-using-speech
Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras
Language: Python - Size: 944 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 621 - Forks: 242

tympanix/subsync
Synchronize your subtitles using machine learning
Language: Python - Size: 468 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 152 - Forks: 16

ZhuoZhuoCrayon/AcousticKeyBoard-Web
声学键盘|❓脑洞大开:做一个能听懂键盘敲击键位的「玩具」,学习信号处理 / 深度学习 / 安卓 / Django。
Language: Python - Size: 68.3 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 86 - Forks: 5

ar1st0crat/NWaves
.NET DSP library with a lot of audio processing functions
Language: C# - Size: 7.28 MB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 484 - Forks: 77

axelkrnwn/indo-speech-classification
Indonesia word speech recognition using MFCC, PCA, and random forest
Language: Python - Size: 939 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Language: C++ - Size: 10.2 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 197 - Forks: 38

LHPT2009/Music-Genre-Recognition
Music genre classification using deep learning
Size: 0 Bytes - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

dxspeeder/Music-Genre-Recognition
Music genre classification using deep learning
Size: 5.86 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

SuperKogito/Voice-based-speaker-identification
:sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM
Language: Python - Size: 105 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 54 - Forks: 15

piruty/voice_actor_recog
Extract MFCC from movie files and detect speaker using it
Language: Python - Size: 372 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 5 - Forks: 0

SuperKogito/Voice-based-gender-recognition
:sound: :boy: :girl:Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)
Language: Python - Size: 8.96 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 213 - Forks: 68

DevinWSoTuff/Music-Genre-Recognition
Music genre classification using deep learning
Size: 5.86 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

SaiSrujanReddyP/MachineLearning
Vivalyse is an AI-powered ML model that assesses confidence and clarity in viva speeches using NLP and audio processing. From MFCC and text embeddings like BERT, GloVe, etc., it focuses on confidence and clarity for classification. The model ensures objective and fair evaluations applicable in education, HR, and AI-driven hiring.
Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

sp-nitech/SPTK
A suite of speech signal processing tools
Language: C++ - Size: 5.57 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 232 - Forks: 27

adamstark/Gist
A C++ Library for Audio Analysis
Language: C++ - Size: 938 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 378 - Forks: 76

fmdb/audio-features
Python-based CLI application to generate various audio feature-vectors from MP3/FLAC files.
Language: Python - Size: 30.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

mathquis/node-personal-wakeword
Personal wake word detector
Language: JavaScript - Size: 103 KB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 63 - Forks: 8

FragJage/SpeakerVoiceIdentifier
SpeakerVoiceIdentifier can recognize the voice of a speaker by learning.
Language: C++ - Size: 20.3 MB - Last synced at: about 1 month ago - Pushed at: about 8 years ago - Stars: 33 - Forks: 14

justanotherinternetguy/XSpeech
XSpeech: A Novel Deep Learning Approach to Classifying Stutters
Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 0

HassanHayat08/Interpretable-CNN-for-Big-Five-Personality-Traits-using-Audio-Data
We developed an interpretable CNN for big five personality traits using human speech data. This project discovers the different frequency patterns of a human voice with respect to each five personality traits. This project will help us to understand the apparent personality of a human using his/her voice.
Language: Python - Size: 16.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 14 - Forks: 3

DataXujing/ASR-paper
:fire: ASR教程: https://dataxujing.github.io/ASR-paper/
Size: 1.07 GB - Last synced at: 17 days ago - Pushed at: 10 months ago - Stars: 24 - Forks: 6

woov2/Covid19_Classification_AI_Challenge
[경진대회] COVID-19 검출 AI 모델 개발
Language: Jupyter Notebook - Size: 12 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Mike014/Audio-Classification
This is a prototype Django application that allows users to upload audio files and classify them using machine learning techniques.
Language: Python - Size: 7.43 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

ShoYamanishi/AndroidMFCC
26-Point MFCC & 512-Point FFT Generator & Visualizer in Java, C++, and NEON intrinsics
Language: C++ - Size: 6.02 MB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 15 - Forks: 2

pavlosdais/Music-Genre-Recognition
Music genre classification using deep learning
Language: Jupyter Notebook - Size: 1.98 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

php-guy55/ocx
Language: PHP - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

certainlyWrong/mfcc_bee
Implementação do algoritmo de extração de características em dart.
Language: Dart - Size: 332 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

hallowshaw/Speech-Emotion-Recognition-with-MFCC
A project to classify emotions like happiness, sadness, and anger from speech using MFCCs, machine learning models, and visualizations for audio features and model performance.
Language: Jupyter Notebook - Size: 969 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

brucewlee/LAMA-Music-Genre-Dataset
.wav files, training dataset (MFCC), and graph plots (FFTs, MFCCs, Waveforms) from Latin America, Asia, MiddleEast, and Africa
Language: Python - Size: 23.8 MB - Last synced at: 15 days ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

mradovic38/dtw-speech-recognition
Speech recognition system that uses feature extraction and dynamic time warping (DTW) to identify words and to find the most similar speaker.
Language: Python - Size: 29.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

baggepinnen/LPVSpectral.jl
Least-squares (sparse) spectral estimation and (sparse) LPV spectral decomposition.
Language: Julia - Size: 424 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 12 - Forks: 6

dhruvesh13/Audio-Genre-Classification
Automatic music genre classification using Machine Learning algorithms like- Logistic Regression and K-Nearest Neighbours
Language: Python - Size: 11.7 KB - Last synced at: 19 days ago - Pushed at: over 7 years ago - Stars: 19 - Forks: 11

SeyedMuhammadHosseinMousavi/Persian-Classical-Music-Instrument-Recognition-PCMIR-Using-a-Novel-Persian-Music-Database
Persian Classical Music Instrument Recognition (PCMIR) Using a Novel Persian Music Database
Language: MATLAB - Size: 19.7 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

jsingh811/pyAudioProcessing
Audio feature extraction and classification
Language: Python - Size: 22.9 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 222 - Forks: 39

xgwang1119/Wireless_Standard_Identification_WSI
To generate the waveform demo, for paper "Wireless Standard Identification via Mel Frequency Cepstrum" in IEEE Communications Letters, vol. 26, no. 11, pp. 2656-2660, Nov. 2022
Language: MATLAB - Size: 69.3 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

libAudioFlux/audioFlux
A library for audio and music analysis, feature extraction.
Language: C - Size: 7.11 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 2,796 - Forks: 118

loharmurtaza/FoG_detection_subject_dependent
This repository is based on my research work "Detecting Freezing of Gait in Parkinson's Disease Patients Using Multi-Modal Machine Learning"
Size: 626 KB - Last synced at: 14 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

JavierAntoran/moby_dick_whale_audio_detection
Feature extraction, HMMs, Neural Nets, and Boosting for Kaggle Cornell Whale detection challenge.
Language: Jupyter Notebook - Size: 36.1 MB - Last synced at: 16 days ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 1

ragibson/MFCC-speech-recognition
Real-time speech recognition via "Mel-Frequency Cepstral Coefficients" neural networks.
Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: 1 day ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 0

mechanicalsea/spectra
Spectra extraction tutorials based on torch and torchaudio.
Language: Jupyter Notebook - Size: 3.31 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 40 - Forks: 4

PayalMh5/EmotionRecognition 📦
Emotion Recognition in Speech: This project leverages advanced machine learning techniques to classify emotions from speech using the Toronto Emotional Speech Set (TESS). By extracting Mel-Frequency Cepstral Coefficients (MFCC) and utilizing an LSTM-based deep learning model, the project accurately identifies emotions like anger, happiness, and sad
Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

aubio/vamp-aubio-plugins
aubio plugins for Vamp
Language: C++ - Size: 440 KB - Last synced at: 7 days ago - Pushed at: over 7 years ago - Stars: 48 - Forks: 12

waldekmaciejko/utils
Various scripts for machine learning
Language: Jupyter Notebook - Size: 392 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

jaykejriwal/Stress-detection
Stress detection using non-semantic speech representation
Language: Jupyter Notebook - Size: 177 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

alicex2020/Mandarin-Tone-Classification
Deep learning using CNN for Mandarin Chinese tone classification
Language: Jupyter Notebook - Size: 489 KB - Last synced at: 9 months ago - Pushed at: about 6 years ago - Stars: 31 - Forks: 7

mathquis/node-gist
Node binding for the Gist Audio Analysis Library
Language: C++ - Size: 171 KB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 2

OzymandiasTheGreat/wakeword-zero Fork of mathquis/node-personal-wakeword 📦
Personal wake word detector, ported to TypeScript/WASM
Language: TypeScript - Size: 711 KB - Last synced at: about 6 hours ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

MycroftAI/sonopy
A simple audio feature extraction library
Language: Python - Size: 8.79 KB - Last synced at: about 22 hours ago - Pushed at: almost 6 years ago - Stars: 79 - Forks: 21

CodersAcademy006/Speech-Recognition-System
The objective of this DLM (Deep Learning Model) is to recognize the emotions from speech.
Language: Python - Size: 59.6 MB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

heoseongjin/Pytorch_Project
Pytorch, Yolov3, Cnn, Librosa, Mfcc
Language: Python - Size: 23.1 MB - Last synced at: 3 days ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 1

KK4TEE/audio-classification-tool
Visualize and listen to audio samples while creating a custom audio dataset for machine learning.
Language: Python - Size: 170 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

linto-ai/sfeatpy
Library to extract MFCC features from audio signal
Language: Python - Size: 18.6 KB - Last synced at: about 5 hours ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

jubinjacob03/genre-classification-recommendation_Spotify
Project for classifying audio files into different genres using the K-Nearest Neighbors (KNN) algorithm.
Language: Python - Size: 1.17 GB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

harshraj11584/Voice_Command_Recognition
[ML] [Audio Classification] Recognises Voice Commands from System Microphone, using MFCC, Random forests and MLPs
Language: Jupyter Notebook - Size: 49.5 MB - Last synced at: 11 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

ringabout/scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Language: Nim - Size: 354 KB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 0

Papich23691/Speaker-Recognition
Speaker Recognition using MFCC feature vectors and GLA vector quantization models
Language: C - Size: 26.4 KB - Last synced at: 11 months ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 2

georgid/AlignmentDuration
Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.
Language: Python - Size: 342 MB - Last synced at: 11 months ago - Pushed at: about 5 years ago - Stars: 55 - Forks: 6

jan25/Speaker-Recognition
Language: Python - Size: 38 MB - Last synced at: 12 months ago - Pushed at: almost 9 years ago - Stars: 2 - Forks: 2

eigensharks/mfcc-speaker-recognition
Speaker Recognition deep learning model based on feature extraction from Mel Frequency Cepstral Coefficients
Language: Jupyter Notebook - Size: 834 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 1

devnithw/mfcc-speaker-recognition Fork of eigensharks/mfcc-speaker-recognition
Speaker Recognition deep learning model based on feature extraction from Mel Frequency Cepstral Coefficients. Solution code for Signal Processing Cup 2024.
Language: Jupyter Notebook - Size: 834 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

JavierAntoran/tiger-costume-voice-conversion
Voice Alignment and Conversion with Neural Networks and the WORLD codec.
Language: Jupyter Notebook - Size: 63.5 MB - Last synced at: 16 days ago - Pushed at: almost 6 years ago - Stars: 20 - Forks: 1

Beluga-T/Music-Genre-Classification-by-Neuron-Network-Models
Music genre Classification with different models fine tuning and performance comparisons
Language: Python - Size: 1.17 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Thorwig/Speaker-Recognition-AI
This project was originally developed for my own company, Delta Cognition, and later applied during my 2023 internship. It is a text-independent speaker recognition solution utilizing machine learning techniques.
Language: Python - Size: 4.25 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

dalabdgw/Hand_Landmark
Config files for my GitHub profile.
Language: Jupyter Notebook - Size: 3.65 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

reshalfahsi/music-genre-classification
Music Genre Classification using MFCC + ANN
Language: Jupyter Notebook - Size: 3.64 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

yashbhalgat/Emotion-from-speech-MFCC
Maltab code for extraction of Mel Frequency Cepstral Coefficients
Language: Matlab - Size: 279 KB - Last synced at: 5 months ago - Pushed at: about 9 years ago - Stars: 12 - Forks: 8

nnarenraju/sound-classification
Classification of Sounds Using Convolutional Neural Networks
Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 5 - Forks: 4

triabokon/signal-processing
KPI digital signal processing course
Language: Jupyter Notebook - Size: 9.21 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

fmeola/mfcc
TP2 Métodos Numéricos Avanzados 2C 2014
Language: Matlab - Size: 1.93 MB - Last synced at: about 1 year ago - Pushed at: over 10 years ago - Stars: 0 - Forks: 0

IhabBendidi/Voice-authentification-API
A RESTFUL API implementation of an authentification system using voice fingerprint
Language: Python - Size: 5.97 MB - Last synced at: 17 days ago - Pushed at: about 5 years ago - Stars: 24 - Forks: 2

wildanka/ASRBP
Speech Recognition experiment using MFCC Feature Extraction + Feed Forward Neural Network (training with Backpropagation)
Language: Java - Size: 104 KB - Last synced at: 6 months ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 2

Tonumoy/MFCCNet-A-Network-for-Earthquake-Early-Warning-Applications-using-Speech-Recognition-Techniques
A comparison between two Deep Learning Models to find an Optimum one for Real-Time EEW (Earthquake Early Warning) Applications
Language: Python - Size: 699 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

GuitarsAI/BasicsMusicalInstrumClassifi
Basics of Musical Instruments Classification using Machine Learning
Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 28 - Forks: 12

zafarrafii/Zaf-Julia
Zafar's Audio Functions in Julia for audio signal analysis: STFT, inverse STFT, CQT kernel, CQT spectrogram, CQT chromagram, MFCC, DCT, DST, MDCT, inverse MDCT.
Language: Jupyter Notebook - Size: 60.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

zafarrafii/Zaf-Matlab
Zafar's Audio Functions in Matlab for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
Language: Jupyter Notebook - Size: 86 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 43 - Forks: 14

zafarrafii/Zaf-Python
Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
Language: Jupyter Notebook - Size: 116 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 47 - Forks: 11

Saurabh620/Voice-Signal-Processing-using-Python-GUI
In this project we used TESS voice dataset and processed it and perform emotion prediction.
Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

srijachatterjee19/Bird-call-classifier
Implemented CNN and LSTM models in TensorFlow for classifying bird sounds across 10 species.
Language: Jupyter Notebook - Size: 162 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

zafarrafii/CQHC-Python
Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.
Language: Jupyter Notebook - Size: 84.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 23 - Forks: 1

FandosA/Singer_Recognition_Keras_TF
This project was my final Bachelor's degree thesis. In it I decided to mix my passion, music, and the syllabus that I liked the most in my degree, deep learning.
Language: Python - Size: 3.26 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

MoIzadloo/speaker-recognition
This repository houses a speaker recognition model built with MFCCs and machine learning to identify a specific target speaker in audio recordings.
Language: Jupyter Notebook - Size: 80.1 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Mixa26/Voice_controlled_drawing_interface
A simple AI drawing interface controlled by voice commands
Language: Jupyter Notebook - Size: 21 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

parvatijay2901/Footstep-Voice-Identification
MiiCare (Technical test): Detect the footstep
Language: Python - Size: 80.8 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

supikiti/PNCC
A implementation of Power Normalized Cepstral Coefficients: PNCC
Language: Python - Size: 25.4 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 47 - Forks: 10

robertocosta/vcr
Language: JavaScript - Size: 164 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

enter-opy/amygdala
song mood visualization plugin
Language: C++ - Size: 1.1 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

mohammadnabia/Matlab_DSP_MFCC
MATLAB code for audio signal processing, emphasizing Real Cepstrum and MFCC feature extraction. Reads a wave file, applies Hamming and Rectangular windows, then computes Real Cepstrum. Utilizes MATLAB's built-in functions for extracting MFCC features. Perfect for audio analysis and feature engineering.
Language: MATLAB - Size: 53.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ewan-xu/LibrosaCpp
LibrosaCpp is a c++ implemention of librosa to compute short-time fourier transform coefficients,mel spectrogram or mfcc
Language: C++ - Size: 2.55 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 139 - Forks: 39

U3or/voiceprint_cnn
Use MFCC+CNN to realize speaker voiceprint recognition, and it will be transplanted to embedded devices
Language: Jupyter Notebook - Size: 16.6 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
