An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: mfcc

8g6-new/c_spectrogram

A high performance spectrogram with STFT Mel and MFCC support in pure C

Language: C - Size: 190 MB - Last synced at: about 5 hours ago - Pushed at: about 15 hours ago - Stars: 4 - Forks: 0

NeuroByte-Consulting/Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs

Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MFCCs)

Language: Jupyter Notebook - Size: 15.6 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 9 - Forks: 0

stefantaubert/mel-cepstral-distance

A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the method proposed by Robert F. Kubichek in "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment".

Language: Python - Size: 59.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 52 - Forks: 10

Nickaine1/Music-Genre-Recognition

Music-genre-classification-using-deep-learning

Size: 3.91 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

RBGTOP/Music-Genre-Recognition

Music genre classification using deep learning

Size: 1.95 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 8 - Forks: 0

FragIt/fragit-main

FragIt main repository

Language: Python - Size: 529 KB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 26 - Forks: 12

idaishe/Music-Genre-Recognition

Music-genre-classification-using-deep-learning

Size: 2.93 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

ddbourgin/numpy-ml

Machine learning, in numpy

Language: Python - Size: 10 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 16,044 - Forks: 3,796

aubio/aubio

a library for audio and music analysis

Language: C - Size: 11.3 MB - Last synced at: 12 days ago - Pushed at: 9 months ago - Stars: 3,412 - Forks: 390

sp-nitech/diffsptk

A differentiable version of SPTK

Language: Python - Size: 1.59 MB - Last synced at: 7 days ago - Pushed at: 11 days ago - Stars: 180 - Forks: 15

SuperKogito/spafe

:sound: spafe: Simplified Python Audio Features Extraction

Language: Python - Size: 20.7 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 469 - Forks: 79

x4nth055/emotion-recognition-using-speech

Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras

Language: Python - Size: 944 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 621 - Forks: 242

tympanix/subsync

Synchronize your subtitles using machine learning

Language: Python - Size: 468 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 152 - Forks: 16

ZhuoZhuoCrayon/AcousticKeyBoard-Web

声学键盘|❓脑洞大开:做一个能听懂键盘敲击键位的「玩具」,学习信号处理 / 深度学习 / 安卓 / Django。

Language: Python - Size: 68.3 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 86 - Forks: 5

ar1st0crat/NWaves

.NET DSP library with a lot of audio processing functions

Language: C# - Size: 7.28 MB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 484 - Forks: 77

axelkrnwn/indo-speech-classification

Indonesia word speech recognition using MFCC, PCA, and random forest

Language: Python - Size: 939 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

csukuangfj/kaldifeat

Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API

Language: C++ - Size: 10.2 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 197 - Forks: 38

LHPT2009/Music-Genre-Recognition

Music genre classification using deep learning

Size: 0 Bytes - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

dxspeeder/Music-Genre-Recognition

Music genre classification using deep learning

Size: 5.86 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

SuperKogito/Voice-based-speaker-identification

:sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM

Language: Python - Size: 105 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 54 - Forks: 15

piruty/voice_actor_recog

Extract MFCC from movie files and detect speaker using it

Language: Python - Size: 372 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 5 - Forks: 0

SuperKogito/Voice-based-gender-recognition

:sound: :boy: :girl:Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)

Language: Python - Size: 8.96 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 213 - Forks: 68

DevinWSoTuff/Music-Genre-Recognition

Music genre classification using deep learning

Size: 5.86 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

SaiSrujanReddyP/MachineLearning

Vivalyse is an AI-powered ML model that assesses confidence and clarity in viva speeches using NLP and audio processing. From MFCC and text embeddings like BERT, GloVe, etc., it focuses on confidence and clarity for classification. The model ensures objective and fair evaluations applicable in education, HR, and AI-driven hiring.

Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

sp-nitech/SPTK

A suite of speech signal processing tools

Language: C++ - Size: 5.57 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 232 - Forks: 27

adamstark/Gist

A C++ Library for Audio Analysis

Language: C++ - Size: 938 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 378 - Forks: 76

fmdb/audio-features

Python-based CLI application to generate various audio feature-vectors from MP3/FLAC files.

Language: Python - Size: 30.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

mathquis/node-personal-wakeword

Personal wake word detector

Language: JavaScript - Size: 103 KB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 63 - Forks: 8

FragJage/SpeakerVoiceIdentifier

SpeakerVoiceIdentifier can recognize the voice of a speaker by learning.

Language: C++ - Size: 20.3 MB - Last synced at: about 1 month ago - Pushed at: about 8 years ago - Stars: 33 - Forks: 14

justanotherinternetguy/XSpeech

XSpeech: A Novel Deep Learning Approach to Classifying Stutters

Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 0

HassanHayat08/Interpretable-CNN-for-Big-Five-Personality-Traits-using-Audio-Data

We developed an interpretable CNN for big five personality traits using human speech data. This project discovers the different frequency patterns of a human voice with respect to each five personality traits. This project will help us to understand the apparent personality of a human using his/her voice.

Language: Python - Size: 16.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 14 - Forks: 3

DataXujing/ASR-paper

:fire: ASR教程: https://dataxujing.github.io/ASR-paper/

Size: 1.07 GB - Last synced at: 17 days ago - Pushed at: 10 months ago - Stars: 24 - Forks: 6

woov2/Covid19_Classification_AI_Challenge

[경진대회] COVID-19 검출 AI 모델 개발

Language: Jupyter Notebook - Size: 12 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Mike014/Audio-Classification

This is a prototype Django application that allows users to upload audio files and classify them using machine learning techniques.

Language: Python - Size: 7.43 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

ShoYamanishi/AndroidMFCC

26-Point MFCC & 512-Point FFT Generator & Visualizer in Java, C++, and NEON intrinsics

Language: C++ - Size: 6.02 MB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 15 - Forks: 2

pavlosdais/Music-Genre-Recognition

Music genre classification using deep learning

Language: Jupyter Notebook - Size: 1.98 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

php-guy55/ocx

Language: PHP - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

certainlyWrong/mfcc_bee

Implementação do algoritmo de extração de características em dart.

Language: Dart - Size: 332 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

hallowshaw/Speech-Emotion-Recognition-with-MFCC

A project to classify emotions like happiness, sadness, and anger from speech using MFCCs, machine learning models, and visualizations for audio features and model performance.

Language: Jupyter Notebook - Size: 969 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

brucewlee/LAMA-Music-Genre-Dataset

.wav files, training dataset (MFCC), and graph plots (FFTs, MFCCs, Waveforms) from Latin America, Asia, MiddleEast, and Africa

Language: Python - Size: 23.8 MB - Last synced at: 15 days ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

mradovic38/dtw-speech-recognition

Speech recognition system that uses feature extraction and dynamic time warping (DTW) to identify words and to find the most similar speaker.

Language: Python - Size: 29.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

baggepinnen/LPVSpectral.jl

Least-squares (sparse) spectral estimation and (sparse) LPV spectral decomposition.

Language: Julia - Size: 424 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 12 - Forks: 6

dhruvesh13/Audio-Genre-Classification

Automatic music genre classification using Machine Learning algorithms like- Logistic Regression and K-Nearest Neighbours

Language: Python - Size: 11.7 KB - Last synced at: 19 days ago - Pushed at: over 7 years ago - Stars: 19 - Forks: 11

SeyedMuhammadHosseinMousavi/Persian-Classical-Music-Instrument-Recognition-PCMIR-Using-a-Novel-Persian-Music-Database

Persian Classical Music Instrument Recognition (PCMIR) Using a Novel Persian Music Database

Language: MATLAB - Size: 19.7 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

jsingh811/pyAudioProcessing

Audio feature extraction and classification

Language: Python - Size: 22.9 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 222 - Forks: 39

xgwang1119/Wireless_Standard_Identification_WSI

To generate the waveform demo, for paper "Wireless Standard Identification via Mel Frequency Cepstrum" in IEEE Communications Letters, vol. 26, no. 11, pp. 2656-2660, Nov. 2022

Language: MATLAB - Size: 69.3 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

libAudioFlux/audioFlux

A library for audio and music analysis, feature extraction.

Language: C - Size: 7.11 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 2,796 - Forks: 118

loharmurtaza/FoG_detection_subject_dependent

This repository is based on my research work "Detecting Freezing of Gait in Parkinson's Disease Patients Using Multi-Modal Machine Learning"

Size: 626 KB - Last synced at: 14 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

JavierAntoran/moby_dick_whale_audio_detection

Feature extraction, HMMs, Neural Nets, and Boosting for Kaggle Cornell Whale detection challenge.

Language: Jupyter Notebook - Size: 36.1 MB - Last synced at: 16 days ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 1

ragibson/MFCC-speech-recognition

Real-time speech recognition via "Mel-Frequency Cepstral Coefficients" neural networks.

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: 1 day ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 0

mechanicalsea/spectra

Spectra extraction tutorials based on torch and torchaudio.

Language: Jupyter Notebook - Size: 3.31 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 40 - Forks: 4

PayalMh5/EmotionRecognition 📦

Emotion Recognition in Speech: This project leverages advanced machine learning techniques to classify emotions from speech using the Toronto Emotional Speech Set (TESS). By extracting Mel-Frequency Cepstral Coefficients (MFCC) and utilizing an LSTM-based deep learning model, the project accurately identifies emotions like anger, happiness, and sad

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

aubio/vamp-aubio-plugins

aubio plugins for Vamp

Language: C++ - Size: 440 KB - Last synced at: 7 days ago - Pushed at: over 7 years ago - Stars: 48 - Forks: 12

waldekmaciejko/utils

Various scripts for machine learning

Language: Jupyter Notebook - Size: 392 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

jaykejriwal/Stress-detection

Stress detection using non-semantic speech representation

Language: Jupyter Notebook - Size: 177 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

alicex2020/Mandarin-Tone-Classification

Deep learning using CNN for Mandarin Chinese tone classification

Language: Jupyter Notebook - Size: 489 KB - Last synced at: 9 months ago - Pushed at: about 6 years ago - Stars: 31 - Forks: 7

mathquis/node-gist

Node binding for the Gist Audio Analysis Library

Language: C++ - Size: 171 KB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 2

OzymandiasTheGreat/wakeword-zero Fork of mathquis/node-personal-wakeword 📦

Personal wake word detector, ported to TypeScript/WASM

Language: TypeScript - Size: 711 KB - Last synced at: about 6 hours ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

MycroftAI/sonopy

A simple audio feature extraction library

Language: Python - Size: 8.79 KB - Last synced at: about 22 hours ago - Pushed at: almost 6 years ago - Stars: 79 - Forks: 21

CodersAcademy006/Speech-Recognition-System

The objective of this DLM (Deep Learning Model) is to recognize the emotions from speech.

Language: Python - Size: 59.6 MB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

heoseongjin/Pytorch_Project

Pytorch, Yolov3, Cnn, Librosa, Mfcc

Language: Python - Size: 23.1 MB - Last synced at: 3 days ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 1

KK4TEE/audio-classification-tool

Visualize and listen to audio samples while creating a custom audio dataset for machine learning.

Language: Python - Size: 170 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

linto-ai/sfeatpy

Library to extract MFCC features from audio signal

Language: Python - Size: 18.6 KB - Last synced at: about 5 hours ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

jubinjacob03/genre-classification-recommendation_Spotify

Project for classifying audio files into different genres using the K-Nearest Neighbors (KNN) algorithm.

Language: Python - Size: 1.17 GB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

harshraj11584/Voice_Command_Recognition

[ML] [Audio Classification] Recognises Voice Commands from System Microphone, using MFCC, Random forests and MLPs

Language: Jupyter Notebook - Size: 49.5 MB - Last synced at: 11 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

ringabout/scim

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

Language: Nim - Size: 354 KB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 0

Papich23691/Speaker-Recognition

Speaker Recognition using MFCC feature vectors and GLA vector quantization models

Language: C - Size: 26.4 KB - Last synced at: 11 months ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 2

georgid/AlignmentDuration

Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.

Language: Python - Size: 342 MB - Last synced at: 11 months ago - Pushed at: about 5 years ago - Stars: 55 - Forks: 6

jan25/Speaker-Recognition

Language: Python - Size: 38 MB - Last synced at: 12 months ago - Pushed at: almost 9 years ago - Stars: 2 - Forks: 2

eigensharks/mfcc-speaker-recognition

Speaker Recognition deep learning model based on feature extraction from Mel Frequency Cepstral Coefficients

Language: Jupyter Notebook - Size: 834 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 1

devnithw/mfcc-speaker-recognition Fork of eigensharks/mfcc-speaker-recognition

Speaker Recognition deep learning model based on feature extraction from Mel Frequency Cepstral Coefficients. Solution code for Signal Processing Cup 2024.

Language: Jupyter Notebook - Size: 834 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

JavierAntoran/tiger-costume-voice-conversion

Voice Alignment and Conversion with Neural Networks and the WORLD codec.

Language: Jupyter Notebook - Size: 63.5 MB - Last synced at: 16 days ago - Pushed at: almost 6 years ago - Stars: 20 - Forks: 1

Beluga-T/Music-Genre-Classification-by-Neuron-Network-Models

Music genre Classification with different models fine tuning and performance comparisons

Language: Python - Size: 1.17 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Thorwig/Speaker-Recognition-AI

This project was originally developed for my own company, Delta Cognition, and later applied during my 2023 internship. It is a text-independent speaker recognition solution utilizing machine learning techniques.

Language: Python - Size: 4.25 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

dalabdgw/Hand_Landmark

Config files for my GitHub profile.

Language: Jupyter Notebook - Size: 3.65 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

reshalfahsi/music-genre-classification

Music Genre Classification using MFCC + ANN

Language: Jupyter Notebook - Size: 3.64 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

yashbhalgat/Emotion-from-speech-MFCC

Maltab code for extraction of Mel Frequency Cepstral Coefficients

Language: Matlab - Size: 279 KB - Last synced at: 5 months ago - Pushed at: about 9 years ago - Stars: 12 - Forks: 8

nnarenraju/sound-classification

Classification of Sounds Using Convolutional Neural Networks

Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 5 - Forks: 4

triabokon/signal-processing

KPI digital signal processing course

Language: Jupyter Notebook - Size: 9.21 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

fmeola/mfcc

TP2 Métodos Numéricos Avanzados 2C 2014

Language: Matlab - Size: 1.93 MB - Last synced at: about 1 year ago - Pushed at: over 10 years ago - Stars: 0 - Forks: 0

IhabBendidi/Voice-authentification-API

A RESTFUL API implementation of an authentification system using voice fingerprint

Language: Python - Size: 5.97 MB - Last synced at: 17 days ago - Pushed at: about 5 years ago - Stars: 24 - Forks: 2

wildanka/ASRBP

Speech Recognition experiment using MFCC Feature Extraction + Feed Forward Neural Network (training with Backpropagation)

Language: Java - Size: 104 KB - Last synced at: 6 months ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 2

Tonumoy/MFCCNet-A-Network-for-Earthquake-Early-Warning-Applications-using-Speech-Recognition-Techniques

A comparison between two Deep Learning Models to find an Optimum one for Real-Time EEW (Earthquake Early Warning) Applications

Language: Python - Size: 699 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

GuitarsAI/BasicsMusicalInstrumClassifi

Basics of Musical Instruments Classification using Machine Learning

Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 28 - Forks: 12

zafarrafii/Zaf-Julia

Zafar's Audio Functions in Julia for audio signal analysis: STFT, inverse STFT, CQT kernel, CQT spectrogram, CQT chromagram, MFCC, DCT, DST, MDCT, inverse MDCT.

Language: Jupyter Notebook - Size: 60.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

zafarrafii/Zaf-Matlab

Zafar's Audio Functions in Matlab for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.

Language: Jupyter Notebook - Size: 86 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 43 - Forks: 14

zafarrafii/Zaf-Python

Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.

Language: Jupyter Notebook - Size: 116 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 47 - Forks: 11

Saurabh620/Voice-Signal-Processing-using-Python-GUI

In this project we used TESS voice dataset and processed it and perform emotion prediction.

Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

srijachatterjee19/Bird-call-classifier

Implemented CNN and LSTM models in TensorFlow for classifying bird sounds across 10 species.

Language: Jupyter Notebook - Size: 162 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

zafarrafii/CQHC-Python

Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.

Language: Jupyter Notebook - Size: 84.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 23 - Forks: 1

FandosA/Singer_Recognition_Keras_TF

This project was my final Bachelor's degree thesis. In it I decided to mix my passion, music, and the syllabus that I liked the most in my degree, deep learning.

Language: Python - Size: 3.26 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

MoIzadloo/speaker-recognition

This repository houses a speaker recognition model built with MFCCs and machine learning to identify a specific target speaker in audio recordings.

Language: Jupyter Notebook - Size: 80.1 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Mixa26/Voice_controlled_drawing_interface

A simple AI drawing interface controlled by voice commands

Language: Jupyter Notebook - Size: 21 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

parvatijay2901/Footstep-Voice-Identification

MiiCare (Technical test): Detect the footstep

Language: Python - Size: 80.8 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

supikiti/PNCC

A implementation of Power Normalized Cepstral Coefficients: PNCC

Language: Python - Size: 25.4 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 47 - Forks: 10

robertocosta/vcr

Language: JavaScript - Size: 164 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

enter-opy/amygdala

song mood visualization plugin

Language: C++ - Size: 1.1 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

mohammadnabia/Matlab_DSP_MFCC

MATLAB code for audio signal processing, emphasizing Real Cepstrum and MFCC feature extraction. Reads a wave file, applies Hamming and Rectangular windows, then computes Real Cepstrum. Utilizes MATLAB's built-in functions for extracting MFCC features. Perfect for audio analysis and feature engineering.

Language: MATLAB - Size: 53.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ewan-xu/LibrosaCpp

LibrosaCpp is a c++ implemention of librosa to compute short-time fourier transform coefficients,mel spectrogram or mfcc

Language: C++ - Size: 2.55 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 139 - Forks: 39

U3or/voiceprint_cnn

Use MFCC+CNN to realize speaker voiceprint recognition, and it will be transplanted to embedded devices

Language: Jupyter Notebook - Size: 16.6 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Related Keywords
mfcc 238 machine-learning 58 python 46 deep-learning 43 audio-processing 31 speech-recognition 28 audio 24 keras 22 librosa 22 cnn 21 tensorflow 20 signal-processing 20 mel-spectrogram 18 gmm 18 mfcc-features 18 speaker-recognition 17 speech-processing 17 pytorch 15 dtw 15 feature-extraction 14 lstm 14 audio-analysis 14 spectrogram 13 voice-recognition 13 classification 13 svm 11 neural-network 11 audio-classification 11 mel-frequency-cepstral-coefficients 10 speech 10 matlab 10 voice 10 music 10 pitch 9 deep-neural-networks 9 neural-networks 9 emotion-recognition 9 music-analysis 9 genre-classification 9 asr 9 stft 8 scikit-learn 8 sklearn 8 hmm 8 lpc 8 fft 7 music-information-retrieval 7 digital-signal-processing 7 music-genre-recognition 7 pattern-recognition 7 gaussian-mixture-models 7 python3 6 cnn-keras 6 dsp 6 mfcc-analysis 5 sound 5 automatic-speech-recognition 5 speaker-verification 5 speaker-identification 5 rnn 5 dynamic-time-warping 5 mfcc-extractor 5 chroma 5 nlp 5 artificial-intelligence 5 spectrum 5 plp 4 flask 4 cnn-classification 4 matplotlib 4 recurrent-neural-networks 4 support-vector-machine 4 signal 4 gru 4 voice-activity-detection 4 random-forest 4 cpp 4 gender-classification 4 data-science 4 numpy 4 discrete-cosine-transform 4 analysis 4 dct 4 cqt-spectrogram 4 mel-filterbank 4 knn 4 short-time-fourier-transform 4 gradient-boosting 4 recognition 4 speech-emotion-recognition 4 keras-tensorflow 4 mdct 4 mir 3 sentiment-analysis 3 speech-to-text 3 knn-classification 3 jupyter-notebook 3 java 3 django 3 streamlit 3