GitHub topics: mfcc

Repositories

libAudioFlux/audioFlux

A library for audio and music analysis, feature extraction.

Language: C - Size: 7.11 MB - Last synced at: about 10 hours ago - Pushed at: 12 months ago - Stars: 3,054 - Forks: 132

aubio/aubio

a library for audio and music analysis

Language: C - Size: 11.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3,432 - Forks: 391

sp-nitech/diffsptk

A differentiable version of SPTK

Language: Python - Size: 1.65 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 182 - Forks: 16

sp-nitech/SPTK

A suite of speech signal processing tools

Language: C++ - Size: 5.57 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 232 - Forks: 27

ar1st0crat/NWaves

.NET DSP library with a lot of audio processing functions

Language: C# - Size: 7.28 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 487 - Forks: 77

RBGTOP/Music-Genre-Recognition

Music genre classification using deep learning

Size: 1.95 KB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 8 - Forks: 0

stefantaubert/mel-cepstral-distance

A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the method proposed by Robert F. Kubichek in "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment".

Language: Python - Size: 59.8 MB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 53 - Forks: 10

SuperKogito/spafe

:sound: spafe: Simplified Python Audio Features Extraction

Language: Python - Size: 20.7 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 471 - Forks: 79

ahmed222220/Music-Genre-Recognition

Music-genre-classification-using-deep-learning

Size: 0 Bytes - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

jsingh811/pyAudioProcessing

Audio feature extraction and classification

Language: Python - Size: 22.9 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 225 - Forks: 39

rusenaite/speaker-identification-using-ML

A speaker recognition system using machine learning (SVM) with MFCC, chroma, and tonnetz features extracted from short audio clips.

Language: Jupyter Notebook - Size: 28.9 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

ddbourgin/numpy-ml

Machine learning, in numpy

Language: Python - Size: 10 MB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 16,067 - Forks: 3,801

SuyashMore/MevonAI-Speech-Emotion-Recognition

Identify the emotion of multiple speakers in an Audio Segment

Language: C - Size: 63.6 MB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 169 - Forks: 48

8g6-new/c_spectrogram

A high performance spectrogram with STFT Mel and MFCC support in pure C

Language: C - Size: 190 MB - Last synced at: 21 days ago - Pushed at: 22 days ago - Stars: 4 - Forks: 0

NeuroByte-Consulting/Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs

Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MFCCs)

Language: Jupyter Notebook - Size: 15.6 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 9 - Forks: 0

tympanix/subsync

Synchronize your subtitles using machine learning

Language: Python - Size: 468 KB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 153 - Forks: 16

Nickaine1/Music-Genre-Recognition

Music-genre-classification-using-deep-learning

Size: 3.91 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

FragIt/fragit-main

FragIt main repository

Language: Python - Size: 529 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 26 - Forks: 12

idaishe/Music-Genre-Recognition

Music-genre-classification-using-deep-learning

Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

x4nth055/emotion-recognition-using-speech

Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras

Language: Python - Size: 944 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 621 - Forks: 242

ZhuoZhuoCrayon/AcousticKeyBoard-Web

声学键盘｜❓脑洞大开：做一个能听懂键盘敲击键位的「玩具」，学习信号处理 / 深度学习 / 安卓 / Django。

Language: Python - Size: 68.3 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 86 - Forks: 5

axelkrnwn/indo-speech-classification

Indonesia word speech recognition using MFCC, PCA, and random forest

Language: Python - Size: 939 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

csukuangfj/kaldifeat

Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API

Language: C++ - Size: 10.3 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 197 - Forks: 38

LHPT2009/Music-Genre-Recognition

Music genre classification using deep learning

Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

dxspeeder/Music-Genre-Recognition

Music genre classification using deep learning

Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

SuperKogito/Voice-based-speaker-identification

:sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM

Language: Python - Size: 105 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 54 - Forks: 15

piruty/voice_actor_recog

Extract MFCC from movie files and detect speaker using it

Language: Python - Size: 372 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 5 - Forks: 0

SuperKogito/Voice-based-gender-recognition

:sound: :boy: :girl:Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)

Language: Python - Size: 8.96 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 213 - Forks: 68

DevinWSoTuff/Music-Genre-Recognition

Music genre classification using deep learning

Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

SaiSrujanReddyP/MachineLearning

Vivalyse is an AI-powered ML model that assesses confidence and clarity in viva speeches using NLP and audio processing. From MFCC and text embeddings like BERT, GloVe, etc., it focuses on confidence and clarity for classification. The model ensures objective and fair evaluations applicable in education, HR, and AI-driven hiring.

Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

adamstark/Gist

A C++ Library for Audio Analysis

Language: C++ - Size: 938 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 378 - Forks: 76

fmdb/audio-features

Python-based CLI application to generate various audio feature-vectors from MP3/FLAC files.

Language: Python - Size: 30.3 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

mathquis/node-personal-wakeword

Personal wake word detector

Language: JavaScript - Size: 103 KB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 63 - Forks: 8

FragJage/SpeakerVoiceIdentifier

SpeakerVoiceIdentifier can recognize the voice of a speaker by learning.

Language: C++ - Size: 20.3 MB - Last synced at: about 2 months ago - Pushed at: about 8 years ago - Stars: 33 - Forks: 14

justanotherinternetguy/XSpeech

XSpeech: A Novel Deep Learning Approach to Classifying Stutters

Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

HassanHayat08/Interpretable-CNN-for-Big-Five-Personality-Traits-using-Audio-Data

We developed an interpretable CNN for big five personality traits using human speech data. This project discovers the different frequency patterns of a human voice with respect to each five personality traits. This project will help us to understand the apparent personality of a human using his/her voice.

Language: Python - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 14 - Forks: 3

DataXujing/ASR-paper

:fire: ASR教程: https://dataxujing.github.io/ASR-paper/

Size: 1.07 GB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 24 - Forks: 6

woov2/Covid19_Classification_AI_Challenge

[경진대회] COVID-19 검출 AI 모델 개발

Language: Jupyter Notebook - Size: 12 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Mike014/Audio-Classification

This is a prototype Django application that allows users to upload audio files and classify them using machine learning techniques.

Language: Python - Size: 7.43 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

ShoYamanishi/AndroidMFCC

26-Point MFCC & 512-Point FFT Generator & Visualizer in Java, C++, and NEON intrinsics

Language: C++ - Size: 6.02 MB - Last synced at: 28 days ago - Pushed at: over 5 years ago - Stars: 15 - Forks: 2

pavlosdais/Music-Genre-Recognition

Music genre classification using deep learning

Language: Jupyter Notebook - Size: 1.98 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

php-guy55/ocx

Language: PHP - Size: 8.79 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

certainlyWrong/mfcc_bee

Implementação do algoritmo de extração de características em dart.

Language: Dart - Size: 332 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

hallowshaw/Speech-Emotion-Recognition-with-MFCC

A project to classify emotions like happiness, sadness, and anger from speech using MFCCs, machine learning models, and visualizations for audio features and model performance.

Language: Jupyter Notebook - Size: 969 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

brucewlee/LAMA-Music-Genre-Dataset

.wav files, training dataset (MFCC), and graph plots (FFTs, MFCCs, Waveforms) from Latin America, Asia, MiddleEast, and Africa

Language: Python - Size: 23.8 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

mradovic38/dtw-speech-recognition

Speech recognition system that uses feature extraction and dynamic time warping (DTW) to identify words and to find the most similar speaker.

Language: Python - Size: 29.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

baggepinnen/LPVSpectral.jl

Least-squares (sparse) spectral estimation and (sparse) LPV spectral decomposition.

Language: Julia - Size: 424 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 12 - Forks: 6

dhruvesh13/Audio-Genre-Classification

Automatic music genre classification using Machine Learning algorithms like- Logistic Regression and K-Nearest Neighbours

Language: Python - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 19 - Forks: 11

SeyedMuhammadHosseinMousavi/Persian-Classical-Music-Instrument-Recognition-PCMIR-Using-a-Novel-Persian-Music-Database

Persian Classical Music Instrument Recognition (PCMIR) Using a Novel Persian Music Database

Language: MATLAB - Size: 19.7 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

xgwang1119/Wireless_Standard_Identification_WSI

To generate the waveform demo, for paper "Wireless Standard Identification via Mel Frequency Cepstrum" in IEEE Communications Letters, vol. 26, no. 11, pp. 2656-2660, Nov. 2022

Language: MATLAB - Size: 69.3 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

loharmurtaza/FoG_detection_subject_dependent

This repository is based on my research work "Detecting Freezing of Gait in Parkinson's Disease Patients Using Multi-Modal Machine Learning"

Size: 626 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

JavierAntoran/moby_dick_whale_audio_detection

Feature extraction, HMMs, Neural Nets, and Boosting for Kaggle Cornell Whale detection challenge.

Language: Jupyter Notebook - Size: 36.1 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 1

ragibson/MFCC-speech-recognition

Real-time speech recognition via "Mel-Frequency Cepstral Coefficients" neural networks.

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: 23 days ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 0

mechanicalsea/spectra

Spectra extraction tutorials based on torch and torchaudio.

Language: Jupyter Notebook - Size: 3.31 MB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 40 - Forks: 4

PayalMh5/EmotionRecognition 📦

Emotion Recognition in Speech: This project leverages advanced machine learning techniques to classify emotions from speech using the Toronto Emotional Speech Set (TESS). By extracting Mel-Frequency Cepstral Coefficients (MFCC) and utilizing an LSTM-based deep learning model, the project accurately identifies emotions like anger, happiness, and sad

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

aubio/vamp-aubio-plugins

aubio plugins for Vamp

Language: C++ - Size: 440 KB - Last synced at: 29 days ago - Pushed at: over 7 years ago - Stars: 48 - Forks: 12

waldekmaciejko/utils

Various scripts for machine learning

Language: Jupyter Notebook - Size: 392 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

jaykejriwal/Stress-detection

Stress detection using non-semantic speech representation

Language: Jupyter Notebook - Size: 177 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

alicex2020/Mandarin-Tone-Classification

Deep learning using CNN for Mandarin Chinese tone classification

Language: Jupyter Notebook - Size: 489 KB - Last synced at: 10 months ago - Pushed at: about 6 years ago - Stars: 31 - Forks: 7

mathquis/node-gist

Node binding for the Gist Audio Analysis Library

Language: C++ - Size: 171 KB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 2

OzymandiasTheGreat/wakeword-zero Fork of mathquis/node-personal-wakeword 📦

Personal wake word detector, ported to TypeScript/WASM

Language: TypeScript - Size: 711 KB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

MycroftAI/sonopy

A simple audio feature extraction library

Language: Python - Size: 8.79 KB - Last synced at: 22 days ago - Pushed at: almost 6 years ago - Stars: 79 - Forks: 21

CodersAcademy006/Speech-Recognition-System

The objective of this DLM (Deep Learning Model) is to recognize the emotions from speech.

Language: Python - Size: 59.6 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

heoseongjin/Pytorch_Project

Pytorch, Yolov3, Cnn, Librosa, Mfcc

Language: Python - Size: 23.1 MB - Last synced at: 6 days ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 1

KK4TEE/audio-classification-tool

Visualize and listen to audio samples while creating a custom audio dataset for machine learning.

Language: Python - Size: 170 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

linto-ai/sfeatpy

Library to extract MFCC features from audio signal

Language: Python - Size: 18.6 KB - Last synced at: 21 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

jubinjacob03/genre-classification-recommendation_Spotify

Project for classifying audio files into different genres using the K-Nearest Neighbors (KNN) algorithm.

Language: Python - Size: 1.17 GB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

harshraj11584/Voice_Command_Recognition

[ML] [Audio Classification] Recognises Voice Commands from System Microphone, using MFCC, Random forests and MLPs

Language: Jupyter Notebook - Size: 49.5 MB - Last synced at: 11 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

ringabout/scim

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

Language: Nim - Size: 354 KB - Last synced at: 7 months ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 0

Papich23691/Speaker-Recognition

Speaker Recognition using MFCC feature vectors and GLA vector quantization models

Language: C - Size: 26.4 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 2

georgid/AlignmentDuration

Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.

Language: Python - Size: 342 MB - Last synced at: 12 months ago - Pushed at: about 5 years ago - Stars: 55 - Forks: 6