GitHub topics: audio-processing-with-python

Repositories

mwasifanwar/VoicePrint-ID

Advanced speaker identification and verification system using deep learning. Features emotion recognition, language detection, and anti-spoofing capabilities for secure voice authentication applications.

Language: Python - Size: 60.5 KB - Last synced at: about 8 hours ago - Pushed at: about 10 hours ago - Stars: 0 - Forks: 0

NotAbhinavGamerz/emotion-aware-automatic-speech-recognition

🎤 Enhance speech recognition by detecting emotions in spoken language, combining OpenAI's Whisper and emotion analysis for deeper insights.

Language: Python - Size: 1.46 MB - Last synced at: about 17 hours ago - Pushed at: about 19 hours ago - Stars: 0 - Forks: 1

DevArqf/VoiceGuard

🛡️ Advanced Voice Authentication System using OpenAI ChatGPT & Whisper APIs. Secure voice biometric identification with AI-powered analysis, multi-sample enrollment, and enterprise-grade authentication logging. Python-based with SQLite database.

Language: Python - Size: 66.4 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3 - Forks: 0

krushna4141/VoiceGuard

🔒 Enhance security with VoiceGuard, an AI-driven voice authentication system powered by OpenAI’s ChatGPT and Whisper for reliable voice identification.

Language: Python - Size: 38.1 KB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

antarades/emotion-aware-automatic-speech-recognition

An intelligent speech recognition system that combines OpenAI's Whisper for accurate transcription with dual emotion detection models. Analyzes both audio characteristics (tone, pitch, intensity) and textual content to provide comprehensive emotional context alongside transcriptions.

Language: Python - Size: 199 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Amin-moniry-pr7/Speech-to-Text-Transcription

This project automates audio processing by removing silence, transcribing speech to text, and storing the output in an SQLite database. It supports multiple audio formats and leverages Google Speech Recognition for high accuracy.

Language: Python - Size: 2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

HealthTrack-app/custom_KWS

Custom KWS pipeline for single-speaker keyword spotting with CNN and MFCCs, 1s 16kHz audio, rich augmentations, and TensorFlow exports (.h5, .tflite) 🐱💻

Language: Python - Size: 313 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

moego0/custom_KWS

End-to-end pipeline for training a custom keyword detection model with TensorFlow & TFLite expor

Language: Python - Size: 86.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

akspa0/The-Machine

Multimedia context generation tool using off-the-shelf components. Leverages several local ML/AI tools to accomplish transcription, context clues, and llm-driven tasks. Designed with extensibility in mind. Dataset preparation tool. Adds context to video and audio inputs.

Language: Python - Size: 2.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

ap-atul/Audio-Denoising

Noise removal/ reducer from the audio file in python. De-noising is done using Wavelets and thresholding is done by VISU Shrink thresholding technique

Language: Python - Size: 26.4 KB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 190 - Forks: 21

JanWilczek/dspyplot

Convenience functions for commonly used digital signal processing plots.

Language: Python - Size: 409 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

RubisetCie/spectrogram-converter Fork of muhdhuz/audio2spec

Scripts to convert audio files to spectrograms and back.

Language: Python - Size: 17.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

glau-bd/audio-fingerprinting-fyp

FYP project of Gerald Lau, submitted to the Nanyang Technological University in partial fulfillment of the requirements for the Degree of Bachelor of Engineering (Computer Science). An application to embed links into the audio track of videos, using audio watermarking and audio fingerprinting technology.

Language: Vue - Size: 3.6 MB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 3

Thato-Mot/bird-species-classification-app

A sophisticated web application that identifies bird species from audio recordings using deep learning. Features multiple neural network models (MobileNetV2/VGG16), real-time audio visualization, and window-based analysis system. Built with Flask, TensorFlow, and Librosa.

Language: Python - Size: 179 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Chandana-20/Conversation-Mixer-Tool

Conversation-Mixer-Tool: A Python utility to merge two audio files (caller and receiver) into a seamless, conversation-like output. Features include speech detection, bandpass filtering, noise reduction, and smooth audio transitions.

Language: Python - Size: 4.84 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

dhiogoboza/audio-convolution

🐍 :sound: | Audio convolution with python

Language: Python - Size: 14.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

PavlosIsaris/music-file-transformer

A simple python script that recursively searches for files and transforms them to mp3, using ffmpeg.

Language: Python - Size: 12.7 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

davidtkeane/Jervis-ChatGPT

This Python script is for a voice interface chatbot named Jervis. It uses OpenAI's GPT-3.5-turbo-instruct model to respond to user input. The chatbot responds by Elevenlabs Voices. Conversation are saved to MongoDB, and MP3 file local and can be emailed if needed.

Language: Python - Size: 928 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

migperfer/harmonic_compatibility

Repo of my Master Thesis in Pompeu Fabra University: Harmonic Compatibility for Loops in Electronic Music (demo website might take a little bit to load)

Language: Python - Size: 2.08 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 8 - Forks: 1

konradmaciejczyk/audio-signal-preprocessing-for-ml-classification-models

Language: Jupyter Notebook - Size: 4.67 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

makerportal/rpi_i2s

Raspberry Pi I2S Stereo Microphone Analyses in Python

Language: Python - Size: 985 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 15 - Forks: 5

rahuls98/audio-tagging

Audio tagging is the process of inferring descriptive labels from audio clips (Multi label classification task). This repository contains exploratory code/scripts for audio preprocessing and model fitting for the task of audio tagging and its applications.

Language: Jupyter Notebook - Size: 7 MB - Last synced at: 9 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

makerportal/quadmic

QuadMic Python Scripts for 4-microphone array audio analysis

Language: Python - Size: 10.7 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

DebdutBiswas/audio-processing-with-python

e-yantra robotics competetion audio processing with python

Language: Python - Size: 5.32 MB - Last synced at: over 2 years ago - Pushed at: almost 9 years ago - Stars: 1 - Forks: 1

Related Keywords

audio-processing-with-python 24 audio-processing 13 python 7 audio 6 deep-learning 5 speech-recognition 5 machine-learning 4 python3 4 nlp 3 ai 3 tensorflow 3 speech-to-text 3 whisper 3 music 2 openai 2 security 2 sqllite 2 voice-identification 2 acoustics 2 voice-recognition 2 audio-augmentation 2 cnn-model 2 edge-ai 2 esc50 2 keras 2 keyword-spotting 2 mfcc 2 tflite 2 voice-detection 2 raspberry-pi 2 microphone 2 ffmpeg 2 emotion-ai 2 emotion-detection 2 emotion-detection-emotion-classification 2 python-projects 2 sentiment-analysis 2 sentiment-analysis-model 2 speech-recognition-model 2 whisper-asr-model 2 authentication 2 biomet 2 chatgpt 2 cli 2 elevenlabs-api 1 gmail-smtp 1 mp3 1 openai-api 1 openai-chatgpt 1 pymongo-database 1 smtplib 1 creativity 1 loops 1 feature-engineering 1 feature-extraction 1 visualization 1 dotenv 1 chatbot 1 pyhon3 1 python-wave-lib 1 ffmpeg-python 1 wav-audio 1 audio-effects 1 audio-effect 1 audio-convolution-tests 1 audio-convolution 1 audio-noisereduce 1 audio-mixing 1 rpi4 1 rpi-i2s 1 audio-analysis 1 audio-tagging 1 deep-neural-networks 1 image-processing 1 librosa 1 multi-label-classification 1 python-visualization 1 microphone-array 1 mems-microphone 1 microphone-array-processing 1 mems 1 inmp441 1 i2s-microphone 1 i2s-audio 1 i2s 1 plot 1 signal-processing 1 e-yantra 1 gtzan-dataset 1 gtzan 1 python-audio-processing 1 audio-denoising 1 stt 1 speech-processing 1 pytorch 1 pyannote-audio 1 parakeet 1 lmstudio 1 dataset-generator 1 cuda 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos