GitHub topics: voice-processing
Picovoice/web-voice-processor
A library for real-time voice processing in web browsers
Language: TypeScript - Size: 2.59 MB - Last synced at: 15 days ago - Pushed at: 4 months ago - Stars: 220 - Forks: 22

AmirMahdyJebreily/Microphone-quality-evaloution
Live microphone quality detection system in browser Js
Language: JavaScript - Size: 122 KB - Last synced at: 27 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

Erfanafshar/speech-gender-detection
An audio signal processing project that detects speaker gender from recorded voice samples and enhances speech using spectral subtraction techniques in MATLAB.
Language: MATLAB - Size: 4.02 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

madhurimarawat/AI-CallConnect
A cutting-edge AI-powered phone agent designed for seamless voice interactions, dynamic data handling, and scalable communication. Perfect for modern sales and customer engagement solutions.
Language: Jupyter Notebook - Size: 21.6 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 2

kristofferv98/VoiceProcessingToolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications
Language: Python - Size: 34.3 MB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

hhoangphuoc/R2D2TimbreTransfer
Timbre Transfer for R2D2-alike Robot voice turning into instrument using Diffusion Model
Language: Python - Size: 31.3 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

kristofferv98/SemanthaVoiceAssistant
A comprehensive AI companion leveraging advanced semantic analysis, sentiment detection, and voice processing to provide personalized and context-aware interactions using Autogen, semantic-router, and VoiceProcessingToolkit.
Language: Python - Size: 85 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

sezer-muhammed/Anadolu-Ajans--Medya-Teknolojileri-hackathon
AI-powered platform for creative content generation and management, featuring advanced AI integrations, seamless accessibility, and community collaboration.
Language: Python - Size: 1.16 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

Ceviess/tgvoice2text
A Telegram bot that processes voice messages using Sber's speech recognition API. This bot converts audio formats, generates authentication tokens, and transcribes voice messages into text, enabling seamless communication via Telegram.
Language: Python - Size: 11.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mohammad-safari/Speech-Spectral-Substraction-and-Noise-Remove
Final_Project_of_Siganls_&_Sytems_Spring_1401
Language: Jupyter Notebook - Size: 23.8 MB - Last synced at: 7 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Gordon-Yeh/Memory-Frame
🖼️ framed picture cloud base smart photo frame with voice activation paired with an android app
Language: Java - Size: 20.7 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 1

Namratha2301/dogcat
Web Application that Identifies Animal from their Sound. Right now restricted to binary classification between cat and dog sounds.
Language: PureBasic - Size: 10.3 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

simonnchong/Human-Voice-Segmentation
This is an algorithm to identify human voice and do segmentation automatically. The result will be compared to the manual segmentation data, then a accuracy report will be generated based on match rate, insertion rate and omission rate.
Language: MATLAB - Size: 16.7 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Kammann123/vocoder
Coursework 1 of the Voice Signal Processing course at ITBA. Real-time LPC Vocoder written in Python
Language: Jupyter Notebook - Size: 28 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

KenyonY/guang
Universal Function Library of Scientific Calculation
Language: Jupyter Notebook - Size: 88.4 MB - Last synced at: 3 days ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Chintan2108/Consumer-Complaint-Classification-OPEN-AI
This repository is made in lieu of submission towards the solution of problem statement 2 of the OPEN AI NLP hackathon. The objective here is to classify the voice recordings of a call center proceeding by treating them as consumer complaints into the said categories of the automotive industry.
Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0
