GitHub topics: speech-detection
gtreshchev/RuntimeSpeechRecognizer 📦
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
Language: C++ - Size: 24.8 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 297 - Forks: 47

ina-foss/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Language: Python - Size: 36.6 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 815 - Forks: 139

smacke/ffsubsync
Automagically synchronize subtitles with video.
Language: Python - Size: 3.7 MB - Last synced at: 22 days ago - Pushed at: 4 months ago - Stars: 7,188 - Forks: 295

gkonovalov/android-vad
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Language: C - Size: 5.18 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 342 - Forks: 76

baochuquan/ios-vad
iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Language: Swift - Size: 4.5 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 15 - Forks: 0

sepnic/litevad
Speech-end detection library, based on WebRTC's VAD engine
Language: C - Size: 453 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 22 - Forks: 5

AimiliosKourpas/sound-signal-processing
A Python-based system for automatic word segmentation in speech using ML models like SVM, MLP, and RNN.
Language: Python - Size: 3.34 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

filippogiruzzi/voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
Language: Python - Size: 238 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 363 - Forks: 69

tympanix/subsync
Synchronize your subtitles using machine learning
Language: Python - Size: 468 KB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 153 - Forks: 16

Danand/inaSpeechSegmenter-webui
`inaSpeechSegmenter` web UI.
Language: Python - Size: 5.86 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

DPigeon/SeeText
A mobile application that shows you what you say and objects around.
Language: Java - Size: 37.8 MB - Last synced at: 10 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

edusense/edusense
EduSense: Practical Classroom Sensing at Scale
Language: Python - Size: 242 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 11

bbc/bbc-speech-segmenter
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
Language: Shell - Size: 62.6 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 22 - Forks: 2

isbendiyarovanezrin/SpeechDetection
Speech Detection 💬
Language: CSS - Size: 26.4 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Mixa26/Spoken-digits-recognizer-with-dynamic-time-warping
Language: Jupyter Notebook - Size: 6.39 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

PranavPutsa1006/Speaker-Diarization
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Language: Jupyter Notebook - Size: 20.2 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 1

andreahergert/speech_detection
30 Days of Javascript Day 20
Language: JavaScript - Size: 88.9 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
