An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: speech-detection

gtreshchev/RuntimeSpeechRecognizer 📦

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on OpenAI's Whisper technology (whisper.cpp).

Language: C++ - Size: 24.8 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 297 - Forks: 47

ina-foss/inaSpeechSegmenter

CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

Language: Python - Size: 36.6 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 815 - Forks: 139

smacke/ffsubsync

Automagically synchronize subtitles with video.

Language: Python - Size: 3.7 MB - Last synced at: 22 days ago - Pushed at: 4 months ago - Stars: 7,188 - Forks: 295

gkonovalov/android-vad

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: C - Size: 5.18 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 342 - Forks: 76

baochuquan/ios-vad

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: Swift - Size: 4.5 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 15 - Forks: 0

sepnic/litevad

Speech-end detection library, based on WebRTC's VAD engine

Language: C - Size: 453 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 22 - Forks: 5

AimiliosKourpas/sound-signal-processing

A Python-based system for automatic word segmentation in speech using ML models like SVM, MLP, and RNN.

Language: Python - Size: 3.34 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

filippogiruzzi/voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

Language: Python - Size: 238 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 363 - Forks: 69

tympanix/subsync

Synchronize your subtitles using machine learning

Language: Python - Size: 468 KB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 153 - Forks: 16

Danand/inaSpeechSegmenter-webui

`inaSpeechSegmenter` web UI.

Language: Python - Size: 5.86 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

DPigeon/SeeText

A mobile application that shows you what you say and the objects around you.

Language: Java - Size: 37.8 MB - Last synced at: 10 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

edusense/edusense

EduSense: Practical Classroom Sensing at Scale

Language: Python - Size: 242 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 11

bbc/bbc-speech-segmenter

A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.

Language: Shell - Size: 62.6 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 22 - Forks: 2

isbendiyarovanezrin/SpeechDetection

Speech Detection 💬

Language: CSS - Size: 26.4 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Mixa26/Spoken-digits-recognizer-with-dynamic-time-warping

Language: Jupyter Notebook - Size: 6.39 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

PranavPutsa1006/Speaker-Diarization

Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python

Language: Jupyter Notebook - Size: 20.2 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 1

andreahergert/speech_detection

30 Days of JavaScript, Day 20

Language: JavaScript - Size: 88.9 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0