GitHub topics: speech-detection

Repositories

baochuquan/ios-vad

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: Swift - Size: 4.5 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 21 - Forks: 2

tympanix/subsync

Synchronize your subtitles using machine learning

Language: Python - Size: 468 KB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 155 - Forks: 16

smacke/ffsubsync

Automagically synchronize subtitles with video.

Language: Python - Size: 3.7 MB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 7,250 - Forks: 296

gtreshchev/RuntimeSpeechRecognizer 📦

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

Language: C++ - Size: 24.8 MB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 298 - Forks: 46

gkonovalov/android-vad

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: C - Size: 5.16 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 374 - Forks: 79

ina-foss/inaSpeechSegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Language: Python - Size: 36.6 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 815 - Forks: 139

sepnic/litevad

Speech-end detection library, based on WebRTC's VAD engine

Language: C - Size: 453 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 22 - Forks: 5

AimiliosKourpas/sound-signal-processing

A Python-based system for automatic word segmentation in speech using ML models like SVM, MLP, and RNN.

Language: Python - Size: 3.34 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

filippogiruzzi/voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

Language: Python - Size: 238 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 363 - Forks: 69

Danand/inaSpeechSegmenter-webui

`inaSpeechSegmenter` web UI.

Language: Python - Size: 5.86 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

DPigeon/SeeText

A mobile application that shows you what you say and objects around.

Language: Java - Size: 37.8 MB - Last synced at: 11 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

edusense/edusense

EduSense: Practical Classroom Sensing at Scale

Language: Python - Size: 242 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 11

bbc/bbc-speech-segmenter

A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.

Language: Shell - Size: 62.6 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 22 - Forks: 2

isbendiyarovanezrin/SpeechDetection

Speech Detection 💬

Language: CSS - Size: 26.4 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Mixa26/Spoken-digits-recognizer-with-dynamic-time-warping

Language: Jupyter Notebook - Size: 6.39 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

PranavPutsa1006/Speaker-Diarization

Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python

Language: Jupyter Notebook - Size: 20.2 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 1

andreahergert/speech_detection

30 Days of Javascript Day 20

Language: JavaScript - Size: 88.9 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Related Keywords

speech-detection 17 voice-activity-detection 8 speech-recognition 5 machine-learning 4 vad 4 speech-to-text 4 neural-networks 3 webrtc 3 audio-processing 3 gender-classification 2 audio-analysis 2 deep-neural-networks 2 android 2 computer-vision 2 deep-learning 2 speech 2 transgender 2 speech-segmentation 2 audio 2 subtitles 2 subtitle 2 mfcc 2 dnn 2 gmm 2 silero 2 on-device-ai 2 yamnet 2 voice-detection 2 voice-activity-detector 2 offline 2 silero-vad 2 real-time 2 mfcc-features 1 python 1 spectral-clustering 1 resnet 1 tensorflow 1 time-series 1 time-series-classification 1 voice-training 1 webui 1 embeddings-extraction 1 librispeech-dataset 1 librispeech 1 deeplearning 1 speech-transcription 1 artificial-intelligence 1 svm 1 rnn 1 python3 1 mlp 1 fundamental-frequency 1 audio-classification 1 javascript 1 speaker-diarization 1 spoken-digits-recognition 1 spoken-digits 1 sound 1 digit-recognition 1 web-speech-api 1 vanilla-javascript 1 javascript30 1 x-vectors 1 endpoint-detection 1 automatic-speech-recognition 1 tracking 1 teachers 1 sensing 1 posture 1 pedagogy 1 instructors 1 hand-raise 1 gaze 1 classroom 1 object-detection 1 mobile 1 languages 1 definitions 1 ctext 1 ai 1 ue4 1 speech-processing 1 openai 1 vlc-media-player 1 vlc 1 video 1 synchronization 1 sync 1 string-alignment 1 srt-subtitles 1 srt 1 fft 1 ffmpeg 1 fast-fourier-transform 1 captions 1 caption 1 alignment 1 subsync 1 shift-subtitle 1 shift 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos