Topic: "speech-activity-detection"
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook - Size: 252 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7,671 - Forks: 889

jtkim-kaist/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Language: MATLAB - Size: 261 MB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 854 - Forks: 234

ina-foss/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Language: Python - Size: 36.6 MB - Last synced at: 29 days ago - Pushed at: 5 months ago - Stars: 802 - Forks: 138

RicherMans/GPV
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
Language: Python - Size: 8.85 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 142 - Forks: 29

RicherMans/Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
Language: Python - Size: 20.7 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 84 - Forks: 21

HHousen/speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
Language: Jupyter Notebook - Size: 7.83 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 51 - Forks: 6

bigcash/awesome-vad
A curated list of awesome voice activity detection
Size: 9.77 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 50 - Forks: 2

jsvir/vad
[Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection
Language: Python - Size: 1.71 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 26 - Forks: 3

vimalmanohar/kaldi Fork of kaldi-asr/kaldi
Fork of the official kaldi.
Language: Shell - Size: 121 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 23 - Forks: 3

idiap/zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Language: Python - Size: 631 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 1

ina-foss/InaGVAD
Voice activity detection and speaker gender segmentation audiovisual corpus
Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 13 - Forks: 1

AmirHoseein99/Depression-Engine
Detecting depressed Patient based on Speech Activity, Pauses in Speech and Using Deep learning Approach
Language: Python - Size: 1.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 2

aditya-joglekar/FS02_Scoring_Toolkit
Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks
Language: Python - Size: 6.17 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 3

rafaelgreca/voxseg-pytorch
The Voxseg implementation in PyTorch. Voxseg is a python library for voice activity detection (VAD) for speech/non-speech segmentation.
Language: Python - Size: 35.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

KF-R/turk-chat
Lightweight speech-to-speech web-based chat app combining speech recognition, LLM completion and text-to-speech. Implemented with Python (Flask) and vanilla JavaScript only.
Language: Python - Size: 296 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

sajR/V-SAD
Language: Python - Size: 36.1 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

dangvansam/pyannote-onnx Fork of pyannote/pyannote-audio
PyAnnote with ONNX model
Language: Jupyter Notebook - Size: 273 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
