speech-activity-detection | Topic

Topic: "speech-activity-detection"

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook - Size: 252 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 7,671 - Forks: 889

jtkim-kaist/VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Language: MATLAB - Size: 261 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 854 - Forks: 234

ina-foss/inaSpeechSegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Language: Python - Size: 36.6 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 815 - Forks: 139

RicherMans/GPV

Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper

Language: Python - Size: 8.85 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 142 - Forks: 29

RicherMans/Datadriven-GPVAD

The codebase for Data-driven general-purpose voice activity detection.

Language: Python - Size: 20.7 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 84 - Forks: 21

bigcash/awesome-vad

A curated list of awesome voice activity detection

Size: 9.77 KB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 59 - Forks: 3

HHousen/speaker-change-detection

Speaker change detection using SincNet and an LSTM/Transformer

Language: Jupyter Notebook - Size: 7.83 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 51 - Forks: 6

jsvir/vad

[Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection

Language: Python - Size: 1.71 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 26 - Forks: 3

vimalmanohar/kaldi Fork of kaldi-asr/kaldi

Fork of the official kaldi.

Language: Shell - Size: 121 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 23 - Forks: 3

dangvansam/pyannote-onnx Fork of pyannote/pyannote-audio

PyAnnote with ONNX model

Language: Jupyter Notebook - Size: 273 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Related Topics

voice-activity-detection 10 machine-learning 5 speech 5 pytorch 5 vad 4 speech-processing 3 speech-recognition 3 speaker-change-detection 2 speech-segmentation 2 audio-segmentation 2 speaker-diarization 2 python 2 audio-processing 2 noise-robust 2 gender 2 sad 2 speaker-gender 2 lstm 2 gender-prediction 1 gender-representation 1 radio 1 speech-corpus 1 speech-dataset 1 tv 1 noise-robust-asr 1 speaker-embedding 1 gender-bias 1 dataset 1 corpus 1 benchmark 1 audiovisual-dataset 1 audio-dataset 1 acoustic-diversity 1 list 1 awesome 1 whisper 1 web-capabilities 1 bilstm 1 pretrained-models 1 overlapped-speech-detection 1 transgender 1 speech-music 1 speech-detection 1 segmentation 1 praat 1 noise 1 music-detection 1 music 1 speaker-recognition 1 speaker-verification 1 mirex 1 male 1 gender-equality 1 gender-classification 1 female 1 speaker-segmentation 1 audio-analysis 1 signal-processing 1 transformers 1 sound-activity 1 voice-ac 1 speech-separation 1 pyannote 1 onnx 1 audio-splitter 1 audio-split 1 tinyml 1 sparse-representation 1 edge-computing 1 hybrid-model 1 depression-detection 1 deep-learning 1 daic-woz 1 speaker-identification 1 scoring-code 1 transfer-learning 1 semi-supervised-learning 1 multilingual-speech-recognition 1 lightly-supervised-training 1 domain-adaptation 1 visual-speech-recognition 1 neural-network 1 text-to-speech 1 ring-buffer 1 pure-javascript 1 prompt-engineering 1 openai-api-chatbot 1 openai-api 1 llm 1 larsonscanner 1 knight-rider-effect 1 flask 1 faster-whisper 1 elevenlabs 1 conversational-ai 1 conversation 1 voice-detection 1 dnn 1 data 1 bdnn 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos

Topic: "speech-activity-detection"

pyannote/pyannote-audio

jtkim-kaist/VAD

ina-foss/inaSpeechSegmenter

RicherMans/GPV

RicherMans/Datadriven-GPVAD

bigcash/awesome-vad

HHousen/speaker-change-detection

jsvir/vad

vimalmanohar/kaldi Fork of kaldi-asr/kaldi

idiap/zff_vad

ina-foss/InaGVAD

AmirHoseein99/Depression-Engine

aditya-joglekar/FS02_Scoring_Toolkit

rafaelgreca/voxseg-pytorch

KF-R/turk-chat

sajR/V-SAD

dangvansam/pyannote-onnx Fork of pyannote/pyannote-audio