An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: silero-vad

TEN-framework/ten-vad

TEN VAD: low-latency high-performance Voice Activity Detector

Language: C - Size: 9.58 MB - Last synced at: about 2 hours ago - Pushed at: about 6 hours ago - Stars: 435 - Forks: 38

helloooideeeeea/RealTimeCutVADLibraryForAndroid

Real-time Voice Activity Detection (VAD) library for Android using Silero models powered by ONNX Runtime. Includes advanced noise suppression and audio preprocessing with WebRTC APM, supporting seamless WAV data output with header metadata.

Language: Kotlin - Size: 4.09 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

pranjal-pravesh/stt-silero-whisper

Real-time speech to text using voice activity detection (with silero-VAD) and transcriptions using faster-whisper model

Language: Python - Size: 35.2 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

AbdullahHendy/live-translation

Real-time speech-to-text translation over WebSocket. Streams raw PCM audio from client to server for live transcription and optional translation. Supports CLI and Python API.

Language: Python - Size: 21.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 1

Swap98-Coder/mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

helloooideeeeea/RealTimeCutVADCXXLibrary

C++ implementation of real-time Voice Activity Detection (VAD) using Silero models with ONNX Runtime and WebRTC Audio Processing. Provides precise voice segmentation and cross-platform XCFramework support.

Language: C++ - Size: 4.05 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 6 - Forks: 3

Saga9103/t2yLLM

A voice assistant with local LLM as a backend

Language: Python - Size: 422 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0

helloooideeeeea/RealTimeCutVADLibrary

A real-time Voice Activity Detection (VAD) library for iOS and macOS using Silero models powered by ONNX Runtime. Includes advanced noise suppression and audio preprocessing with WebRTC APM, supporting seamless WAV data output with header metadata.

Language: Swift - Size: 3.97 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 7

gkonovalov/android-vad

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: C - Size: 5.18 MB - Last synced at: 13 days ago - Pushed at: 4 months ago - Stars: 342 - Forks: 76

guynich/vad_eval_comparison

Test comparison of two VAD models with English speech dataset

Language: Python - Size: 316 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 2 - Forks: 0

guynich/vad_eval_curves

AUC curve metrics example

Language: Python - Size: 276 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

Kazuhito00/Silero-VAD-ONNX-Sample

Silero VADのONNX推論(PyTorch依存処理無し)サンプル

Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: 5 days ago - Pushed at: 27 days ago - Stars: 1 - Forks: 0

baochuquan/ios-vad

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: Swift - Size: 4.5 MB - Last synced at: 24 days ago - Pushed at: 7 months ago - Stars: 15 - Forks: 0

ZygoteCode/VadSharp

Enterprise VAD (Voice Activity Detection) in C#.NET (.NET 6.0+) with Microsoft.ML.Net, ONNXRuntime and DirectML. The easiest, efficient, and performant Silero VAD implementation! Always open for PRs.

Language: C# - Size: 354 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 12 - Forks: 1

mgoltzsche/ai-assistant-vui

Experimental voice user interface (VUI) to interact with an AI assistant

Language: Go - Size: 7 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

JRWSP/SileroVAD_for_Whisper-cpp

Python script for detect silences with Silero-VAD and transcribing with the whisper AI model.

Language: Python - Size: 33.2 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 0

mbotsu/mlx_speech2text

Audio transcription using mlx whisper and vad silence processing

Language: Python - Size: 18.6 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 14 - Forks: 1

DictationDaddy/VAD_WEB_DEMO

In this repository, I show you how to use SILERO VAD with ONNX-WEB runtime to run the VAD compeletely in the browser.

Language: JavaScript - Size: 2 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 20 - Forks: 1

numq/voice-activity-detection

JVM library for voice activity detection written in Kotlin based on C library fvad and Silero

Language: Kotlin - Size: 2.94 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lukaszliniewicz/whisperX_silero Fork of m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) with support for Silero VAD.

Language: Python - Size: 23 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

chenqianhe/VAD-addon

This repo provides an addon that can perform VAD model reasoning in nodes and electric environments, based on cmake-js and Fastdeploy. Silero VAD is a pre-trained enterprise-grade Voice Activity Detector.

Language: C++ - Size: 174 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 1

DictationDaddy/VAD

It's typescript based VAD that uses silero ai VAD under the hood. It's highly robust for Voice Activity Detection. It only works in the browser.

Language: TypeScript - Size: 1.85 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

IntendedConsequence/vadc

Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech

Language: C++ - Size: 8.45 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

ckaznable/yt-cli-live

Youtube Text Live Streaming in CLI

Language: Rust - Size: 905 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

botisan-ai/whisper-aws-stack

Deplay Whisper on AWS Scalably

Language: Python - Size: 190 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

qkdxorjs1002/silero-vad-restapi-demo

A demo project to test silero-vad using REST API

Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 3