silero-vad | Topic | Ecosyste.ms: Repos

Topic: "silero-vad"

TEN-framework/ten-vad

Voice Activity Detector (VAD) from TEN: low-latency, high-performance and lightweight

Language: C - Size: 9.55 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 972 - Forks: 88

gkonovalov/android-vad

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: C - Size: 5.16 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 374 - Forks: 79

helloooideeeeea/RealTimeCutVADLibrary

A real-time Voice Activity Detection (VAD) library for iOS and macOS using Silero models powered by ONNX Runtime. Includes advanced noise suppression and audio preprocessing with WebRTC APM, supporting seamless WAV data output with header metadata.

Language: Swift - Size: 3.97 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 29 - Forks: 11

DictationDaddy/VAD_WEB_DEMO

In this repository, I show you how to use SILERO VAD with ONNX-WEB runtime to run the VAD completely in the browser.

Language: JavaScript - Size: 2 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 20 - Forks: 1

baochuquan/ios-vad

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: Swift - Size: 4.5 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 15 - Forks: 0

mbotsu/mlx_speech2text

Audio transcription using mlx whisper and vad silence processing

Language: Python - Size: 18.6 KB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 14 - Forks: 1

helloooideeeeea/RealTimeCutVADCXXLibrary

C++ implementation of real-time Voice Activity Detection (VAD) using Silero models with ONNX Runtime and WebRTC Audio Processing. Provides precise voice segmentation and cross-platform XCFramework support.

Language: C++ - Size: 4.05 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 10 - Forks: 6

chenqianhe/VAD-addon

This repo provides an addon that can perform VAD model inference in Node.js and Electron environments, based on cmake-js and FastDeploy. Silero VAD is a pre-trained enterprise-grade Voice Activity Detector.

Language: C++ - Size: 174 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 1

guynich/vad_eval_comparison

Test comparison of two VAD models with English and multilingual speech datasets

Language: Python - Size: 2.11 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 8 - Forks: 0

mgoltzsche/ai-assistant-vui

Experimental voice user interface (VUI) to interact with an AI assistant

Language: Go - Size: 499 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 8 - Forks: 0

Saga9103/t2yLLM

A voice assistant with local LLM as a backend

Language: Python - Size: 213 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 6 - Forks: 0

ckaznable/yt-cli-live

YouTube Text Live Streaming in CLI

Language: Rust - Size: 905 KB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

qkdxorjs1002/silero-vad-restapi-demo

A demo project to test silero-vad using REST API

Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 3

AbdullahHendy/live-translation

Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client to server for live transcription and optional translation. Supports CLI and Python API.

Language: Python - Size: 23.7 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 3

JRWSP/SileroVAD_for_Whisper-cpp

Python script for detecting silences with Silero-VAD and transcribing with the Whisper AI model.

Language: Python - Size: 33.2 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Swap98-Coder/mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Size: 1.95 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 4 - Forks: 0

IntendedConsequence/vadc

Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech

Language: C++ - Size: 8.45 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

botisan-ai/whisper-aws-stack

Deploy Whisper on AWS Scalably

Language: Python - Size: 190 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

helloooideeeeea/RealTimeCutVADLibraryForAndroid

Real-time Voice Activity Detection (VAD) library for Android using Silero models powered by ONNX Runtime. Includes advanced noise suppression and audio preprocessing with WebRTC APM, supporting seamless WAV data output with header metadata.

Language: Kotlin - Size: 4.09 MB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

Kazuhito00/Silero-VAD-ONNX-Sample

Silero VAD ONNX inference sample (no PyTorch-dependent processing)

Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

DictationDaddy/VAD

It's a TypeScript-based VAD that uses Silero VAD under the hood. It's highly robust for Voice Activity Detection. It only works in the browser.

Language: TypeScript - Size: 1.85 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

konstantin-eu/langrepeater

A Python-based stack for German language learning, leveraging STT, TTS, and ML to generate custom media where German phrases are detected, translated, and repeated for immersive listening during daily routines. Supports markdown text from LLMs and WAV audio inputs, producing subtitled audio/video files compatible with a custom Android player.

Language: Python - Size: 6.83 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

danieladdisonorg/Vocal-Agent

A sophisticated real-time voice assistant that seamlessly integrates speech recognition, AI reasoning, and neural text-to-speech synthesis. It is designed for natural conversational interactions with advanced tool-calling capabilities.

Language: Python - Size: 428 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

pranjal-pravesh/stt-silero-whisper

Real-time speech to text using voice activity detection (with silero-VAD) and transcriptions using faster-whisper model

Language: Python - Size: 35.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

guynich/vad_eval_curves

AUC curve metrics example

Language: Python - Size: 276 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ZygoteCode/VadSharp

Enterprise VAD (Voice Activity Detection) in C#.NET (.NET 6.0+) with Microsoft.ML.Net, ONNXRuntime and DirectML. The easiest, efficient, and performant Silero VAD implementation! Always open for PRs.

Language: C# - Size: 354 KB - Last synced at: 20 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

numq/voice-activity-detection

JVM library for voice activity detection written in Kotlin based on C library fvad and Silero

Language: Kotlin - Size: 2.94 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lukaszliniewicz/whisperX_silero (fork of m-bain/whisperX)

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) with support for Silero VAD.

Language: Python - Size: 23 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0
