GitHub topics: silero

Repositories

lukaszliniewicz/Pandrator

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

Language: Python - Size: 8.11 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 444 - Forks: 35

gkonovalov/android-vad

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: C - Size: 5.18 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 323 - Forks: 69

Navatusein/Silero-TTS-Service

Silero TTS backend service. Can be used with Home Assistant and Rhasspy.

Language: Python - Size: 387 KB - Last synced at: 10 days ago - Pushed at: 10 months ago - Stars: 43 - Forks: 10

baochuquan/ios-vad

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: Swift - Size: 4.5 MB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 10 - Forks: 0

DictationDaddy/VAD_WEB_DEMO

In this repository, I show you how to use SILERO VAD with ONNX-WEB runtime to run the VAD compeletely in the browser.

Language: JavaScript - Size: 2 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 20 - Forks: 1

Ave-Sergeev/Dictator

Speech-to-Text translation service (Rust, Tonic) (2025)

Language: Rust - Size: 49.3 MB - Last synced at: 18 days ago - Pushed at: 28 days ago - Stars: 6 - Forks: 0

POMXARK/SmartDictor_0.1_Nuitka_cleer

Распознание и озвучивание голосовым движком текста с экрана.

Language: Python - Size: 5.38 MB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

eja/wav2vad

A command line tool for voice activity detection.

Language: C - Size: 21.1 MB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

numq/noise-reduction

JVM library for noise reduction written in Kotlin based on the ML model Silero

Language: Kotlin - Size: 137 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

numq/voice-activity-detection

JVM library for voice activity detection written in Kotlin based on C library fvad and Silero

Language: Kotlin - Size: 2.94 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

muclana/audiobook-creator

Audiobook Creator is an open-source tool that converts books (EPUB, PDF, TXT) into fully voiced audiobooks with intelligent character voice attribution. It uses NLP, LLMs, and Kokoro TTS to generate engaging, multi-voice audiobooks. Features include text cleaning, character identification, and customizable narration. Licensed under GPL-3.0.

Size: 1000 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

atomlayer/llama_cute_voice_assistant

Llama cute voice assistant

Language: Python - Size: 573 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 1

GhostNaN/silero-webui

Silero TTS web UI

Language: Python - Size: 172 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 12 - Forks: 1

twirapp/silero-tts-api-server

This is a simple server that uses Silero models to convert text to audio files over HTTP

Language: Python - Size: 488 KB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 13 - Forks: 4

DictationDaddy/VAD

It's typescript based VAD that uses silero ai VAD under the hood. It's highly robust for Voice Activity Detection. It only works in the browser.

Language: TypeScript - Size: 1.85 MB - Last synced at: 29 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

BenAAndrew/speech-transcriber

A web-app/library for transcribing speech

Language: Python - Size: 796 KB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

daswer123/silero-tts-enhanced

Silero TTS Enhanced is a Python library that enhances the original Silero TTS project, providing a convenient way to synthesize speech from text using Silero TTS models. It offers a user-friendly interface for both standalone script usage and integration into Python projects, along with additional features

Language: Python - Size: 46.9 KB - Last synced at: 9 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

Related Keywords

silero 20 tts 5 voice-activity-detection 5 silero-vad 5 vad 4 offline 3 llm 3 audio-processing 2 dnn 2 gmm 2 neural-networks 2 on-device-ai 2 real-time 2 speech-detection 2 tts-api 2 voice-activity-detector 2 voice-detection 2 webrtc 2 yamnet 2 sileros-tts 2 python 2 ai 2 ml 2 kotlin 2 whisper 2 ffmpeg 2 oobabooga 2 rvc 2 browser 2 torch 2 text-processing 2 text-to-speech 2 vosk 2 voice-cloning 2 xtts 2 assistant 2 styletts2 1 llama2 1 llama 1 telegram 1 gradio 1 pyrogram-bot 1 pdf-to-audiobook 1 multilingual 1 m4b 1 epub 1 onnx 1 libfvad 1 jvm 1 jni 1 java 1 fvad 1 cpp 1 videoacceleration 1 video 1 mkvmerge 1 speech-to-text 1 num2words 1 aiogram 1 vtuber 1 aivtube 1 transcription 1 librispeech 1 cmu-sphinx 1 voiceactivitydetection 1 typescript 1 onnx-webruntime 1 npm 1 server 1 restapi 1 python39 1 python312 1 litestar-framework 1 litestar-api 1 litestar 1 http-server 1 http 1 api-rest 1 api 1 pytorch 1 speech-recognition 1 onnex-models 1 ios 1 deep-neural-network 1 rhasspy 1 home-assistant 1 docker-compose 1 docker 1 speech-recoginition 1 onnx-models 1 deep-neural-networks 1 android 1 xttsv2 1 voicecraft 1 voice-clone 1 tkinter-gui 1 subtitle-to-voice 1 subtitle-to-speech 1 pdf-to-audio 1 dubbing 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos