An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: silero

lukaszliniewicz/Pandrator

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

Language: Python - Size: 8.11 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 444 - Forks: 35

gkonovalov/android-vad

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: C - Size: 5.18 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 323 - Forks: 69

Navatusein/Silero-TTS-Service

Silero TTS backend service. Can be used with Home Assistant and Rhasspy.

Language: Python - Size: 387 KB - Last synced at: 10 days ago - Pushed at: 10 months ago - Stars: 43 - Forks: 10

baochuquan/ios-vad

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Language: Swift - Size: 4.5 MB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 10 - Forks: 0

DictationDaddy/VAD_WEB_DEMO

In this repository, I show you how to use SILERO VAD with ONNX-WEB runtime to run the VAD compeletely in the browser.

Language: JavaScript - Size: 2 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 20 - Forks: 1

Ave-Sergeev/Dictator

Speech-to-Text translation service (Rust, Tonic) (2025)

Language: Rust - Size: 49.3 MB - Last synced at: 18 days ago - Pushed at: 28 days ago - Stars: 6 - Forks: 0

POMXARK/SmartDictor_0.1_Nuitka_cleer

Распознание и озвучивание голосовым движком текста с экрана.

Language: Python - Size: 5.38 MB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

eja/wav2vad

A command line tool for voice activity detection.

Language: C - Size: 21.1 MB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

numq/noise-reduction

JVM library for noise reduction written in Kotlin based on the ML model Silero

Language: Kotlin - Size: 137 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

numq/voice-activity-detection

JVM library for voice activity detection written in Kotlin based on C library fvad and Silero

Language: Kotlin - Size: 2.94 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

muclana/audiobook-creator

Audiobook Creator is an open-source tool that converts books (EPUB, PDF, TXT) into fully voiced audiobooks with intelligent character voice attribution. It uses NLP, LLMs, and Kokoro TTS to generate engaging, multi-voice audiobooks. Features include text cleaning, character identification, and customizable narration. Licensed under GPL-3.0.

Size: 1000 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

atomlayer/llama_cute_voice_assistant

Llama cute voice assistant

Language: Python - Size: 573 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 1

GhostNaN/silero-webui

Silero TTS web UI

Language: Python - Size: 172 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 12 - Forks: 1

twirapp/silero-tts-api-server

This is a simple server that uses Silero models to convert text to audio files over HTTP

Language: Python - Size: 488 KB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 13 - Forks: 4

DictationDaddy/VAD

It's typescript based VAD that uses silero ai VAD under the hood. It's highly robust for Voice Activity Detection. It only works in the browser.

Language: TypeScript - Size: 1.85 MB - Last synced at: 29 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

BenAAndrew/speech-transcriber

A web-app/library for transcribing speech

Language: Python - Size: 796 KB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

daswer123/silero-tts-enhanced

Silero TTS Enhanced is a Python library that enhances the original Silero TTS project, providing a convenient way to synthesize speech from text using Silero TTS models. It offers a user-friendly interface for both standalone script usage and integration into Python projects, along with additional features

Language: Python - Size: 46.9 KB - Last synced at: 9 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

i6od/Voice-to-Voice-Ooba

voice to voice with ai text generator that can be hooked up to vtube studio like an ai assistant.

Language: Python - Size: 30.3 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 8 - Forks: 0

tochilkinva/tg_bot_stt_tts

Telegram bot with voice message recognition and generation. Speech to Text and Text to Speech

Language: Python - Size: 41 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 2

mishadobrits/SVA4

Automatically cuts out parts without speech from given video, making it shorter and more enjoyable to watch (look examples). Usage on google.collab in several clicks.

Language: Python - Size: 115 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0