An open API service providing repository metadata for many open source software ecosystems.

Topic: "speech-to-text"

ggml-org/whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language: C++ - Size: 20.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 39,736 - Forks: 4,184

mozilla/DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language: C++ - Size: 48.2 MB - Last synced at: 12 minutes ago - Pushed at: 8 months ago - Stars: 26,319 - Forks: 4,041

leon-ai/leon

🧠 Leon is your open-source personal assistant.

Language: TypeScript - Size: 21.3 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 16,220 - Forks: 1,347

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

Language: Python - Size: 36.6 MB - Last synced at: 7 days ago - Pushed at: 12 days ago - Stars: 15,802 - Forks: 1,316

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python - Size: 38.6 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 15,321 - Forks: 1,657

kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language: Shell - Size: 120 MB - Last synced at: 10 minutes ago - Pushed at: 14 days ago - Stars: 14,834 - Forks: 5,355

jianchang512/pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。

Language: Python - Size: 452 MB - Last synced at: 3 days ago - Pushed at: 16 days ago - Stars: 12,702 - Forks: 1,407

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Language: Python - Size: 97.8 MB - Last synced at: 6 days ago - Pushed at: 16 days ago - Stars: 9,783 - Forks: 1,485

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: 7 days ago - Pushed at: 11 days ago - Stars: 9,406 - Forks: 1,266

Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Language: Python - Size: 106 MB - Last synced at: 6 days ago - Pushed at: 26 days ago - Stars: 8,708 - Forks: 2,416

nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Language: Python - Size: 7.77 MB - Last synced at: 14 days ago - Pushed at: 8 months ago - Stars: 8,106 - Forks: 1,907

KoljaB/RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Language: Python - Size: 920 KB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 7,025 - Forks: 566

TalAter/annyang

💬 Speech recognition for your site

Language: JavaScript - Size: 1.64 MB - Last synced at: 18 days ago - Pushed at: 9 months ago - Stars: 6,660 - Forks: 1,046

k2-fsa/sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, support 11 programming languages

Language: C++ - Size: 9.1 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 5,836 - Forks: 659

FunAudioLLM/SenseVoice

Multilingual Voice Understanding Model

Language: Python - Size: 6.51 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 5,556 - Forks: 495

snakers4/silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Language: Jupyter Notebook - Size: 488 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 5,253 - Forks: 336

sanchit-gandhi/whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language: Jupyter Notebook - Size: 8.75 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 4,590 - Forks: 402

modelscope/FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Language: Python - Size: 43.2 MB - Last synced at: 14 days ago - Pushed at: 2 months ago - Stars: 4,480 - Forks: 516

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language: Jupyter Notebook - Size: 435 KB - Last synced at: 14 days ago - Pushed at: 20 days ago - Stars: 4,432 - Forks: 411

huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Language: Python - Size: 299 KB - Last synced at: 4 days ago - Pushed at: 27 days ago - Stars: 4,011 - Forks: 441

gradio-app/fastrtc

The python library for real-time communication

Language: JavaScript - Size: 4.54 MB - Last synced at: about 15 hours ago - Pushed at: about 15 hours ago - Stars: 3,852 - Forks: 332

abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Language: Python - Size: 78 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,648 - Forks: 271

jianchang512/stt

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式

Language: Python - Size: 129 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 3,255 - Forks: 350

ictnlp/LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Language: Python - Size: 3.27 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 2,891 - Forks: 195

tensorflow/lingvo

Lingvo

Language: Python - Size: 142 MB - Last synced at: 4 days ago - Pushed at: 12 days ago - Stars: 2,838 - Forks: 451

HeyWillow/willow

Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

Language: C - Size: 1.83 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 2,773 - Forks: 104

ahmetoner/whisper-asr-webservice

OpenAI Whisper ASR Webservice API

Language: Python - Size: 1.76 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2,517 - Forks: 449

coqui-ai/STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Language: C++ - Size: 53.4 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 2,425 - Forks: 286

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Language: Python - Size: 4.49 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 2,368 - Forks: 181

pannous/tensorflow-speech-recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Language: Python - Size: 31.1 MB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 2,171 - Forks: 635

pluja/whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Language: Svelte - Size: 14.1 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2,161 - Forks: 121

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

Size: 207 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 1,925 - Forks: 90

jarikomppa/soloud

Free, easy, portable audio engine for games

Language: C - Size: 12 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 1,896 - Forks: 294

mesolitica/NLP-Models-Tensorflow 📦

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

Language: Jupyter Notebook - Size: 46.2 MB - Last synced at: 11 days ago - Pushed at: almost 5 years ago - Stars: 1,791 - Forks: 726

bugbakery/audapolis

an editor for spoken-word audio with automatic transcription

Language: TypeScript - Size: 4.05 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,732 - Forks: 43

kalliope-project/kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Language: Python - Size: 20.9 MB - Last synced at: 29 days ago - Pushed at: almost 2 years ago - Stars: 1,729 - Forks: 229

sindresorhus/awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

Size: 1.14 MB - Last synced at: about 6 hours ago - Pushed at: 23 days ago - Stars: 1,662 - Forks: 82

NVIDIA/OpenSeq2Seq 📦

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Language: Python - Size: 57.4 MB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 1,558 - Forks: 369

DragonComputer/Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

Language: Python - Size: 24.1 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 1,389 - Forks: 215

coqui-ai/open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Size: 139 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 1,318 - Forks: 142

sdkcarlos/artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

Language: JavaScript - Size: 1.08 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 1,255 - Forks: 367

AlekPet/ComfyUI_Custom_Nodes_AlekPet

Custom nodes that extend the capabilities of Comfyui

Language: JavaScript - Size: 11.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,183 - Forks: 74

Kyubyong/dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Language: Python - Size: 3.06 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1,159 - Forks: 369

Robitx/gp.nvim

Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]

Language: Lua - Size: 593 KB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 1,158 - Forks: 98

modal-labs/quillman

A voice chat app

Language: Python - Size: 4.3 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,120 - Forks: 133

R3gm/SoniTranslate

Synchronized Translation for Videos. Video dubbing

Language: Python - Size: 19.4 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1,096 - Forks: 226

Blaizzy/mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Language: Python - Size: 4.07 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,063 - Forks: 80

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Language: Python - Size: 18.2 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 1,053 - Forks: 80

tmoroney/auto-subs

Generate Subtitles & Diarize Speakers in Davinci Resolve using AI.

Language: TypeScript - Size: 80.7 MB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 1,040 - Forks: 53

Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Language: Python - Size: 1.14 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,009 - Forks: 91

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Language: Python - Size: 89.8 MB - Last synced at: 13 minutes ago - Pushed at: about 1 hour ago - Stars: 969 - Forks: 245

codeforequity-at/botium-speech-processing

Botium Speech Processing

Language: JavaScript - Size: 583 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 945 - Forks: 59

ardha27/AI-Waifu-Vtuber

AI Vtuber for Streaming on Youtube/Twitch

Language: Python - Size: 6.2 MB - Last synced at: 30 days ago - Pushed at: 10 months ago - Stars: 926 - Forks: 148

backmeupplz/voicy

@voicybot Telegram bot main repository

Language: TypeScript - Size: 6.61 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 903 - Forks: 163

mikeyy/nonoCAPTCHA

An asynchronized Python library to automate solving ReCAPTCHA v2 using audio

Language: Python - Size: 117 MB - Last synced at: 25 days ago - Pushed at: almost 2 years ago - Stars: 896 - Forks: 192

mkiol/dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

Language: C++ - Size: 75.2 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 875 - Forks: 37

yeyupiaoling/PPASR

基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

Language: Python - Size: 17.7 MB - Last synced at: 5 days ago - Pushed at: 11 days ago - Stars: 859 - Forks: 129

Saik0s/Whisperboard

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

Language: Swift - Size: 179 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 842 - Forks: 83

alesaccoia/VoiceStreamAI

Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS

Language: Python - Size: 14 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 840 - Forks: 123

srvk/eesen

The official repository of the Eesen project

Language: C++ - Size: 5.86 MB - Last synced at: 10 days ago - Pushed at: almost 6 years ago - Stars: 829 - Forks: 342

deepgram/kur

Descriptive Deep Learning

Language: Python - Size: 1.79 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 822 - Forks: 108

VladislavAntonyuk/MauiSamples

.NET MAUI Samples

Language: C# - Size: 17 MB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 820 - Forks: 202

saharmor/whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

Language: Python - Size: 407 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 813 - Forks: 141

SlapBot/stephanie-va

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.

Language: Python - Size: 99.6 KB - Last synced at: 6 months ago - Pushed at: over 6 years ago - Stars: 799 - Forks: 127

snakers4/open_stt 📦

Open STT

Language: Python - Size: 87.9 KB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 794 - Forks: 84

hayabhay/frogbase 📦

Transform audio-visual content into navigable knowledge.

Language: Python - Size: 1.22 MB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 786 - Forks: 95

JamesBrill/react-speech-recognition

💬Speech recognition for your React app

Language: JavaScript - Size: 905 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 777 - Forks: 128

locaal-ai/obs-localvocal

OBS plugin for local speech recognition and captioning using AI

Language: C++ - Size: 70.1 MB - Last synced at: about 3 hours ago - Pushed at: 3 months ago - Stars: 771 - Forks: 59

mallorbc/whisper_mic

Project that allows one to use a microphone with OpenAI whisper.

Language: Python - Size: 54.7 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 762 - Forks: 167

mezbaul-h/june

Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit

Language: Python - Size: 12.5 MB - Last synced at: 29 days ago - Pushed at: 9 months ago - Stars: 759 - Forks: 49

savbell/whisper-writer

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

Language: Python - Size: 905 KB - Last synced at: 30 days ago - Pushed at: 9 months ago - Stars: 757 - Forks: 105

yeyupiaoling/PaddlePaddle-DeepSpeech

基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。

Language: Python - Size: 15 MB - Last synced at: 28 days ago - Pushed at: 5 months ago - Stars: 726 - Forks: 148

jitsi/jiwer

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Language: Python - Size: 1.68 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 722 - Forks: 103

sandrohanea/whisper.net

Whisper.net. Speech to text made simple using Whisper Models

Language: C# - Size: 59.1 MB - Last synced at: 4 days ago - Pushed at: 17 days ago - Stars: 719 - Forks: 106

MycroftAI/adapt

Adapt Intent Parser

Language: Python - Size: 361 KB - Last synced at: about 11 hours ago - Pushed at: 10 months ago - Stars: 716 - Forks: 156

Baidu-AIP/speech-demo

语音api示例

Language: Java - Size: 4.05 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 702 - Forks: 761

googleapis/nodejs-speech 📦

This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node.

Size: 11.4 MB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 688 - Forks: 290

EmulationAI/awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Size: 6.56 MB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 672 - Forks: 39

VRCWizard/TTS-Voice-Wizard

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

Language: C# - Size: 42.5 MB - Last synced at: about 12 hours ago - Pushed at: about 1 month ago - Stars: 671 - Forks: 74

exPHAT/SwiftWhisper

🎤 The easiest way to transcribe audio in Swift

Language: Swift - Size: 720 KB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 670 - Forks: 85

yeyupiaoling/MASR

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

Language: Python - Size: 9.43 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 669 - Forks: 112

evancohen/sonus

:speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword detection

Language: JavaScript - Size: 1.34 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 636 - Forks: 79

Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

Language: Python - Size: 290 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 621 - Forks: 71

lobehub/lobe-tts

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

Language: TypeScript - Size: 428 KB - Last synced at: 7 days ago - Pushed at: 14 days ago - Stars: 601 - Forks: 75

abhirooptalasila/AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

Language: Python - Size: 91.8 KB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 596 - Forks: 103

Picovoice/speech-to-text-benchmark

speech to text benchmark framework

Language: Python - Size: 159 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 577 - Forks: 62

vardanagarwal/Proctoring-AI

Creating a software for automatic monitoring in online proctoring

Language: Python - Size: 383 MB - Last synced at: 28 days ago - Pushed at: 5 months ago - Stars: 573 - Forks: 339

Capsize-Games/airunner

Privacy focused, local-first, multi-modal inference engine and agent platform for running LLMs, image generation, speech processing, and tool-based automation

Language: Python - Size: 21.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 567 - Forks: 40

algolia/voice-overlay-ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Language: Swift - Size: 22.1 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 549 - Forks: 62

YoavRamon/awesome-kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Size: 18.6 KB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 537 - Forks: 84

zh-plus/openlrc

Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。

Language: Python - Size: 8.23 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 535 - Forks: 34

SakiRinn/LiveCaptions-Translator

A real-time audio/speech translation tool based on Windows LiveCaptions.

Language: C# - Size: 479 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 533 - Forks: 47

soupslurpr/Transcribro

Private and on-device speech recognition keyboard and service for Android.

Language: Kotlin - Size: 119 MB - Last synced at: 1 day ago - Pushed at: 13 days ago - Stars: 533 - Forks: 15

lkuza2/java-speech-api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Language: Java - Size: 423 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 531 - Forks: 304

Macoron/whisper.unity

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

Language: C# - Size: 114 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 515 - Forks: 116

matthiasn/lotti

Achieve your goals and keep your data private with Lotti. This life tracking app is designed to help you stay motivated and on track, all while keeping your personal information safe and secure. Now with on-device speech recognition.

Language: Dart - Size: 42.8 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 513 - Forks: 53

amanvirparhar/chaplin

A real-time silent speech recognition tool.

Language: Python - Size: 18.4 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 479 - Forks: 35

Kyubyong/css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Language: HTML - Size: 179 MB - Last synced at: 7 days ago - Pushed at: about 5 years ago - Stars: 470 - Forks: 61

hackingbeauty/react-mic

Record audio from a user's microphone and display a cool visualization.

Language: JavaScript - Size: 15.3 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 463 - Forks: 157

reriiasu/speech-to-text

Real-time transcription using faster-whisper

Language: HTML - Size: 3.07 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 457 - Forks: 76