Topic: "speech-to-text"
ggml-org/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language: C++ - Size: 20.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 39,736 - Forks: 4,184

mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Language: C++ - Size: 48.2 MB - Last synced at: 12 minutes ago - Pushed at: 8 months ago - Stars: 26,319 - Forks: 4,041

leon-ai/leon
🧠 Leon is your open-source personal assistant.
Language: TypeScript - Size: 21.3 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 16,220 - Forks: 1,347

SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language: Python - Size: 36.6 MB - Last synced at: 7 days ago - Pushed at: 12 days ago - Stars: 15,802 - Forks: 1,316

m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language: Python - Size: 38.6 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 15,321 - Forks: 1,657

kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language: Shell - Size: 120 MB - Last synced at: 10 minutes ago - Pushed at: 14 days ago - Stars: 14,834 - Forks: 5,355

jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. Also supports speech recognition transcription, speech synthesis, and subtitle translation.
Language: Python - Size: 452 MB - Last synced at: 3 days ago - Pushed at: 16 days ago - Stars: 12,702 - Forks: 1,407

speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language: Python - Size: 97.8 MB - Last synced at: 6 days ago - Pushed at: 16 days ago - Stars: 9,783 - Forks: 1,485

alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: 7 days ago - Pushed at: 11 days ago - Stars: 9,406 - Forks: 1,266

Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Language: Python - Size: 106 MB - Last synced at: 6 days ago - Pushed at: 26 days ago - Stars: 8,708 - Forks: 2,416

nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
Language: Python - Size: 7.77 MB - Last synced at: 14 days ago - Pushed at: 8 months ago - Stars: 8,106 - Forks: 1,907

KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Language: Python - Size: 920 KB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 7,025 - Forks: 566

TalAter/annyang
💬 Speech recognition for your site
Language: JavaScript - Size: 1.64 MB - Last synced at: 18 days ago - Pushed at: 9 months ago - Stars: 6,660 - Forks: 1,046

k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, support 11 programming languages
Language: C++ - Size: 9.1 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 5,836 - Forks: 659

FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language: Python - Size: 6.51 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 5,556 - Forks: 495

snakers4/silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Language: Jupyter Notebook - Size: 488 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 5,253 - Forks: 336

sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Language: Jupyter Notebook - Size: 8.75 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 4,590 - Forks: 402

modelscope/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.
Language: Python - Size: 43.2 MB - Last synced at: 14 days ago - Pushed at: 2 months ago - Stars: 4,480 - Forks: 516

MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Language: Jupyter Notebook - Size: 435 KB - Last synced at: 14 days ago - Pushed at: 20 days ago - Stars: 4,432 - Forks: 411

huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT-4o
Language: Python - Size: 299 KB - Last synced at: 4 days ago - Pushed at: 27 days ago - Stars: 4,011 - Forks: 441

gradio-app/fastrtc
The python library for real-time communication
Language: JavaScript - Size: 4.54 MB - Last synced at: about 15 hours ago - Pushed at: about 15 hours ago - Stars: 3,852 - Forks: 332

abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language: Python - Size: 78 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,648 - Forks: 271

jianchang512/stt
Voice Recognition to Text Tool / An offline, locally running audio and video to subtitle tool that outputs JSON, SRT subtitles, and plain text
Language: Python - Size: 129 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 3,255 - Forks: 350

ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Language: Python - Size: 3.27 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 2,891 - Forks: 195

tensorflow/lingvo
Lingvo
Language: Python - Size: 142 MB - Last synced at: 4 days ago - Pushed at: 12 days ago - Stars: 2,838 - Forks: 451

HeyWillow/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
Language: C - Size: 1.83 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 2,773 - Forks: 104

ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
Language: Python - Size: 1.76 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2,517 - Forks: 449

coqui-ai/STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Language: C++ - Size: 53.4 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 2,425 - Forks: 286

linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Language: Python - Size: 4.49 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 2,368 - Forks: 181

pannous/tensorflow-speech-recognition
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Language: Python - Size: 31.1 MB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 2,171 - Forks: 635

pluja/whishper
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Language: Svelte - Size: 14.1 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2,161 - Forks: 121

Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Size: 207 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 1,925 - Forks: 90

jarikomppa/soloud
Free, easy, portable audio engine for games
Language: C - Size: 12 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 1,896 - Forks: 294

mesolitica/NLP-Models-Tensorflow 📦
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
Language: Jupyter Notebook - Size: 46.2 MB - Last synced at: 11 days ago - Pushed at: almost 5 years ago - Stars: 1,791 - Forks: 726

bugbakery/audapolis
an editor for spoken-word audio with automatic transcription
Language: TypeScript - Size: 4.05 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,732 - Forks: 43

kalliope-project/kalliope
Kalliope is a framework that will help you to create your own personal assistant.
Language: Python - Size: 20.9 MB - Last synced at: 29 days ago - Pushed at: almost 2 years ago - Stars: 1,729 - Forks: 229

sindresorhus/awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
Size: 1.14 MB - Last synced at: about 6 hours ago - Pushed at: 23 days ago - Stars: 1,662 - Forks: 82

NVIDIA/OpenSeq2Seq 📦
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Language: Python - Size: 57.4 MB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 1,558 - Forks: 369

DragonComputer/Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
Language: Python - Size: 24.1 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 1,389 - Forks: 215

coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Size: 139 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 1,318 - Forks: 142

sdkcarlos/artyom.js
A voice control - voice commands - speech recognition and speech synthesis JavaScript library. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.
Language: JavaScript - Size: 1.08 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 1,255 - Forks: 367

AlekPet/ComfyUI_Custom_Nodes_AlekPet
Custom nodes that extend the capabilities of Comfyui
Language: JavaScript - Size: 11.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,183 - Forks: 74

Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Language: Python - Size: 3.06 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1,159 - Forks: 369

Robitx/gp.nvim
Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]
Language: Lua - Size: 593 KB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 1,158 - Forks: 98

modal-labs/quillman
A voice chat app
Language: Python - Size: 4.3 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,120 - Forks: 133

R3gm/SoniTranslate
Synchronized Translation for Videos. Video dubbing
Language: Python - Size: 19.4 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1,096 - Forks: 226

Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Language: Python - Size: 4.07 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,063 - Forks: 80

ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language: Python - Size: 18.2 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 1,053 - Forks: 80

tmoroney/auto-subs
Generate Subtitles & Diarize Speakers in DaVinci Resolve using AI.
Language: TypeScript - Size: 80.7 MB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 1,040 - Forks: 53

Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
Language: Python - Size: 1.14 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,009 - Forks: 91

TensorSpeech/TensorFlowASR
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supports languages that can use characters or subwords
Language: Python - Size: 89.8 MB - Last synced at: 13 minutes ago - Pushed at: about 1 hour ago - Stars: 969 - Forks: 245

codeforequity-at/botium-speech-processing
Botium Speech Processing
Language: JavaScript - Size: 583 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 945 - Forks: 59

ardha27/AI-Waifu-Vtuber
AI Vtuber for Streaming on Youtube/Twitch
Language: Python - Size: 6.2 MB - Last synced at: 30 days ago - Pushed at: 10 months ago - Stars: 926 - Forks: 148

backmeupplz/voicy
@voicybot Telegram bot main repository
Language: TypeScript - Size: 6.61 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 903 - Forks: 163

mikeyy/nonoCAPTCHA
An asynchronized Python library to automate solving ReCAPTCHA v2 using audio
Language: Python - Size: 117 MB - Last synced at: 25 days ago - Pushed at: almost 2 years ago - Stars: 896 - Forks: 192

mkiol/dsnote
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Language: C++ - Size: 75.2 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 875 - Forks: 37

yeyupiaoling/PPASR
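End-to-end Chinese speech recognition implemented with PaddlePaddle, from getting started to hands-on practice, with very simple introductory examples and highly practical enterprise projects. Supports the currently most popular DeepSpeech2, Conformer, and Squeezeformer models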
Language: Python - Size: 17.7 MB - Last synced at: 5 days ago - Pushed at: 11 days ago - Stars: 859 - Forks: 129

Saik0s/Whisperboard
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Language: Swift - Size: 179 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 842 - Forks: 83

alesaccoia/VoiceStreamAI
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
Language: Python - Size: 14 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 840 - Forks: 123

srvk/eesen
The official repository of the Eesen project
Language: C++ - Size: 5.86 MB - Last synced at: 10 days ago - Pushed at: almost 6 years ago - Stars: 829 - Forks: 342

deepgram/kur
Descriptive Deep Learning
Language: Python - Size: 1.79 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 822 - Forks: 108

VladislavAntonyuk/MauiSamples
.NET MAUI Samples
Language: C# - Size: 17 MB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 820 - Forks: 202

saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
Language: Python - Size: 407 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 813 - Forks: 141

SlapBot/stephanie-va
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of a virtual assistant's work.
Language: Python - Size: 99.6 KB - Last synced at: 6 months ago - Pushed at: over 6 years ago - Stars: 799 - Forks: 127

snakers4/open_stt 📦
Open STT
Language: Python - Size: 87.9 KB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 794 - Forks: 84

hayabhay/frogbase 📦
Transform audio-visual content into navigable knowledge.
Language: Python - Size: 1.22 MB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 786 - Forks: 95

JamesBrill/react-speech-recognition
💬Speech recognition for your React app
Language: JavaScript - Size: 905 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 777 - Forks: 128

locaal-ai/obs-localvocal
OBS plugin for local speech recognition and captioning using AI
Language: C++ - Size: 70.1 MB - Last synced at: about 3 hours ago - Pushed at: 3 months ago - Stars: 771 - Forks: 59

mallorbc/whisper_mic
Project that allows one to use a microphone with OpenAI whisper.
Language: Python - Size: 54.7 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 762 - Forks: 167

mezbaul-h/june
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
Language: Python - Size: 12.5 MB - Last synced at: 29 days ago - Pushed at: 9 months ago - Stars: 759 - Forks: 49

savbell/whisper-writer
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
Language: Python - Size: 905 KB - Last synced at: 30 days ago - Pushed at: 9 months ago - Stars: 757 - Forks: 105

yeyupiaoling/PaddlePaddle-DeepSpeech
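Speech recognition implemented with PaddlePaddle, for Chinese speech recognition. The project is complete and the recognition quality is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.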
Language: Python - Size: 15 MB - Last synced at: 28 days ago - Pushed at: 5 months ago - Stars: 726 - Forks: 148

jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Language: Python - Size: 1.68 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 722 - Forks: 103

sandrohanea/whisper.net
Whisper.net. Speech to text made simple using Whisper Models
Language: C# - Size: 59.1 MB - Last synced at: 4 days ago - Pushed at: 17 days ago - Stars: 719 - Forks: 106

MycroftAI/adapt
Adapt Intent Parser
Language: Python - Size: 361 KB - Last synced at: about 11 hours ago - Pushed at: 10 months ago - Stars: 716 - Forks: 156

Baidu-AIP/speech-demo
Speech API examples
Language: Java - Size: 4.05 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 702 - Forks: 761

googleapis/nodejs-speech 📦
This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node.
Size: 11.4 MB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 688 - Forks: 290

EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Size: 6.56 MB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 672 - Forks: 39

VRCWizard/TTS-Voice-Wizard
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
Language: C# - Size: 42.5 MB - Last synced at: about 12 hours ago - Pushed at: about 1 month ago - Stars: 671 - Forks: 74

exPHAT/SwiftWhisper
🎤 The easiest way to transcribe audio in Swift
Language: Swift - Size: 720 KB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 670 - Forks: 85

yeyupiaoling/MASR
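A streaming and non-streaming automatic speech recognition framework implemented in PyTorch, compatible with both online and offline recognition. Currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with multiple data augmentation methods.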
Language: Python - Size: 9.43 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 669 - Forks: 112

evancohen/sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Language: JavaScript - Size: 1.34 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 636 - Forks: 79

Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
Language: Python - Size: 290 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 621 - Forks: 71

lobehub/lobe-tts
🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser
Language: TypeScript - Size: 428 KB - Last synced at: 7 days ago - Pushed at: 14 days ago - Stars: 601 - Forks: 75

abhirooptalasila/AutoSub
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
Language: Python - Size: 91.8 KB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 596 - Forks: 103

Picovoice/speech-to-text-benchmark
speech to text benchmark framework
Language: Python - Size: 159 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 577 - Forks: 62

vardanagarwal/Proctoring-AI
Creating software for automatic monitoring in online proctoring
Language: Python - Size: 383 MB - Last synced at: 28 days ago - Pushed at: 5 months ago - Stars: 573 - Forks: 339

Capsize-Games/airunner
Privacy focused, local-first, multi-modal inference engine and agent platform for running LLMs, image generation, speech processing, and tool-based automation
Language: Python - Size: 21.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 567 - Forks: 40

algolia/voice-overlay-ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Language: Swift - Size: 22.1 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 549 - Forks: 62

YoavRamon/awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Size: 18.6 KB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 537 - Forks: 84

zh-plus/openlrc
Transcribe and translate voice into an LRC file using Whisper and LLMs (GPT, Claude, et al.).
Language: Python - Size: 8.23 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 535 - Forks: 34

SakiRinn/LiveCaptions-Translator
A real-time audio/speech translation tool based on Windows LiveCaptions.
Language: C# - Size: 479 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 533 - Forks: 47

soupslurpr/Transcribro
Private and on-device speech recognition keyboard and service for Android.
Language: Kotlin - Size: 119 MB - Last synced at: 1 day ago - Pushed at: 13 days ago - Stars: 533 - Forks: 15

lkuza2/java-speech-api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Language: Java - Size: 423 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 531 - Forks: 304

Macoron/whisper.unity
Running speech to text model (whisper.cpp) in Unity3d on your local machine.
Language: C# - Size: 114 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 515 - Forks: 116

matthiasn/lotti
Achieve your goals and keep your data private with Lotti. This life tracking app is designed to help you stay motivated and on track, all while keeping your personal information safe and secure. Now with on-device speech recognition.
Language: Dart - Size: 42.8 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 513 - Forks: 53

amanvirparhar/chaplin
A real-time silent speech recognition tool.
Language: Python - Size: 18.4 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 479 - Forks: 35

Kyubyong/css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Language: HTML - Size: 179 MB - Last synced at: 7 days ago - Pushed at: about 5 years ago - Stars: 470 - Forks: 61

hackingbeauty/react-mic
Record audio from a user's microphone and display a cool visualization.
Language: JavaScript - Size: 15.3 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 463 - Forks: 157

reriiasu/speech-to-text
Real-time transcription using faster-whisper
Language: HTML - Size: 3.07 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 457 - Forks: 76
