GitHub topics: openai-whisper
usefulsensors/useful-transformers
Efficient Inference of Transformer models
Language: C++ - Size: 135 MB - Last synced at: about 13 hours ago - Pushed at: 9 months ago - Stars: 432 - Forks: 42

maciekt07/Lecture-Note-Generator-POC
📒 A proof-of-concept app that transcribes lecture recordings into text and generates structured academic notes using a local LLM
Language: TypeScript - Size: 22.4 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 1

gabriele-ciccotelli/WhisperQueue
Whisper Transcriber is particularly useful for those who have to transcribe large amounts of files in different formats.
Language: Python - Size: 22.5 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

absadiki/pywhispercpp
Python bindings for whisper.cpp
Language: Python - Size: 1.44 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 245 - Forks: 43

Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
Language: Python - Size: 1.14 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,009 - Forks: 91

speaches-ai/speaches
Language: Python - Size: 1.97 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,790 - Forks: 225

FlyingFathead/whisper-transcriber-telegram-bot
Python-based Telegram transcriber bot utilizing local Whisper models & yt-dlp
Language: Python - Size: 7.31 MB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 36 - Forks: 9

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API
⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,为媒体内容数据自动化处理提供了强大且可扩展的解决方案。
Language: Python - Size: 1.21 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 368 - Forks: 42

RijoSLal/Quizzy
Quizzy is an AI interviewer that creates hyper-personalized interview simulation using a RAG-based system for dynamic conversations. It analyzes emotions, perception, posture, and responses, ensuring a natural flow. With job opening scraping and an embedding-based ATS score checker, Quizzy prepares you for the job market. Built with MLOps in Django
Language: CSS - Size: 5.12 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

semanticdata/vm-transcriber
Language: Python - Size: 45.9 KB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

sashabaranov/go-openai
OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go
Language: Go - Size: 709 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 9,926 - Forks: 1,574

Ambrosios13/RealTimeTranscription-RTT
Transforme voz em texto efetivamente: Ferramenta de transcrição em tempo real com modelos Whisper e aceleração por GPU NVIDIA com suporte CUDA.
Language: Python - Size: 103 KB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

AnanthaRajuC/AIML_NLP
AIML Natural Language Processing - Speech, Audio
Language: Java - Size: 4.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

locaal-ai/obs-localvocal
OBS plugin for local speech recognition and captioning using AI
Language: C++ - Size: 70.1 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 765 - Forks: 59

axshatInd/XethScribe
XethScribe is an AI-driven web application designed for real-time audio transcription and translation, leveraging advanced models like OpenAI Whisper for speech recognition. It seamlessly processes audio inputs to deliver accurate, timestamped text outputs for various use cases.
Language: JavaScript - Size: 144 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

HemulGM/DelphiOpenAI
OpenAI (and DeepSeek, Azure OpenAI) API wrapper for Delphi. Use ChatGPT, DALL-E, Whisper and other products.
Language: Pascal - Size: 938 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 259 - Forks: 64

tatsuikeda/simple-video-transcriber
Simple Video Transcriber is a Python-based tool that uses OpenAI's Whisper model to transcribe audio from video and audio files. It provides an easy-to-use interface for transcribing audio content and saving the results to a text file.
Language: Python - Size: 2.93 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

AayushKumarSingh/Speech-to-text
Transcribe Audio data from user to Text using locally fine-tuned OpenAI-Whisper model
Language: Python - Size: 9.77 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

AliasUruz/whisper-flash-transcriber
Instant offline audio transcription using OpenAI's Whisper AI.
Language: Python - Size: 1.26 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

adikul358/audionotes
Manage all of your dictated notes using OpenAI Whisper and Chat APIs.
Language: TypeScript - Size: 1.5 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

Illyism/openai-whisper-api
OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example
Language: TypeScript - Size: 22.5 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 113 - Forks: 11

c12i/bunge-bits
Bunge Bits provides convenient summaries of Kenyan National Assembly and Senate seatings, making legislative information more accessible and digestible.
Language: Rust - Size: 460 KB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 6 - Forks: 0

ethicalabs-ai/Kurtis-E1-MLX-Voice-Agent
A lightweight voice companion, optimized for macOS.
Language: Python - Size: 197 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 7 - Forks: 1

aramb-dev/transcriptr
Transcriptr is a modern web application that converts audio files to text using artificial intelligence. It provides a clean, intuitive interface for uploading audio files and receiving high-quality transcriptions powered by Replicate's Incredibly Fast Whisper model.
Language: TypeScript - Size: 15.9 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0

omkartidke42/Audio-text-rag-app
A web app that converts audio to text and enhances transcription with Retrieval-Augmented Generation (RAG). Upload audio, get accurate transcriptions with contextual enrichment using external knowledge sources
Language: Python - Size: 854 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

royceschultz/ComfyUI-TranscriptionTools
ComfyUI nodes for transcription on audio or video input.
Language: Python - Size: 314 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 20 - Forks: 3

tkarabela/pysubs2
A Python library for editing subtitle files
Language: Python - Size: 2.6 MB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 361 - Forks: 45

botbahlul/whisper_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
Language: Python - Size: 203 KB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 22 - Forks: 2

artenderrr/web-transcriber
A user-friendly web application that transcribes recorded audio using OpenAI's Whisper model.
Language: Vue - Size: 121 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

Pho86/mountain-madness-2025
Knot Madness, an interactive 3D physics simulation that allows users practice tying knots using hand tracking and voice commands. Winner of Best Technical Project @ Mountain-Madness 2025 🏅
Language: JavaScript - Size: 139 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 1

nicolodiamante/chatty
Unleash the power of Chatty: the intersection of ChatGPT’s intelligence, DALL·E's creativity, and Whisper's precise audio transcription for your Apple devices with support of 30 languages.
Size: 85.9 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 166 - Forks: 9

abus-aikorea/kara-audio
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.
Language: Python - Size: 21.6 MB - Last synced at: 16 days ago - Pushed at: 6 months ago - Stars: 45 - Forks: 4

quality-software-development/lazy-lecture
Репозиторий проекта для транскрипции текстов лекций
Language: Python - Size: 4.69 MB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 1

waltervanheuven/speech2text
Speech2Text
Language: Python - Size: 59.6 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

einToast/openai_stt_ha
OpenAI Whisper in HA via the OpenAI API for use in the Assist pipeline
Language: Python - Size: 39.1 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 12 - Forks: 4

savbell/whisper-writer
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
Language: Python - Size: 905 KB - Last synced at: 30 days ago - Pushed at: 9 months ago - Stars: 757 - Forks: 105

Adityauyadav/Psychometric-Test-Platform
Open-source platform using NLP & speech recognition for unbiased, real-time subjective psychometric assessments.
Size: 302 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

m1guelpf/auto-subtitle
Automatically generate and overlay subtitles for any video.
Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 1,854 - Forks: 303

SreejanPersonal/openai-unofficial
An completely Free & Unlimited unofficial Python SDK for the OpenAI API, providing seamless integration and easy-to-use methods for interacting with OpenAI's latest powerful AI models, including GPT-4o (Including gpt-4o-audio-preview & gpt-4o-realtime-preview Models), GPT-4, GPT-3.5 Turbo, DALL·E 3, Whisper & Text-to-Speech (TTS) models
Language: Python - Size: 36.1 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 21 - Forks: 10

ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
Language: Python - Size: 1.76 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2,517 - Forks: 449

kevinkoech357/transkript
A flask built web app that leverages the power of OpenAI's whisper model to transcribe audio and video files. Has support for various file formats. Generates timestamped .srt files.
Language: HTML - Size: 5.72 MB - Last synced at: 21 days ago - Pushed at: 23 days ago - Stars: 2 - Forks: 1

allozaur/memvo-io
Open-source Audio Transcription web app using ElevenLabs Scribe v1 & OpenAI Whisper models
Language: Svelte - Size: 475 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Grt1228/chatgpt-java
ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java
Language: Java - Size: 455 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 3,433 - Forks: 821

zahidkhawaja/whisper-nextjs
Next.js app for serverless deployments of OpenAI Whisper on Banana.dev
Language: JavaScript - Size: 72.3 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 95 - Forks: 35

danielrosehill/Whisper-Notepad-Simple
A Linux desktop utility for converting speech to text using the OpenAI Whisper API
Language: Python - Size: 1.24 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Eyevinn/auto-subtitles
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
Language: TypeScript - Size: 264 KB - Last synced at: 26 days ago - Pushed at: about 1 month ago - Stars: 42 - Forks: 6

ZakirCodeArchitect/Sonic-Lipsync-AI
A Google Colab-based Gradio app for generating lip-synced videos using the Sonic model. It supports audio-to-video syncing with Hugging Face models and runs entirely in the cloud—no local setup needed.
Language: Python - Size: 8.44 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

m1guelpf/yt-whisper
Using OpenAI's Whisper to automatically generate YouTube subtitles
Language: Python - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,395 - Forks: 143

Nikorasu/LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Language: Python - Size: 54.7 KB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 345 - Forks: 47

Carleslc/AudioToText
Transcribe and translate audio to text using Whisper and DeepL.
Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 323 - Forks: 44

alperensumeroglu/ai-clips-maker
AI-powered tool to turn long videos into short, viral-ready clips. Combines transcription, speaker diarization, scene detection & 9:16 resizing — perfect for creators & smart automation.
Language: Python - Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

404-5971/ytscript
A CLI to transcribe and summerize youtube videos
Language: Python - Size: 109 KB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

MaxineXiong/OpenAI-API-Web-Apps
This repository hosts a collection of custom web applications powered by OpenAI's GPT models (incl. o1, o3-mini, GPT-4.5, GPT-4o, and GPT-4o mini), Whisper model, and TTS model. These apps include an interactive chatbot ("Talk to GPT") for text or voice communication, and a coding assistant ("CodeMaxGPT") that supports various coding tasks.
Language: Python - Size: 101 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 39 - Forks: 18

AntonioAEMartins/speech-to-text
A Python toolkit for automated audio transcription using OpenAI's Whisper model. It handles large audio files through intelligent compression, parallel chunk processing, and correction of technical terms. Features include audio size optimization and smart text formatting, making it ideal for transcribing meetings and technical conversations.
Language: Python - Size: 19.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

platisd/phonix
Generate captions for videos using the power of OpenAI's Whisper API
Language: Python - Size: 36.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 42 - Forks: 4

nicolodiamante/notefy
Streamline your note-taking with ChatGPT's AI expertise and Whisper's precise transcription, enabling fast and efficient summarising.
Size: 44.9 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 20 - Forks: 0

teddylee777/openai-api-kr
OpenAI 공식 Document, Cookbook, 그 밖의 실용 예제를 바탕으로 작성한 한국어 튜토리얼입니다. 본 튜토리얼을 통해 Python OpenAI API 를 더 쉽고 효과적으로 사용하는 방법을 배울 수 있습니다.
Language: Jupyter Notebook - Size: 39.2 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 49 - Forks: 24

Justmalhar/open-audio
Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate natural-sounding audio from text. Built with modern web technologies for an intuitive user experience, including customizable voice and speech speed settings, and the ability to download audio files directly.
Language: JavaScript - Size: 318 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 84 - Forks: 31

Gemeri/Discord-Voice-Channel-Bot
A bot that can join voice channels using the OpenAI api and Microsoft's free Text-to-Speech (TTS) services. The bot can transcribe conversations, generate intelligent responses, and communicate verbally within your voice channels.
Language: JavaScript - Size: 60.5 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 2 - Forks: 1

KostasEreksonas/Audio-transcriber
Simple Python audio transcriber using OpenAI's Whisper speech recognition model
Language: Python - Size: 45.9 KB - Last synced at: 27 days ago - Pushed at: about 2 months ago - Stars: 34 - Forks: 9

blusewill/ytvideo-whisper
a python script that can auto generate subtitle in YouTube Videos
Language: Python - Size: 327 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

gorgarp/TwitchTranslate
A "Universal" Translation Program For Twitch Streams
Language: Python - Size: 91.8 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 34 - Forks: 20

Lambdua/openai4j
Java client library for OpenAI API.Full support for all OpenAI API models including Completions, Chat, Edits, Embeddings, Audio, Files, Assistants-v2, Images, Moderations, Batch, and Fine-tuning.
Language: Java - Size: 1.21 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 390 - Forks: 40

LoneWolfPro/OpenAI-Whisper-STT
Video to Audio Transcription using OpenAI-Whisper
Language: Python - Size: 12.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

BlackLionXD/LectureSummarizer
LectureSummarizer is an AI-powered website that transcribes and summarizes lectures. It uses OpenAI Whisper for accurate speech-to-text and Llama 2 for concise summaries. With an easy-to-use Gradio interface, it helps students capture and review key lecture points efficiently.
Language: Python - Size: 255 KB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

dhruvyad/uttertype
Short code for dictation using OpenAI Whisper for transcription.
Language: Python - Size: 145 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 65 - Forks: 6

alexogeny/cortana
Your own personal assistant thanks to chat-gpt, whisper, and elevenlabs tts
Language: Python - Size: 39.1 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 49 - Forks: 7

supershaneski/openai-whisper
A sample web app using OpenAI Whisper to transcribe audio built on Next.js. It records audio continuously for some time interval then uploads the audio data to the server for transcribing/translating.
Language: JavaScript - Size: 1010 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 170 - Forks: 33

gllmflndn/whisper.m
Automatic speech recognition in MATLAB/Octave (using whisper.cpp and OpenAI's Whisper)
Language: C - Size: 44.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

WebDevCaptain/nlp-review
Reviewing basics of Natural Language Processing
Language: Jupyter Notebook - Size: 8.08 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ahmedbesbes/audiolizr
A bentoML-powered API to transcribe audio and make sense of it
Language: Python - Size: 10.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 39 - Forks: 2

seanivore/docker-transcription
Automated Audio Transcription
Language: Python - Size: 31.5 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

tracywong117/AI-Video-Segment-Cutter
A Python program to cut segments of a video based on specified keywords using OpenAI Whisper.
Language: Python - Size: 9.06 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 2

fralapo/ai_video_title_generator
A powerful Python tool that automates the generation of SEO-optimized titles for social media videos using AI. This tool processes video clips by transcribing their audio content and generating engaging titles with relevant hashtags.
Language: Python - Size: 32.2 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

transcendence12/personal-ai-assistant
This project is about building a personal AI assistant on Telegram using the OpenAI API. The assistant will integrate functionalities like ChatGPT for conversation, DALL·E for image generation, and Whisper for voice recognition. It will also connect to the internet for real-time information retrieval.
Language: TypeScript - Size: 165 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

JoaoVitorLobo/YouTube-Videos-Summarizer
YouTube Video Summarizer Powered by AI. Whisper-1 and GPT-4o-Mini implementation through OpenAI API. Audio download, transcription to text and summarizer.
Size: 2.93 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

zaltinsoy/AutoSubZ Fork of m1guelpf/auto-subtitle
Automatically generate and overlay subtitles for any video.
Language: Python - Size: 18.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 0

danielrosehill/Thought-Pad
Linux desktop application that provides a two-stage process for creating notes from dictated speech (first stage, transcription via Whisper API; second stage light text formatting). Exports to markdown docs.
Language: Python - Size: 49.4 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

mirabdullahyaser/Summarizing-Youtube-Videos-with-OpenAI-Whisper-and-GPT-3
YouTube video summarization using Whisper audio transcription and GPT-based summaries.
Language: Python - Size: 1.79 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 32 - Forks: 7

legionJP/Caption_Generator
Caption Generator for Videos by uploading videos and Processing in background using python celery , redis for queue, openai whisper model for speech recongnition
Language: Python - Size: 2.06 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

EliasVincent/whisper-subtitles-webui
A gradio interface for making transcribed and translated subtitles for videos
Language: Python - Size: 51.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 34 - Forks: 6

loglux/FlexAudioPrint
FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a script for programmatic use. With FFmpeg for audio conversion, it supports multiple formats like MP3 and WAV. Ideal for transcribing meetings, lectures, and podcasts, with options to save results as text file
Language: Python - Size: 143 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

BruceWind/LocalWhisperAPIService
Language: Python - Size: 18.6 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ShauryaFulfagar/stt-example-rn-expo
An example of how to implement STT in ReactNative with Expo and OpenAI's Whisper
Language: Python - Size: 191 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

Stage-Whisper/Stage-Whisper
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
Language: TypeScript - Size: 3.4 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 255 - Forks: 26

liddiard/harmontown-search
Search all transcripts from the Harmontown podcast. Transcription powered by OpenAI's Whisper model. Search powered by Typesense.
Language: TypeScript - Size: 31 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 12 - Forks: 2

CrimeIsDown/trunk-transcribe
Transcription of calls from trunk-recorder using OpenAI Whisper
Language: Python - Size: 2.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 32 - Forks: 3

godmode2k/whisper.cpp.android
whisper.cpp.android with CLBlast(OpenCL), Translation (Google ML-Kit) and TTS
Language: Kotlin - Size: 1.49 MB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 7 - Forks: 1

didmar/whisper-api-server
Drop-in replacement for the OpenAI's Whisper API using the same API but running locally
Language: Python - Size: 188 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 11 - Forks: 3

ChetanXpro/autosub
Automatically generate and overlay subtitles for any video.
Language: TypeScript - Size: 24.5 MB - Last synced at: 29 days ago - Pushed at: 4 months ago - Stars: 11 - Forks: 0

theboringhumane/openai-voices.piper
🔊 PiperGen: Pretrain Piper TTS with OpenAI voices! Capture, convert, and fine-tune models for rich, natural speech synthesis. 🗣✨
Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

Mohamad-Hussein/speech-assistant
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
Language: Python - Size: 3.54 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 54 - Forks: 2

oscardelgado02/AI-NPC-in-VR-Prototype
VR prototype developed in Unity where the user can talk to an AI-driven NPC (ChatGPT API).
Language: C# - Size: 1.21 GB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

lonk42/wavebrowser
Language: JavaScript - Size: 124 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ckaytev/tgisper
Telegram bot with ASR
Language: Python - Size: 93.8 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 23 - Forks: 2

xiduzo/whisper-sentiment-analysis
An experiment on getting sentiment analysis using whisper
Language: Python - Size: 69.3 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

themihirmathur/SpeechGrade
'SpeechGrade' is an innovative educational platform designed to streamline speech assessment and enhance teaching by integrating the MERN stack, machine learning models, large language models (LLMs), and Google's Generative AI.
Language: JavaScript - Size: 4.37 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

achraf-oujjir/ProfGPT-Smart-VR-Professor
👨🏫🤖 ProfGPT: AI-powered VR professor with electrical circuits lab table ⚡💡 Built with Unity 🎮 GPT and Whisper APIs 🧠 and AWS Polly 🦜🗣️
Language: Smalltalk - Size: 111 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 4 - Forks: 0

AndreIglesias/AI-Medical-Voice-Assistant
AI chat/voice assistant designed to calculate medical scores
Language: Python - Size: 2.39 MB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 5 - Forks: 1

H-Software224/MoLU_digital_competition
Digital competition in 2024
Language: Jupyter Notebook - Size: 43.9 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0
