GitHub topics: openai-whisper | Ecosyste.ms: Repos

usefulsensors/useful-transformers

Efficient Inference of Transformer models

Language: C++ - Size: 135 MB - Last synced at: about 13 hours ago - Pushed at: 9 months ago - Stars: 432 - Forks: 42

maciekt07/Lecture-Note-Generator-POC

📒 A proof-of-concept app that transcribes lecture recordings into text and generates structured academic notes using a local LLM

Language: TypeScript - Size: 22.4 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 1

gabriele-ciccotelli/WhisperQueue

Whisper Transcriber is particularly useful for those who have to transcribe large numbers of files in different formats.

Language: Python - Size: 22.5 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

absadiki/pywhispercpp

Python bindings for whisper.cpp

Language: Python - Size: 1.44 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 245 - Forks: 43

Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Language: Python - Size: 1.14 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,009 - Forks: 91

speaches-ai/speaches

Language: Python - Size: 1.97 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,790 - Forks: 225

FlyingFathead/whisper-transcriber-telegram-bot

Python-based Telegram transcriber bot utilizing local Whisper models & yt-dlp

Language: Python - Size: 7.31 MB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 36 - Forks: 9

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

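⚡ A high-performance asynchronous API for automatic speech recognition (ASR) and translation. There is no need to purchase the Whisper API: inference runs on locally hosted Whisper models, with support for multi-GPU concurrency and a design aimed at distributed deployment. It also includes built-in crawlers for social media platforms such as TikTok and Douyin, enabling seamless media processing across multiple social platforms and providing a powerful, scalable solution for automated processing of media content data.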

Language: Python - Size: 1.21 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 368 - Forks: 42

Quizzy is an AI interviewer that creates hyper-personalized interview simulations using a RAG-based system for dynamic conversations. It analyzes emotions, perception, posture, and responses, ensuring a natural flow. With job opening scraping and an embedding-based ATS score checker, Quizzy prepares you for the job market. Built with MLOps in Django

Language: CSS - Size: 5.12 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

semanticdata/vm-transcriber

Language: Python - Size: 45.9 KB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

sashabaranov/go-openai

OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go

Language: Go - Size: 709 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 9,926 - Forks: 1,574

Ambrosios13/RealTimeTranscription-RTT

Turn speech into text effectively: a real-time transcription tool using Whisper models and NVIDIA GPU acceleration with CUDA support.

Language: Python - Size: 103 KB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

AnanthaRajuC/AIML_NLP

AIML Natural Language Processing - Speech, Audio

Language: Java - Size: 4.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

locaal-ai/obs-localvocal

OBS plugin for local speech recognition and captioning using AI

Language: C++ - Size: 70.1 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 765 - Forks: 59

axshatInd/XethScribe

XethScribe is an AI-driven web application designed for real-time audio transcription and translation, leveraging advanced models like OpenAI Whisper for speech recognition. It seamlessly processes audio inputs to deliver accurate, timestamped text outputs for various use cases.

Language: JavaScript - Size: 144 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

HemulGM/DelphiOpenAI

OpenAI (and DeepSeek, Azure OpenAI) API wrapper for Delphi. Use ChatGPT, DALL-E, Whisper and other products.

Language: Pascal - Size: 938 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 259 - Forks: 64

tatsuikeda/simple-video-transcriber

Simple Video Transcriber is a Python-based tool that uses OpenAI's Whisper model to transcribe audio from video and audio files. It provides an easy-to-use interface for transcribing audio content and saving the results to a text file.

Language: Python - Size: 2.93 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

AayushKumarSingh/Speech-to-text

Transcribe user audio to text using a locally fine-tuned OpenAI Whisper model

Language: Python - Size: 9.77 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

AliasUruz/whisper-flash-transcriber

Instant offline audio transcription using OpenAI's Whisper AI.

Language: Python - Size: 1.26 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

adikul358/audionotes

Manage all of your dictated notes using OpenAI Whisper and Chat APIs.

Language: TypeScript - Size: 1.5 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

Illyism/openai-whisper-api

OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example

Language: TypeScript - Size: 22.5 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 113 - Forks: 11

c12i/bunge-bits

Bunge Bits provides convenient summaries of Kenyan National Assembly and Senate sittings, making legislative information more accessible and digestible.

Language: Rust - Size: 460 KB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 6 - Forks: 0

ethicalabs-ai/Kurtis-E1-MLX-Voice-Agent

A lightweight voice companion, optimized for macOS.

Language: Python - Size: 197 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 7 - Forks: 1

aramb-dev/transcriptr

Transcriptr is a modern web application that converts audio files to text using artificial intelligence. It provides a clean, intuitive interface for uploading audio files and receiving high-quality transcriptions powered by Replicate's Incredibly Fast Whisper model.

Language: TypeScript - Size: 15.9 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0

omkartidke42/Audio-text-rag-app

A web app that converts audio to text and enhances transcription with Retrieval-Augmented Generation (RAG). Upload audio, get accurate transcriptions with contextual enrichment using external knowledge sources

Language: Python - Size: 854 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

royceschultz/ComfyUI-TranscriptionTools

ComfyUI nodes for transcription on audio or video input.

Language: Python - Size: 314 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 20 - Forks: 3

tkarabela/pysubs2

A Python library for editing subtitle files

Language: Python - Size: 2.6 MB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 361 - Forks: 45

botbahlul/whisper_autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file

Language: Python - Size: 203 KB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 22 - Forks: 2

artenderrr/web-transcriber

A user-friendly web application that transcribes recorded audio using OpenAI's Whisper model.

Language: Vue - Size: 121 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

Pho86/mountain-madness-2025

Knot Madness, an interactive 3D physics simulation that allows users to practice tying knots using hand tracking and voice commands. Winner of Best Technical Project @ Mountain-Madness 2025 🏅

Language: JavaScript - Size: 139 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 1

nicolodiamante/chatty

Unleash the power of Chatty: the intersection of ChatGPT’s intelligence, DALL·E's creativity, and Whisper's precise audio transcription for your Apple devices, with support for 30 languages.

Size: 85.9 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 166 - Forks: 9

abus-aikorea/kara-audio

Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.

Language: Python - Size: 21.6 MB - Last synced at: 16 days ago - Pushed at: 6 months ago - Stars: 45 - Forks: 4

quality-software-development/lazy-lecture

Repository for a lecture transcription project

Language: Python - Size: 4.69 MB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 1

waltervanheuven/speech2text

Speech2Text

Language: Python - Size: 59.6 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

einToast/openai_stt_ha

OpenAI Whisper in HA via the OpenAI API for use in the Assist pipeline

Language: Python - Size: 39.1 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 12 - Forks: 4

savbell/whisper-writer

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

Language: Python - Size: 905 KB - Last synced at: 30 days ago - Pushed at: 9 months ago - Stars: 757 - Forks: 105

Adityauyadav/Psychometric-Test-Platform

Open-source platform using NLP & speech recognition for unbiased, real-time subjective psychometric assessments.

Size: 302 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

m1guelpf/auto-subtitle

Automatically generate and overlay subtitles for any video.

Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 1,854 - Forks: 303

SreejanPersonal/openai-unofficial

A completely free and unlimited unofficial Python SDK for the OpenAI API, providing seamless integration and easy-to-use methods for interacting with OpenAI's latest AI models, including GPT-4o (including the gpt-4o-audio-preview and gpt-4o-realtime-preview models), GPT-4, GPT-3.5 Turbo, DALL·E 3, Whisper, and Text-to-Speech (TTS) models

Language: Python - Size: 36.1 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 21 - Forks: 10

ahmetoner/whisper-asr-webservice

OpenAI Whisper ASR Webservice API

Language: Python - Size: 1.76 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2,517 - Forks: 449

kevinkoech357/transkript

A Flask-based web app that uses OpenAI's Whisper model to transcribe audio and video files. Supports various file formats and generates timestamped .srt files.

Language: HTML - Size: 5.72 MB - Last synced at: 21 days ago - Pushed at: 23 days ago - Stars: 2 - Forks: 1

allozaur/memvo-io

Open-source Audio Transcription web app using ElevenLabs Scribe v1 & OpenAI Whisper models

Language: Svelte - Size: 475 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Grt1228/chatgpt-java

ChatGPT Java SDK with support for streaming output, GPT plugins, and internet access. Supports all official OpenAI APIs. A Java client for ChatGPT. OpenAI GPT-3.5-Turbo and GPT-4 API client for Java

Language: Java - Size: 455 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 3,433 - Forks: 821

zahidkhawaja/whisper-nextjs

Next.js app for serverless deployments of OpenAI Whisper on Banana.dev

Language: JavaScript - Size: 72.3 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 95 - Forks: 35

danielrosehill/Whisper-Notepad-Simple

A Linux desktop utility for converting speech to text using the OpenAI Whisper API

Language: Python - Size: 1.24 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Eyevinn/auto-subtitles

Automatically generate subtitles from an input audio or video file using OpenAI Whisper

Language: TypeScript - Size: 264 KB - Last synced at: 26 days ago - Pushed at: about 1 month ago - Stars: 42 - Forks: 6

ZakirCodeArchitect/Sonic-Lipsync-AI

A Google Colab-based Gradio app for generating lip-synced videos using the Sonic model. It supports audio-to-video syncing with Hugging Face models and runs entirely in the cloud—no local setup needed.

Language: Python - Size: 8.44 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

m1guelpf/yt-whisper

Using OpenAI's Whisper to automatically generate YouTube subtitles

Language: Python - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,395 - Forks: 143

Nikorasu/LiveWhisper

A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

Language: Python - Size: 54.7 KB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 345 - Forks: 47

Carleslc/AudioToText

Transcribe and translate audio to text using Whisper and DeepL.

Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 323 - Forks: 44

alperensumeroglu/ai-clips-maker

AI-powered tool to turn long videos into short, viral-ready clips. Combines transcription, speaker diarization, scene detection & 9:16 resizing — perfect for creators & smart automation.

Language: Python - Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

404-5971/ytscript

A CLI to transcribe and summarize YouTube videos

Language: Python - Size: 109 KB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

MaxineXiong/OpenAI-API-Web-Apps

This repository hosts a collection of custom web applications powered by OpenAI's GPT models (incl. o1, o3-mini, GPT-4.5, GPT-4o, and GPT-4o mini), Whisper model, and TTS model. These apps include an interactive chatbot ("Talk to GPT") for text or voice communication, and a coding assistant ("CodeMaxGPT") that supports various coding tasks.

Language: Python - Size: 101 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 39 - Forks: 18

AntonioAEMartins/speech-to-text

A Python toolkit for automated audio transcription using OpenAI's Whisper model. It handles large audio files through intelligent compression, parallel chunk processing, and correction of technical terms. Features include audio size optimization and smart text formatting, making it ideal for transcribing meetings and technical conversations.

Language: Python - Size: 19.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

platisd/phonix

Generate captions for videos using the power of OpenAI's Whisper API

Language: Python - Size: 36.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 42 - Forks: 4

nicolodiamante/notefy

Streamline your note-taking with ChatGPT's AI expertise and Whisper's precise transcription, enabling fast and efficient summarising.

Size: 44.9 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 20 - Forks: 0

teddylee777/openai-api-kr

A Korean-language tutorial based on the official OpenAI documentation, the Cookbook, and other practical examples. This tutorial shows how to use the Python OpenAI API more easily and effectively.

Language: Jupyter Notebook - Size: 39.2 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 49 - Forks: 24

Justmalhar/open-audio

Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate natural-sounding audio from text. Built with modern web technologies for an intuitive user experience, including customizable voice and speech speed settings, and the ability to download audio files directly.

Language: JavaScript - Size: 318 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 84 - Forks: 31

Gemeri/Discord-Voice-Channel-Bot

A bot that can join voice channels using the OpenAI API and Microsoft's free Text-to-Speech (TTS) services. The bot can transcribe conversations, generate intelligent responses, and communicate verbally within your voice channels.

Language: JavaScript - Size: 60.5 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 2 - Forks: 1

KostasEreksonas/Audio-transcriber

Simple Python audio transcriber using OpenAI's Whisper speech recognition model

Language: Python - Size: 45.9 KB - Last synced at: 27 days ago - Pushed at: about 2 months ago - Stars: 34 - Forks: 9

blusewill/ytvideo-whisper

A Python script that automatically generates subtitles for YouTube videos

Language: Python - Size: 327 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

gorgarp/TwitchTranslate

A "Universal" Translation Program For Twitch Streams

Language: Python - Size: 91.8 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 34 - Forks: 20

Lambdua/openai4j

Java client library for the OpenAI API. Full support for all OpenAI API models including Completions, Chat, Edits, Embeddings, Audio, Files, Assistants-v2, Images, Moderations, Batch, and Fine-tuning.

Language: Java - Size: 1.21 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 390 - Forks: 40

LoneWolfPro/OpenAI-Whisper-STT

Video to Audio Transcription using OpenAI-Whisper

Language: Python - Size: 12.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

BlackLionXD/LectureSummarizer

LectureSummarizer is an AI-powered website that transcribes and summarizes lectures. It uses OpenAI Whisper for accurate speech-to-text and Llama 2 for concise summaries. With an easy-to-use Gradio interface, it helps students capture and review key lecture points efficiently.

Language: Python - Size: 255 KB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

dhruvyad/uttertype

Short code for dictation using OpenAI Whisper for transcription.

Language: Python - Size: 145 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 65 - Forks: 6

alexogeny/cortana

Your own personal assistant thanks to ChatGPT, Whisper, and ElevenLabs TTS

Language: Python - Size: 39.1 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 49 - Forks: 7

supershaneski/openai-whisper

A sample web app using OpenAI Whisper to transcribe audio built on Next.js. It records audio continuously for some time interval then uploads the audio data to the server for transcribing/translating.

Language: JavaScript - Size: 1010 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 170 - Forks: 33

gllmflndn/whisper.m

Automatic speech recognition in MATLAB/Octave (using whisper.cpp and OpenAI's Whisper)

Language: C - Size: 44.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

WebDevCaptain/nlp-review

Reviewing basics of Natural Language Processing

Language: Jupyter Notebook - Size: 8.08 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ahmedbesbes/audiolizr

A BentoML-powered API to transcribe audio and make sense of it

Language: Python - Size: 10.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 39 - Forks: 2

seanivore/docker-transcription

Automated Audio Transcription

Language: Python - Size: 31.5 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

tracywong117/AI-Video-Segment-Cutter

A Python program to cut segments of a video based on specified keywords using OpenAI Whisper.

Language: Python - Size: 9.06 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 2

fralapo/ai_video_title_generator

A powerful Python tool that automates the generation of SEO-optimized titles for social media videos using AI. This tool processes video clips by transcribing their audio content and generating engaging titles with relevant hashtags.

Language: Python - Size: 32.2 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

transcendence12/personal-ai-assistant

This project is about building a personal AI assistant on Telegram using the OpenAI API. The assistant will integrate functionalities like ChatGPT for conversation, DALL·E for image generation, and Whisper for voice recognition. It will also connect to the internet for real-time information retrieval.

Language: TypeScript - Size: 165 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

JoaoVitorLobo/YouTube-Videos-Summarizer

YouTube Video Summarizer Powered by AI. Whisper-1 and GPT-4o-Mini implementation through OpenAI API. Audio download, transcription to text and summarizer.

Size: 2.93 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

zaltinsoy/AutoSubZ (fork of m1guelpf/auto-subtitle)

Automatically generate and overlay subtitles for any video.

Language: Python - Size: 18.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 0

danielrosehill/Thought-Pad

Linux desktop application that provides a two-stage process for creating notes from dictated speech (first stage, transcription via Whisper API; second stage light text formatting). Exports to markdown docs.

Language: Python - Size: 49.4 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

mirabdullahyaser/Summarizing-Youtube-Videos-with-OpenAI-Whisper-and-GPT-3

YouTube video summarization using Whisper audio transcription and GPT-based summaries.

Language: Python - Size: 1.79 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 32 - Forks: 7

legionJP/Caption_Generator

Caption generator for videos: upload a video and it is processed in the background using Python Celery, with Redis as the queue and the OpenAI Whisper model for speech recognition

Language: Python - Size: 2.06 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

EliasVincent/whisper-subtitles-webui

A Gradio interface for making transcribed and translated subtitles for videos

Language: Python - Size: 51.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 34 - Forks: 6

loglux/FlexAudioPrint

FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a script for programmatic use. With FFmpeg for audio conversion, it supports multiple formats like MP3 and WAV. Ideal for transcribing meetings, lectures, and podcasts, with options to save results as text file

Language: Python - Size: 143 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0