GitHub topics: openai-whisper
CrimeIsDown/trunk-transcribe
Transcription of calls from trunk-recorder using OpenAI Whisper
Language: Python - Size: 2.64 MB - Last synced at: about 23 hours ago - Pushed at: about 24 hours ago - Stars: 37 - Forks: 3

arifulislamat/whisper-voice-transcription
Voice Transcription | Whisper | STT | Python3.13 | UV
Language: Python - Size: 887 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

moonshine-ai/useful-transformers
Efficient Inference of Transformer models
Language: C++ - Size: 135 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 455 - Forks: 43

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API
⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,为媒体内容数据自动化处理提供了强大且可扩展的解决方案。
Language: Python - Size: 1.22 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 407 - Forks: 51

sashabaranov/go-openai
OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go
Language: Go - Size: 784 KB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 10,288 - Forks: 1,655

speaches-ai/speaches
Language: Python - Size: 2.4 MB - Last synced at: 3 days ago - Pushed at: 7 days ago - Stars: 2,340 - Forks: 282

Illyism/openai-whisper-api
OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example
Language: TypeScript - Size: 22.5 KB - Last synced at: about 3 hours ago - Pushed at: almost 2 years ago - Stars: 118 - Forks: 11

nicolodiamante/chatty
Unleash the power of Chatty: the intersection of ChatGPT’s intelligence, DALL·E's creativity, and Whisper's precise audio transcription for your Apple devices with support of 30 languages.
Size: 85.9 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 165 - Forks: 9

m1guelpf/auto-subtitle
Automatically generate and overlay subtitles for any video.
Language: Python - Size: 5.86 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 2,024 - Forks: 331

michaelyuwh/mcp-speech-to-text
🎙️ MCP Speech-to-Text Server with Enhanced Cantonese Support | Offline Vosk + Online Google Cloud | Auto-detection for zh-HK | n8n workflows | Hong Kong optimized 🇭🇰
Language: Python - Size: 175 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

royshil/obs-localvocal
OBS plugin for local speech recognition and captioning using AI
Language: C++ - Size: 70.1 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 869 - Forks: 68

lst97/CantoCap
Tool for generating accurate Cantonese subtitles from audio/video files using OpenAI Whisper with LLM enhancements.
Language: TypeScript - Size: 4.13 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

SU-PER-NOVA/whisper-offline-video-audio-transcriber
"An offline video & audio transcription tool powered by OpenAI Whisper. Convert your tutorials, lectures, and podcasts into accurate text transcripts and use AI to generate summaries, notes, and mind maps — saving hours of time and boosting productivity."
Language: Python - Size: 1.48 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

HemulGM/DelphiOpenAI
OpenAI (and DeepSeek, Azure OpenAI, YandexGPT, Ollama) API wrapper for Delphi. Use ChatGPT, DALL-E, Whisper and other products.
Language: Pascal - Size: 1.09 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 277 - Forks: 76

nishanbajracharya/py-transcribe
A python project to transcribe and generate srt subtitle files for foreign language videos.
Language: Python - Size: 27.3 KB - Last synced at: about 3 hours ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

wryan14/youtube-whisper
Transcribe audio files and YouTube videos to text with timestamps using OpenAI's Whisper API.
Language: Python - Size: 22.5 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

LunaticPrakash/Audio-Query
A full-stack application that allows users to upload audio files (like call recordings, meetings, or lectures) and search for keywords or phrases within them.
Language: JavaScript - Size: 40 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

SamOhrenberg/regulation-database
A searchable database of all Regulation Podcast episodes, automatically transcribed and updated.
Language: JavaScript - Size: 119 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 3 - Forks: 0

SreejanPersonal/openai-unofficial
An completely Free & Unlimited unofficial Python SDK for the OpenAI API, providing seamless integration and easy-to-use methods for interacting with OpenAI's latest powerful AI models, including GPT-4o (Including gpt-4o-audio-preview & gpt-4o-realtime-preview Models), GPT-4, GPT-3.5 Turbo, DALL·E 3, Whisper & Text-to-Speech (TTS) models
Language: Python - Size: 36.1 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 26 - Forks: 15

Keatonkirk55cfc/audio-to-text
🎧 audio-to-text transcribes audio files to text using the Web Speech API in a headless browser via Puppeteer, supporting ffmpeg formats and PulseAudio on Linux.
Language: TypeScript - Size: 42 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

tkarabela/pysubs2
A Python library for editing subtitle files
Language: Python - Size: 2.6 MB - Last synced at: 13 days ago - Pushed at: 7 months ago - Stars: 382 - Forks: 48

imprasukjain/Kyutai-STT
Demo repository for Kyutai Labs' STT-1B model: Real-time speech-to-text transcription with streaming inference, built-in VAD, and Jupyter notebook examples for audio processing and simulation.
Language: Jupyter Notebook - Size: 5.16 MB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 2 - Forks: 0

Aparnamol-KS/Captionify
This project is designed to convert spoken content into real-time captions, including mathematical notation, to enhance the accessibility and learning experience for students
Language: HTML - Size: 206 KB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

absadiki/pywhispercpp
Python bindings for whisper.cpp
Language: Python - Size: 1.52 MB - Last synced at: 9 days ago - Pushed at: 15 days ago - Stars: 282 - Forks: 48

platisd/phonix
Generate captions for videos using the power of OpenAI's Whisper API
Language: Python - Size: 36.1 KB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 46 - Forks: 4

theboringhumane/openai-voices.piper
🔊 PiperGen: Pretrain Piper TTS with OpenAI voices! Capture, convert, and fine-tune models for rich, natural speech synthesis. 🗣✨
Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: about 22 hours ago - Pushed at: 10 months ago - Stars: 5 - Forks: 0

AliasUruz/whisper-flash-transcriber
Instant offline audio transcription using OpenAI's Whisper AI at the power of your fingers! I recommend the "stable" branch.
Language: Python - Size: 3.96 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 2 - Forks: 0

RayanAIX/Speech-to-Text-Translator
Language: Jupyter Notebook - Size: 18 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

Hrishi11572/Jarvis
A bare bones, AI Voice assistant which is capable of answering your queries in device, without internet
Language: Python - Size: 11.7 KB - Last synced at: about 10 hours ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

FlyingFathead/whisper-transcriber-telegram-bot
Python-based Telegram transcriber bot utilizing local Whisper models & yt-dlp
Language: Python - Size: 7.49 MB - Last synced at: 7 days ago - Pushed at: 23 days ago - Stars: 42 - Forks: 11

ProgrammerGnome/lingua
Speech translator primarily for language learning. Languages: English <-> Hungarian
Language: Python - Size: 14.6 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

Fingolfin7/SpeechPracticeApp
Track and improve your speech clarity with real-time, data-driven feedback perfect for fixing mumbling and unclear articulation.
Language: Python - Size: 218 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

aramb-dev/transcriptr
Transcriptr is a modern web application that converts audio files to text using artificial intelligence. It provides a clean, intuitive interface for uploading audio files and receiving high-quality transcriptions powered by Replicate's Incredibly Fast Whisper model.
Language: TypeScript - Size: 16.4 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 4 - Forks: 0

mohamadahmadidev/whisper-sentence-aligner
A web-based tool that uses OpenAI's Whisper API to find the precise start and end timestamps of sentences from a JSON file within a corresponding WAV audio file.
Language: HTML - Size: 7.81 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

AvenCores/Goida-AI-Unlocker
🛡 Установщик разблокировщика зарубежных AI-сервисов (и не только) для России на Windows 10/11 🌍
Language: Python - Size: 130 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 14 - Forks: 1

Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
Language: Python - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,075 - Forks: 101

RijoSLal/Quizzy
Quizzy is an AI interviewer that creates hyper-personalized interview simulation using a RAG-based system for dynamic conversations. It analyzes emotions, perception, posture, and responses, ensuring a natural flow. With job opening scraping and an embedding-based ATS score checker, Quizzy prepares you for the job market. Built with MLOps in Django
Language: CSS - Size: 5.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

royceschultz/ComfyUI-TranscriptionTools
ComfyUI nodes for transcription on audio or video input.
Language: Python - Size: 317 KB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 22 - Forks: 4

av1kav/concierge
(Under development) Generative AI tool that analyzes meeting recordings to extract key insights, track action items and generate context-aware, audience-specific summaries. Streamline collaboration and reduce unnecessary admin work with Concierge today!
Language: Jupyter Notebook - Size: 78 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

sudhakar-r08/sudhakar-r08.github.io
I'm a Senior Android Developer with over 10 years of experience building high-performance, scalable mobile and backend solutions. I specialize in Kotlin, Jetpack Compose, MVVM, Clean Architecture, and have hands-on experience with modern Android frameworks, IoT integrations, BLE, VoIP, and real-time communication using WebRTC and XMPP.
Language: HTML - Size: 692 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

c12i/bunge-bits
Bunge Bits provides convenient summaries of Kenyan National Assembly and Senate sittings, making legislative information more accessible and digestible.
Language: Rust - Size: 3.08 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 112 - Forks: 16

derogab/yt-transcript
A Dockerized Telegram bot that downloads YouTube videos as MP3 audio and transcribes them using OpenAI Whisper
Language: Python - Size: 1.11 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 1

kaurodri/TikText
📊 Transcreva Vídeos do TikTok! - OpenAI Whisper | Streamlit | Python
Language: Python - Size: 71.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 1

einToast/openai_stt_ha
OpenAI Whisper in Home Assistant via the OpenAI API for use in the Assist pipeline
Language: Python - Size: 35.2 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 17 - Forks: 6

Vibhuarvind/BART-tomatic
AI-powered video transcription and summarization using Hugging Face’s BART and OpenAI Whisper. Fast, accurate, and organized outputs.
Size: 13.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Stage-Whisper/Stage-Whisper
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
Language: TypeScript - Size: 3.4 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 257 - Forks: 28

Nikorasu/LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Language: Python - Size: 24.4 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 350 - Forks: 48

rinndayy/Goida-AI-Unlocker
Unlock AI capabilities with Goida-AI-Unlocker. Access advanced features easily and enhance your projects. Join our community on GitHub! 🚀💻
Language: Python - Size: 43.9 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

nikdanilov/whisper-obsidian-plugin
Speech-to-text in Obsidian using OpenAI Whisper
Language: TypeScript - Size: 236 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 297 - Forks: 52

RiteshGenAI/openai_whisper_transcribe_yt_videos
This project is a Streamlit-based application that allows users to download audio from YouTube videos, transcribe them using OpenAI's Whisper model, and display the transcriptions with pagination.
Language: Python - Size: 68.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

ChaituRajSagar/video_to_narrative
Flask-based AI app that summarizes surveillance videos using Whisper (audio), ViT-GPT2 (frame captions), and Groq LLM (narratives). Produces both general and law enforcement-style summaries.
Language: Python - Size: 1.95 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Malith-Rukshan/whisper-transcriber-bot
🎙️ AI-powered Telegram bot for voice-to-text transcription using OpenAI Whisper. CPU-only, no GPU required, privacy-focused with local processing.
Language: Python - Size: 4.36 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

Lambdua/openai4j
Java client library for OpenAI API.Full support for all OpenAI API models including Completions, Chat, Edits, Embeddings, Audio, Files, Assistants-v2, Images, Moderations, Batch, and Fine-tuning.
Language: Java - Size: 1.17 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 403 - Forks: 44

nicolodiamante/notefy
Streamline your note-taking with ChatGPT's AI expertise and Whisper's precise transcription, enabling fast and efficient summarising.
Size: 44.9 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 0

olekli/DrDictaphone
Dictation app for the terminal and Neovim, using Whisper for transcription and ChatGPT for post-processing.
Language: Python - Size: 2.17 MB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 7 - Forks: 1

rakib-0/SubtitleGenerator
SubtitleGenerator is an interactive tool that automatically generates and translates subtitles for your videos using AI. It supports multiple languages and formats, making it easy to enhance your video content. 🛠️✨
Language: Python - Size: 27.3 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 1

danielrosehill/Whisper-Notepad-Simple
A Linux desktop utility for converting speech to text using the OpenAI Whisper API
Language: Python - Size: 1.24 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Frostbrewn/ai-inclusion-facilitator
Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

MaxineXiong/OpenAI-API-Web-Apps
This repository hosts a collection of custom web applications powered by OpenAI's GPT models (incl. o1, o3-mini, GPT-4.5, GPT-4o, and GPT-4o mini), Whisper model, and TTS model. These apps include an interactive chatbot ("Talk to GPT") for text or voice communication, and a coding assistant ("CodeMaxGPT") that supports various coding tasks.
Language: Python - Size: 101 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 43 - Forks: 19

PsychoKill95/whisper-web
Whisper Web is a real-time transcriber that converts speech to text efficiently. 🐙 Explore the project to learn about installation, deployment, and troubleshooting steps. 🌐
Language: Python - Size: 6.03 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

didmar/whisper-api-server
Drop-in replacement for the OpenAI's Whisper API using the same API but running locally
Language: Python - Size: 282 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 14 - Forks: 3

ckaytev/tgisper
Telegram bot with ASR
Language: Python - Size: 125 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 22 - Forks: 3

ZakirCodeArchitect/Sonic-Lipsync-AI
A Google Colab-based Gradio app for generating lip-synced videos using the Sonic model. It supports audio-to-video syncing with Hugging Face models and runs entirely in the cloud—no local setup needed.
Language: Python - Size: 8.44 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

sakhileln/smart-meeting-companion
An AI-powered application that helps users extract insights from their meetings. 🚀
Language: Dart - Size: 97.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 1

I5UCC/VRCTextboxSTT
A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that information to VRChats textbox system and/or KillFrenzyAvatarText over OSC. Also supports various other methods like OBS via Browsersource and a SteamVR overlay!
Language: Python - Size: 193 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 64 - Forks: 3

Sh1nr1/mai-ai-assistant-self-hosted
Mai is an emotionally intelligent, voice-enabled AI assistant built with FastAPI, Together.ai LLMs, memory persistence via ChromaDB, and real-time sentiment analysis. Designed to feel alive, empathetic, and human-like, Mai blends the charm of a flirty cyberpunk companion with the power of modern multimodal AI.
Language: Python - Size: 59.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
Language: Python - Size: 1.76 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,693 - Forks: 480

sanastasiou/dictation-service
GPU-accelerated speech-to-text service that types what you say, powered by OpenAI's Whisper AI
Language: Shell - Size: 74.2 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

sujeethshingade/speech-to-text
Chatbot with speech-to-text capabilities using OpenAI Whisper models
Language: HTML - Size: 129 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

tech-aakash/AI-enabled-speech-based-clinical-notes-drafter
VoiceRX is an AI-powered clinical documentation platform that streamlines medical note generation from doctor-patient voice interactions.
Language: Python - Size: 54.8 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

maciekt07/Lecture-Note-Generator-POC
📒 A proof-of-concept app that transcribes lecture recordings into text and generates markdown academic notes using a local LLM
Language: TypeScript - Size: 23.3 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

kavya429/tupac-almighty
Personal Telegram bot, **Tupac Almighty**, runs on Raspberry Pi with Mac support for heavy tasks. Efficient AI chats and voice-to-text features. 🚀🐙
Language: Python - Size: 94.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ofir5300/tupac-almighty
A Telegram bot with a hybrid architecture. Its core runs on a Raspberry Pi for 24/7 reliability, offloading heavy LLM and audio processing to a local Mac via a mac-as-a-server connection for personal assistance.
Language: Python - Size: 98.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Eyevinn/auto-subtitles
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
Language: TypeScript - Size: 376 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 46 - Forks: 7

TheSeraphim/scribe-forge-ai
🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Markdown), and support for 20+ audio formats. No external APIs required - works entirely offline.
Language: Python - Size: 2.32 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Garbii1/AI-Meeting-Companion-STT
🗣️ An AI-powered meeting companion that transcribes audio with OpenAI Whisper and generates summaries using IBM WatsonX (Llama 3) via a Gradio interface.
Language: Python - Size: 176 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

kevinkoech357/transkript
A flask built web app that leverages the power of OpenAI's whisper model to transcribe audio and video files. Has support for various file formats. Generates timestamped .srt files.
Language: HTML - Size: 5.73 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

NeiKa0s496/Whisper-Converter
Conversor de vídeo (utilizando un link de Youtube) a audio y posteriormente a texto utilizando openai-whisper
Language: Jupyter Notebook - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

BlackLionXD/LectureSummarizer
LectureSummarizer is an AI-powered website that transcribes and summarizes lectures. It uses OpenAI Whisper for accurate speech-to-text and Llama 2 for concise summaries. With an easy-to-use Gradio interface, it helps students capture and review key lecture points efficiently.
Language: Python - Size: 255 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

Sonupandit9693/meeting-ai-assistant
A collaborative platform that records, transcribes, summarizes meetings and auto-generates action items.
Language: TypeScript - Size: 122 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

liddiard/harmontown-search
Search all transcripts from the Harmontown podcast. Transcription powered by OpenAI's Whisper model. Search powered by Typesense.
Language: TypeScript - Size: 31 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 13 - Forks: 2

AmeliaES/speech-to-text
Python Flask + OpenAI whisper App for converting recorded speech to text
Language: JavaScript - Size: 177 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

JoshuaMart/AutoCaptions
Microservices-based solution for automatic video captioning, designed for 9:16 format videos (shorts). Generate AI-powered transcriptions and create styled captions with FFmpeg or Remotion.
Language: PHP - Size: 42.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

bellingcat/whisperbox-transcribe
Easy to deploy API for transcribing and translating audio / video using OpenAI's whisper model.
Language: Python - Size: 149 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 69 - Forks: 9

EmilHerzberg/AI-Dairy-Web-App
A full-stack web application that allows users to record audio diary entries, automatically transcribe them using OpenAI’s Whisper model, and conveniently review them through a calendar interface.
Language: JavaScript - Size: 6.46 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

thc1006/whisper-colab-tpu-transcriber
High-performance Google Colab Notebook for fast & accurate audio transcription/translation using OpenAI Whisper. Accelerated on TPUs with PyTorch/XLA. Features an interactive UI for model selection, multi-language support, and long-form audio processing.
Language: Jupyter Notebook - Size: 377 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

loglux/FlexAudioPrint
FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a script for programmatic use. With FFmpeg for audio conversion, it supports multiple formats like MP3 and WAV. Ideal for transcribing meetings, lectures, and podcasts, with options to save results as text file
Language: Python - Size: 167 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

gabrielcaiana/translate-ai
Uma aplicação Node.js que gera legendas de vídeos utilizando AI.
Language: JavaScript - Size: 59.6 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Tr1nside/obsidian-telegram-bot
Telegram-бот для создания, управления и хранения заметок в формате Markdown, интегрированный с локальной папкой заметок.
Language: Python - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

l1ght14/emotion_detector
Python/Streamlit app for multimodal emotion (text, audio, video) & sentiment analysis using HuggingFace, Whisper & DeepFace. Detects happy, sad, angry, etc. with confidence scores.
Language: Python - Size: 32.2 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

savbell/whisper-writer
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
Language: Python - Size: 905 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 815 - Forks: 111

davidkiss/openai-twilio-phonebot-demo
This is a simple demo on how to use the OpenAI Realtime API with to create a phone bot that can accept incoming phone calls and use OpenAI to generate responses in real-time.
Language: TypeScript - Size: 20.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Grt1228/chatgpt-java
ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java
Language: Java - Size: 455 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 3,446 - Forks: 823

m1guelpf/yt-whisper
Using OpenAI's Whisper to automatically generate YouTube subtitles
Language: Python - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1,402 - Forks: 144

braulio-dev/VoxLens
Voice transcription and summarization tool
Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Carleslc/AudioToText
Transcribe and translate audio to text using Whisper and DeepL.
Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 332 - Forks: 45

seanivore/docker-transcription
Automated Audio Transcription
Language: Python - Size: 32.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 1

mike-rambil/jarvis-ai
Fun little project to learn more into OpenAI Whisper, LlamaIndex & Hugging Face Transformers
Language: Python - Size: 6.34 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

jaredescott/youtube_transcriber
A Python tool that generates high-accuracy transcripts from any YouTube video with audio. Works with songs, podcasts, lectures, and interviews, featuring GPU acceleration and intelligent formatting.
Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

kallebysantos/upload.ai
AI powered video interaction
Language: TypeScript - Size: 9.55 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0
