An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: openai-whisper

CrimeIsDown/trunk-transcribe

Transcription of calls from trunk-recorder using OpenAI Whisper

Language: Python - Size: 2.64 MB - Last synced at: about 23 hours ago - Pushed at: about 24 hours ago - Stars: 37 - Forks: 3

arifulislamat/whisper-voice-transcription

Voice Transcription | Whisper | STT | Python3.13 | UV

Language: Python - Size: 887 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

moonshine-ai/useful-transformers

Efficient Inference of Transformer models

Language: C++ - Size: 135 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 455 - Forks: 43

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,为媒体内容数据自动化处理提供了强大且可扩展的解决方案。

Language: Python - Size: 1.22 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 407 - Forks: 51

sashabaranov/go-openai

OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go

Language: Go - Size: 784 KB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 10,288 - Forks: 1,655

speaches-ai/speaches

Language: Python - Size: 2.4 MB - Last synced at: 3 days ago - Pushed at: 7 days ago - Stars: 2,340 - Forks: 282

Illyism/openai-whisper-api

OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example

Language: TypeScript - Size: 22.5 KB - Last synced at: about 3 hours ago - Pushed at: almost 2 years ago - Stars: 118 - Forks: 11

nicolodiamante/chatty

Unleash the power of Chatty: the intersection of ChatGPT’s intelligence, DALL·E's creativity, and Whisper's precise audio transcription for your Apple devices with support of 30 languages.

Size: 85.9 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 165 - Forks: 9

m1guelpf/auto-subtitle

Automatically generate and overlay subtitles for any video.

Language: Python - Size: 5.86 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 2,024 - Forks: 331

michaelyuwh/mcp-speech-to-text

🎙️ MCP Speech-to-Text Server with Enhanced Cantonese Support | Offline Vosk + Online Google Cloud | Auto-detection for zh-HK | n8n workflows | Hong Kong optimized 🇭🇰

Language: Python - Size: 175 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

royshil/obs-localvocal

OBS plugin for local speech recognition and captioning using AI

Language: C++ - Size: 70.1 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 869 - Forks: 68

lst97/CantoCap

Tool for generating accurate Cantonese subtitles from audio/video files using OpenAI Whisper with LLM enhancements.

Language: TypeScript - Size: 4.13 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

SU-PER-NOVA/whisper-offline-video-audio-transcriber

"An offline video & audio transcription tool powered by OpenAI Whisper. Convert your tutorials, lectures, and podcasts into accurate text transcripts and use AI to generate summaries, notes, and mind maps — saving hours of time and boosting productivity."

Language: Python - Size: 1.48 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

HemulGM/DelphiOpenAI

OpenAI (and DeepSeek, Azure OpenAI, YandexGPT, Ollama) API wrapper for Delphi. Use ChatGPT, DALL-E, Whisper and other products.

Language: Pascal - Size: 1.09 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 277 - Forks: 76

nishanbajracharya/py-transcribe

A python project to transcribe and generate srt subtitle files for foreign language videos.

Language: Python - Size: 27.3 KB - Last synced at: about 3 hours ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

wryan14/youtube-whisper

Transcribe audio files and YouTube videos to text with timestamps using OpenAI's Whisper API.

Language: Python - Size: 22.5 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

LunaticPrakash/Audio-Query

A full-stack application that allows users to upload audio files (like call recordings, meetings, or lectures) and search for keywords or phrases within them.

Language: JavaScript - Size: 40 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

SamOhrenberg/regulation-database

A searchable database of all Regulation Podcast episodes, automatically transcribed and updated.

Language: JavaScript - Size: 119 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 3 - Forks: 0

SreejanPersonal/openai-unofficial

An completely Free & Unlimited unofficial Python SDK for the OpenAI API, providing seamless integration and easy-to-use methods for interacting with OpenAI's latest powerful AI models, including GPT-4o (Including gpt-4o-audio-preview & gpt-4o-realtime-preview Models), GPT-4, GPT-3.5 Turbo, DALL·E 3, Whisper & Text-to-Speech (TTS) models

Language: Python - Size: 36.1 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 26 - Forks: 15

Keatonkirk55cfc/audio-to-text

🎧 audio-to-text transcribes audio files to text using the Web Speech API in a headless browser via Puppeteer, supporting ffmpeg formats and PulseAudio on Linux.

Language: TypeScript - Size: 42 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

tkarabela/pysubs2

A Python library for editing subtitle files

Language: Python - Size: 2.6 MB - Last synced at: 13 days ago - Pushed at: 7 months ago - Stars: 382 - Forks: 48

imprasukjain/Kyutai-STT

Demo repository for Kyutai Labs' STT-1B model: Real-time speech-to-text transcription with streaming inference, built-in VAD, and Jupyter notebook examples for audio processing and simulation.

Language: Jupyter Notebook - Size: 5.16 MB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 2 - Forks: 0

Aparnamol-KS/Captionify

This project is designed to convert spoken content into real-time captions, including mathematical notation, to enhance the accessibility and learning experience for students

Language: HTML - Size: 206 KB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

absadiki/pywhispercpp

Python bindings for whisper.cpp

Language: Python - Size: 1.52 MB - Last synced at: 9 days ago - Pushed at: 15 days ago - Stars: 282 - Forks: 48

platisd/phonix

Generate captions for videos using the power of OpenAI's Whisper API

Language: Python - Size: 36.1 KB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 46 - Forks: 4

theboringhumane/openai-voices.piper

🔊 PiperGen: Pretrain Piper TTS with OpenAI voices! Capture, convert, and fine-tune models for rich, natural speech synthesis. 🗣✨

Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: about 22 hours ago - Pushed at: 10 months ago - Stars: 5 - Forks: 0

AliasUruz/whisper-flash-transcriber

Instant offline audio transcription using OpenAI's Whisper AI at the power of your fingers! I recommend the "stable" branch.

Language: Python - Size: 3.96 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 2 - Forks: 0

RayanAIX/Speech-to-Text-Translator

Language: Jupyter Notebook - Size: 18 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

Hrishi11572/Jarvis

A bare bones, AI Voice assistant which is capable of answering your queries in device, without internet

Language: Python - Size: 11.7 KB - Last synced at: about 10 hours ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

FlyingFathead/whisper-transcriber-telegram-bot

Python-based Telegram transcriber bot utilizing local Whisper models & yt-dlp

Language: Python - Size: 7.49 MB - Last synced at: 7 days ago - Pushed at: 23 days ago - Stars: 42 - Forks: 11

ProgrammerGnome/lingua

Speech translator primarily for language learning. Languages: English <-> Hungarian

Language: Python - Size: 14.6 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

Fingolfin7/SpeechPracticeApp

Track and improve your speech clarity with real-time, data-driven feedback perfect for fixing mumbling and unclear articulation.

Language: Python - Size: 218 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

aramb-dev/transcriptr

Transcriptr is a modern web application that converts audio files to text using artificial intelligence. It provides a clean, intuitive interface for uploading audio files and receiving high-quality transcriptions powered by Replicate's Incredibly Fast Whisper model.

Language: TypeScript - Size: 16.4 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 4 - Forks: 0

mohamadahmadidev/whisper-sentence-aligner

A web-based tool that uses OpenAI's Whisper API to find the precise start and end timestamps of sentences from a JSON file within a corresponding WAV audio file.

Language: HTML - Size: 7.81 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

AvenCores/Goida-AI-Unlocker

🛡 Установщик разблокировщика зарубежных AI-сервисов (и не только) для России на Windows 10/11 🌍

Language: Python - Size: 130 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 14 - Forks: 1

Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Language: Python - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,075 - Forks: 101

RijoSLal/Quizzy

Quizzy is an AI interviewer that creates hyper-personalized interview simulation using a RAG-based system for dynamic conversations. It analyzes emotions, perception, posture, and responses, ensuring a natural flow. With job opening scraping and an embedding-based ATS score checker, Quizzy prepares you for the job market. Built with MLOps in Django

Language: CSS - Size: 5.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

royceschultz/ComfyUI-TranscriptionTools

ComfyUI nodes for transcription on audio or video input.

Language: Python - Size: 317 KB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 22 - Forks: 4

av1kav/concierge

(Under development) Generative AI tool that analyzes meeting recordings to extract key insights, track action items and generate context-aware, audience-specific summaries. Streamline collaboration and reduce unnecessary admin work with Concierge today!

Language: Jupyter Notebook - Size: 78 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

sudhakar-r08/sudhakar-r08.github.io

I'm a Senior Android Developer with over 10 years of experience building high-performance, scalable mobile and backend solutions. I specialize in Kotlin, Jetpack Compose, MVVM, Clean Architecture, and have hands-on experience with modern Android frameworks, IoT integrations, BLE, VoIP, and real-time communication using WebRTC and XMPP.

Language: HTML - Size: 692 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

c12i/bunge-bits

Bunge Bits provides convenient summaries of Kenyan National Assembly and Senate sittings, making legislative information more accessible and digestible.

Language: Rust - Size: 3.08 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 112 - Forks: 16

derogab/yt-transcript

A Dockerized Telegram bot that downloads YouTube videos as MP3 audio and transcribes them using OpenAI Whisper

Language: Python - Size: 1.11 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 1

kaurodri/TikText

📊 Transcreva Vídeos do TikTok! - OpenAI Whisper | Streamlit | Python

Language: Python - Size: 71.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 1

einToast/openai_stt_ha

OpenAI Whisper in Home Assistant via the OpenAI API for use in the Assist pipeline

Language: Python - Size: 35.2 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 17 - Forks: 6

Vibhuarvind/BART-tomatic

AI-powered video transcription and summarization using Hugging Face’s BART and OpenAI Whisper. Fast, accurate, and organized outputs.

Size: 13.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Stage-Whisper/Stage-Whisper

The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.

Language: TypeScript - Size: 3.4 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 257 - Forks: 28

Nikorasu/LiveWhisper

A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

Language: Python - Size: 24.4 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 350 - Forks: 48

rinndayy/Goida-AI-Unlocker

Unlock AI capabilities with Goida-AI-Unlocker. Access advanced features easily and enhance your projects. Join our community on GitHub! 🚀💻

Language: Python - Size: 43.9 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

nikdanilov/whisper-obsidian-plugin

Speech-to-text in Obsidian using OpenAI Whisper

Language: TypeScript - Size: 236 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 297 - Forks: 52

RiteshGenAI/openai_whisper_transcribe_yt_videos

This project is a Streamlit-based application that allows users to download audio from YouTube videos, transcribe them using OpenAI's Whisper model, and display the transcriptions with pagination.

Language: Python - Size: 68.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

ChaituRajSagar/video_to_narrative

Flask-based AI app that summarizes surveillance videos using Whisper (audio), ViT-GPT2 (frame captions), and Groq LLM (narratives). Produces both general and law enforcement-style summaries.

Language: Python - Size: 1.95 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Malith-Rukshan/whisper-transcriber-bot

🎙️ AI-powered Telegram bot for voice-to-text transcription using OpenAI Whisper. CPU-only, no GPU required, privacy-focused with local processing.

Language: Python - Size: 4.36 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

Lambdua/openai4j

Java client library for OpenAI API.Full support for all OpenAI API models including Completions, Chat, Edits, Embeddings, Audio, Files, Assistants-v2, Images, Moderations, Batch, and Fine-tuning.

Language: Java - Size: 1.17 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 403 - Forks: 44

nicolodiamante/notefy

Streamline your note-taking with ChatGPT's AI expertise and Whisper's precise transcription, enabling fast and efficient summarising.

Size: 44.9 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 0

olekli/DrDictaphone

Dictation app for the terminal and Neovim, using Whisper for transcription and ChatGPT for post-processing.

Language: Python - Size: 2.17 MB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 7 - Forks: 1

rakib-0/SubtitleGenerator

SubtitleGenerator is an interactive tool that automatically generates and translates subtitles for your videos using AI. It supports multiple languages and formats, making it easy to enhance your video content. 🛠️✨

Language: Python - Size: 27.3 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 1

danielrosehill/Whisper-Notepad-Simple

A Linux desktop utility for converting speech to text using the OpenAI Whisper API

Language: Python - Size: 1.24 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Frostbrewn/ai-inclusion-facilitator

Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

MaxineXiong/OpenAI-API-Web-Apps

This repository hosts a collection of custom web applications powered by OpenAI's GPT models (incl. o1, o3-mini, GPT-4.5, GPT-4o, and GPT-4o mini), Whisper model, and TTS model. These apps include an interactive chatbot ("Talk to GPT") for text or voice communication, and a coding assistant ("CodeMaxGPT") that supports various coding tasks.

Language: Python - Size: 101 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 43 - Forks: 19

PsychoKill95/whisper-web

Whisper Web is a real-time transcriber that converts speech to text efficiently. 🐙 Explore the project to learn about installation, deployment, and troubleshooting steps. 🌐

Language: Python - Size: 6.03 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

didmar/whisper-api-server

Drop-in replacement for the OpenAI's Whisper API using the same API but running locally

Language: Python - Size: 282 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 14 - Forks: 3

ckaytev/tgisper

Telegram bot with ASR

Language: Python - Size: 125 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 22 - Forks: 3

ZakirCodeArchitect/Sonic-Lipsync-AI

A Google Colab-based Gradio app for generating lip-synced videos using the Sonic model. It supports audio-to-video syncing with Hugging Face models and runs entirely in the cloud—no local setup needed.

Language: Python - Size: 8.44 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

sakhileln/smart-meeting-companion

An AI-powered application that helps users extract insights from their meetings. 🚀

Language: Dart - Size: 97.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 1

I5UCC/VRCTextboxSTT

A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that information to VRChats textbox system and/or KillFrenzyAvatarText over OSC. Also supports various other methods like OBS via Browsersource and a SteamVR overlay!

Language: Python - Size: 193 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 64 - Forks: 3

Sh1nr1/mai-ai-assistant-self-hosted

Mai is an emotionally intelligent, voice-enabled AI assistant built with FastAPI, Together.ai LLMs, memory persistence via ChromaDB, and real-time sentiment analysis. Designed to feel alive, empathetic, and human-like, Mai blends the charm of a flirty cyberpunk companion with the power of modern multimodal AI.

Language: Python - Size: 59.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ahmetoner/whisper-asr-webservice

OpenAI Whisper ASR Webservice API

Language: Python - Size: 1.76 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,693 - Forks: 480

sanastasiou/dictation-service

GPU-accelerated speech-to-text service that types what you say, powered by OpenAI's Whisper AI

Language: Shell - Size: 74.2 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

sujeethshingade/speech-to-text

Chatbot with speech-to-text capabilities using OpenAI Whisper models

Language: HTML - Size: 129 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

tech-aakash/AI-enabled-speech-based-clinical-notes-drafter

VoiceRX is an AI-powered clinical documentation platform that streamlines medical note generation from doctor-patient voice interactions.

Language: Python - Size: 54.8 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

maciekt07/Lecture-Note-Generator-POC

📒 A proof-of-concept app that transcribes lecture recordings into text and generates markdown academic notes using a local LLM

Language: TypeScript - Size: 23.3 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

kavya429/tupac-almighty

Personal Telegram bot, **Tupac Almighty**, runs on Raspberry Pi with Mac support for heavy tasks. Efficient AI chats and voice-to-text features. 🚀🐙

Language: Python - Size: 94.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ofir5300/tupac-almighty

A Telegram bot with a hybrid architecture. Its core runs on a Raspberry Pi for 24/7 reliability, offloading heavy LLM and audio processing to a local Mac via a mac-as-a-server connection for personal assistance.

Language: Python - Size: 98.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Eyevinn/auto-subtitles

Automatically generate subtitles from an input audio or video file using OpenAI Whisper

Language: TypeScript - Size: 376 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 46 - Forks: 7

TheSeraphim/scribe-forge-ai

🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Markdown), and support for 20+ audio formats. No external APIs required - works entirely offline.

Language: Python - Size: 2.32 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Garbii1/AI-Meeting-Companion-STT

🗣️ An AI-powered meeting companion that transcribes audio with OpenAI Whisper and generates summaries using IBM WatsonX (Llama 3) via a Gradio interface.

Language: Python - Size: 176 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

kevinkoech357/transkript

A flask built web app that leverages the power of OpenAI's whisper model to transcribe audio and video files. Has support for various file formats. Generates timestamped .srt files.

Language: HTML - Size: 5.73 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

NeiKa0s496/Whisper-Converter

Conversor de vídeo (utilizando un link de Youtube) a audio y posteriormente a texto utilizando openai-whisper

Language: Jupyter Notebook - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

BlackLionXD/LectureSummarizer

LectureSummarizer is an AI-powered website that transcribes and summarizes lectures. It uses OpenAI Whisper for accurate speech-to-text and Llama 2 for concise summaries. With an easy-to-use Gradio interface, it helps students capture and review key lecture points efficiently.

Language: Python - Size: 255 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

Sonupandit9693/meeting-ai-assistant

A collaborative platform that records, transcribes, summarizes meetings and auto-generates action items.

Language: TypeScript - Size: 122 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

liddiard/harmontown-search

Search all transcripts from the Harmontown podcast. Transcription powered by OpenAI's Whisper model. Search powered by Typesense.

Language: TypeScript - Size: 31 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 13 - Forks: 2

AmeliaES/speech-to-text

Python Flask + OpenAI whisper App for converting recorded speech to text

Language: JavaScript - Size: 177 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

JoshuaMart/AutoCaptions

Microservices-based solution for automatic video captioning, designed for 9:16 format videos (shorts). Generate AI-powered transcriptions and create styled captions with FFmpeg or Remotion.

Language: PHP - Size: 42.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

bellingcat/whisperbox-transcribe

Easy to deploy API for transcribing and translating audio / video using OpenAI's whisper model.

Language: Python - Size: 149 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 69 - Forks: 9

EmilHerzberg/AI-Dairy-Web-App

A full-stack web application that allows users to record audio diary entries, automatically transcribe them using OpenAI’s Whisper model, and conveniently review them through a calendar interface.

Language: JavaScript - Size: 6.46 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

thc1006/whisper-colab-tpu-transcriber

High-performance Google Colab Notebook for fast & accurate audio transcription/translation using OpenAI Whisper. Accelerated on TPUs with PyTorch/XLA. Features an interactive UI for model selection, multi-language support, and long-form audio processing.

Language: Jupyter Notebook - Size: 377 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

loglux/FlexAudioPrint

FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a script for programmatic use. With FFmpeg for audio conversion, it supports multiple formats like MP3 and WAV. Ideal for transcribing meetings, lectures, and podcasts, with options to save results as text file

Language: Python - Size: 167 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

gabrielcaiana/translate-ai

Uma aplicação Node.js que gera legendas de vídeos utilizando AI.

Language: JavaScript - Size: 59.6 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Tr1nside/obsidian-telegram-bot

Telegram-бот для создания, управления и хранения заметок в формате Markdown, интегрированный с локальной папкой заметок.

Language: Python - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

l1ght14/emotion_detector

Python/Streamlit app for multimodal emotion (text, audio, video) & sentiment analysis using HuggingFace, Whisper & DeepFace. Detects happy, sad, angry, etc. with confidence scores.

Language: Python - Size: 32.2 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

savbell/whisper-writer

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

Language: Python - Size: 905 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 815 - Forks: 111

davidkiss/openai-twilio-phonebot-demo

This is a simple demo on how to use the OpenAI Realtime API with to create a phone bot that can accept incoming phone calls and use OpenAI to generate responses in real-time.

Language: TypeScript - Size: 20.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Grt1228/chatgpt-java

ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java

Language: Java - Size: 455 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 3,446 - Forks: 823

m1guelpf/yt-whisper

Using OpenAI's Whisper to automatically generate YouTube subtitles

Language: Python - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1,402 - Forks: 144

braulio-dev/VoxLens

Voice transcription and summarization tool

Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Carleslc/AudioToText

Transcribe and translate audio to text using Whisper and DeepL.

Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 332 - Forks: 45

seanivore/docker-transcription

Automated Audio Transcription

Language: Python - Size: 32.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 1

mike-rambil/jarvis-ai

Fun little project to learn more into OpenAI Whisper, LlamaIndex & Hugging Face Transformers

Language: Python - Size: 6.34 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

jaredescott/youtube_transcriber

A Python tool that generates high-accuracy transcripts from any YouTube video with audio. Works with songs, podcasts, lectures, and interviews, featuring GPU acceleration and intelligent formatting.

Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

kallebysantos/upload.ai

AI powered video interaction

Language: TypeScript - Size: 9.55 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0