GitHub topics: whisper-api
Evil0ctal/Fast-Powerful-Whisper-AI-Services-API
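⚡ A high-performance asynchronous API for automatic speech recognition (ASR) and translation. No need to purchase the Whisper API: it runs inference with locally hosted Whisper models, supports multi-GPU concurrency, and is designed for distributed deployment. It also includes built-in crawlers for social media platforms such as TikTok and Douyin, enabling seamless media processing across multiple social platforms and providing a powerful, scalable solution for automated processing of media content data.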
Language: Python - Size: 1.21 MB - Last synced at: about 1 hour ago - Pushed at: about 2 months ago - Stars: 359 - Forks: 39

ionic-bond/stream-translator-gpt Fork of fortypercnt/stream-translator
A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.
Language: Python - Size: 21.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 182 - Forks: 25

waltervanheuven/speech2text
Speech2Text
Language: Python - Size: 59.6 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

connor-pickett/tend_t
used for CASCogLab
Language: Python - Size: 4.88 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Language: Python - Size: 592 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6,463 - Forks: 524

FlyingFathead/TelegramBot-OpenAI-API
A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API
Language: Python - Size: 731 KB - Last synced at: 7 days ago - Pushed at: 14 days ago - Stars: 23 - Forks: 5

Carleslc/AudioToText
Transcribe and translate audio to text using Whisper and DeepL.
Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 323 - Forks: 44

codeonthespectrum/AISubs
A subtitle generator for videos up to 10GB, automatically transcribing and translating spoken content into Brazilian Portuguese. Ideal for multilingual content, this tool creates accurate `.srt` files for seamless integration with video players.
Language: Python - Size: 6.84 KB - Last synced at: 12 days ago - Pushed at: 20 days ago - Stars: 3 - Forks: 0

Clats97/ClatScribe
ClatScribe is a speech-to-text tool that captures real-time audio, transcribes it in 3-second chunks via the OpenAI API, and timestamps and logs the text. Each 3-second audio file is deleted after the next one is written, so the tool does not take up much disk space. It has a CLI and a GUI.
Language: Python - Size: 16.6 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

mallorbc/whisper_mic
Project that allows one to use a microphone with OpenAI whisper.
Language: Python - Size: 54.7 KB - Last synced at: 12 days ago - Pushed at: 10 months ago - Stars: 762 - Forks: 167

GURPREETKAURJETHRA/Youtube-Video-Transcribe-Summarizer-LLM-App
YouTube video summarization app built with open-source LLMs and frameworks such as Llama 2, Haystack, Whisper, and Streamlit. The app runs smoothly on CPU because the Llama 2 model is in GGUF format and loaded through llama.cpp.
Language: Python - Size: 3.77 MB - Last synced at: 14 days ago - Pushed at: 12 months ago - Stars: 10 - Forks: 7

mouredev/tggenerator
AI-powered eSports logo generator (for academic purposes during the Tenerife GG event)
Language: Kotlin - Size: 31.4 MB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 202 - Forks: 13

shaadclt/Groq-Whisper-Transcription-App
A Streamlit-based web application that transcribes audio files using OpenAI's Whisper API. You can either upload an MP3 file or input a YouTube URL to convert video audio into text within seconds.
Language: Python - Size: 14.6 KB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

Chidwi-commits/host-client-for-whisper-ai
A simple Python host-client setup for audio transcription using OpenAI's Whisper AI model.
Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

carloscdias/whisper-cpp-python
whisper.cpp bindings for python
Language: Python - Size: 79.1 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 89 - Forks: 23

igopalakrishna/EchoMate
EchoMate – An AI mental health companion using GPT-4o, Whisper, and Gradio for empathetic conversations, real-time transcription (95% accuracy), and seamless interaction. Boosts mood scores by 28% and achieves 98% session completion.
Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

danielrosehill/Thought-Pad
Linux desktop application that provides a two-stage process for creating notes from dictated speech (first stage: transcription via the Whisper API; second stage: light text formatting). Exports to Markdown documents.
Language: Python - Size: 49.4 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

francisdiasbr/real-estate-app
✨ Intelligent real estate search platform using AI to understand natural language queries and find properties based on context and meaning.
Language: TypeScript - Size: 420 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

nomadxxxx/symmetrical-fishstick
A small proxy to allow OpenAI Whisper-like requests to Deepgram
Language: Python - Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

bruceunx/video-maestro
A powerful desktop app built with Tauri and ReactJS to manage videos from YouTube or similar platforms. Features include audio-to-text transcription, translation, summarization, and a user-friendly interface. Perfect for creators, researchers, and video enthusiasts!
Language: Rust - Size: 26.9 MB - Last synced at: 24 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

didmar/whisper-api-server
Drop-in replacement for OpenAI's Whisper API that exposes the same API but runs locally
Language: Python - Size: 188 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 11 - Forks: 3

kristofferv98/VoiceProcessingToolkit
The VoiceProcessingToolkit is a suite for voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications.
Language: Python - Size: 34.3 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

themanyone/whisper_dictation
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
Language: Python - Size: 963 KB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 187 - Forks: 27

redocrepus/arkode
Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT
Language: TypeScript - Size: 97.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

jk-oster/voice-to-text-extension
A web extension to use your voice as input for any webpage
Language: JavaScript - Size: 227 KB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

AznIronMan/pyscribe
PyScribe is a command-line tool to transcribe audio files. It uses `ffmpeg` for audio conversion and `pywhisper` for transcription.
Language: Python - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

alexiskirke/meditation-video-generator-openai
A tool to create guided meditations (like those found on YouTube) using OpenAI.
Language: Python - Size: 74.2 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

leonchanwy/subtitle
A web UI for transcribing subtitles with OpenAI's Whisper API.
Language: Python - Size: 39.5 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

vwkyc/ASSR 📦
sentiment analysis on transcribed speech or text with multilingual capability
Language: JavaScript - Size: 1.57 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

niqifan007/Openai-tts-stt-streamlit
A GUI, built with Streamlit, for the OpenAI API's TTS (text-to-speech) and STT (speech-to-text) endpoints, with a history feature.
Language: Python - Size: 20.5 KB - Last synced at: 27 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

maninhouse/Huh
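"Huh (蛤)?" is a LINE chatbot built with Flask and the OpenAI API. It receives and processes voice messages from LINE, uses OpenAI's speech recognition to convert the speech to text, and sends the text back to the user.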
Language: Python - Size: 52.7 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

moltrus/openai-api-pricing
A simple Python script to track the amount spent based on usage parameters for OpenAI APIs
Language: Python - Size: 4.88 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Youknow2509/Real-Time-Speech-To-Text
Speech To Text in Real-Time
Language: Python - Size: 8.79 KB - Last synced at: 15 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

NoSkhil/speechToText
Quick speech to text integration demo, using OpenAI's Whisper API
Language: TypeScript - Size: 17.6 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lliWcWill/liveTranslation_openai-whisper
Live translation tool using OpenAI's Whisper model for real-time audio transcription and translation into the language of your choice. Bring your own OpenAI API key (BYOK).
Language: Python - Size: 179 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 5 - Forks: 2

MO7YW4NG/CYCU-iLearning-Video-Transcription
Transcribes Chung Yuan Christian University (CYCU) iLearning video course materials into verbatim transcripts
Language: Python - Size: 27.3 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

goktugcy/noteai
An AI-supported Node.js application that converts an audio file to text with Whisper and displays the result as a PDF.
Language: TypeScript - Size: 204 KB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

natehouk/flow-ai-hackathon-2023
YASS.ai - Team Orange's entry to the Flow AI Hackathon 2023
Language: Python - Size: 127 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1

gabrielcdb/RealJarvis
A working Speech to Speech AI assistant that can interact with you, manage your system, and more!
Language: Python - Size: 237 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 12 - Forks: 0

jacintogomez/Whisper-AI-Translation
Multilingual verbal conversation with an AI bot
Language: Jupyter Notebook - Size: 38.1 KB - Last synced at: 22 days ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

march038/MultiMP3-Transcription-Whisper1-API-Regardless-Of-Size
Python script to transcribe all mp3 files from a local directory using the Whisper 1 API, regardless of their size by using batch processing if necessary
Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

marcelpetrick/speech4excellence
Voice transcription prototype with openAI's Whisper and PyQt-UI and Excel output
Language: Python - Size: 352 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

arian0zen/QueryWhisperer
Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.
Language: JavaScript - Size: 3.62 MB - Last synced at: 11 months ago - Pushed at: 12 months ago - Stars: 29 - Forks: 7

allseeteam/whisperx-fastapi Fork of m-bain/whisperX
WhisperX FastAPI integration
Language: Python - Size: 23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Chasesc/whisper-api
Language: Python - Size: 48.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

leotodisco/WLT
A tool that uses Whisper to transcribe university lectures.
Language: Python - Size: 39.1 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mochi-neko/Whisper-API-unity
A client library of OpenAI Whisper transcription and translation API for Unity.
Language: C# - Size: 443 KB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 1

7Biscuits/Tube.ai
YouTube Summarizer and Chatbot
Language: TypeScript - Size: 12.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

supershaneski/openai-whisper-talk
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a JavaScript framework based on Vue.js.
Language: JavaScript - Size: 601 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 92 - Forks: 28

DamianB-BitFlipper/async-whisper
Asynchronously transcribe audio files split into chunks in parallel and intelligently join results, yielding nearly identical transcriptions to full audio transcriptions but in a fraction of the time.
Language: Python - Size: 18.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ayushsoni1010/textify
🎙️Seamlessly transcribing the world, one spoken word at a time, in any language you desire.
Language: TypeScript - Size: 374 KB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

Itz-fork/Vrappy
Summarize videos using AI
Language: TypeScript - Size: 48.8 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

supershaneski/openai-whisper-api
A sample speech transcription app implementing the OpenAI Speech to Text API based on Whisper, an automatic speech recognition (ASR) system, built using Next.js 13, the React framework
Language: JavaScript - Size: 650 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 47 - Forks: 17

Lord-Haji/ChatAudio Fork of Anil-matcha/ChatPDF
Language: Python - Size: 2.94 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

fabianigual/whisper-ai-litio
Whisper AI for transcription to documents in Markdown format
Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

jacksparrow124/HM-GPT
Home Manager GPT is a text-to-speech ChatGPT assistant that can be used to control your entire house. Ask verbal questions and get verbal answers, with input handled by Google speech recognition.
Language: Python - Size: 106 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

mvanzulli/Meeting_Assistant
A simple and effortless automatic recorder, transcriber, translator and summarizer for meetings: Whisper + ChatGPT
Language: Python - Size: 981 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

interactive-applications/speech-to-clipboard
A simple UI tool written in Python, for recording audio from a microphone and automatically transcribing the recording using OpenAI's Whisper model via OpenAI's API.
Language: Python - Size: 354 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

cedpoilly/parrot
Ced's parrot! Speech-to-text (Whisper API from OpenAI) and text-to-speech (Narakeet API) demo.
Language: Vue - Size: 2.17 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

swfz/gpt-1on1
Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

satoryu/video_description_generator
Language: Python - Size: 4.88 KB - Last synced at: 26 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

codeverlan/Clinical_Transcriber
Uses Whisper AI to transcribe and process audio files, with the output being useful to psychotherapists.
Language: Python - Size: 2.11 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

UBOS-tech/node-red-contrib-speech-to-text-ubos
Learn how to turn audio into text.
Language: JavaScript - Size: 18.6 KB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

crazydevlegend/twspace-discord-stt
Discord bot that downloads and transcribes Twitter Space audio files
Language: Python - Size: 14.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

iamthejahid/transcript_whisper_flutter
Whisper is an automatic speech recognition (ASR) system from OpenAI. This is an offline Whisper integration for Flutter.
Language: C++ - Size: 91.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

thomasjv799/Whisper-Speech-to-Text-CLI.
A simple CLI that uses OpenAI's Whisper model to transcribe any audio.
Language: Python - Size: 19.8 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
