An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: whisper-api

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,为媒体内容数据自动化处理提供了强大且可扩展的解决方案。

Language: Python - Size: 1.21 MB - Last synced at: about 1 hour ago - Pushed at: about 2 months ago - Stars: 359 - Forks: 39

ionic-bond/stream-translator-gpt Fork of fortypercnt/stream-translator

A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.

Language: Python - Size: 21.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 182 - Forks: 25

waltervanheuven/speech2text

Speech2Text

Language: Python - Size: 59.6 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

connor-pickett/tend_t

used for CASCogLab

Language: Python - Size: 4.88 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

adithya-s-k/omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Language: Python - Size: 592 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6,463 - Forks: 524

FlyingFathead/TelegramBot-OpenAI-API

A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API

Language: Python - Size: 731 KB - Last synced at: 7 days ago - Pushed at: 14 days ago - Stars: 23 - Forks: 5

Carleslc/AudioToText

Transcribe and translate audio to text using Whisper and DeepL.

Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 323 - Forks: 44

codeonthespectrum/AISubs

A subtitle generator for videos up to 10GB, automatically transcribing and translating spoken content into Brazilian Portuguese. Ideal for multilingual content, this tool creates accurate `.srt` files for seamless integration with video players.

Language: Python - Size: 6.84 KB - Last synced at: 12 days ago - Pushed at: 20 days ago - Stars: 3 - Forks: 0

Clats97/ClatScribe

ClatScribe is a speech-to-text tool that captures real-time audio, transcribes 3- second chunks via OpenAI API, timestamps and logs the text. The 3 second audio files are deleted after the newest file is written, so it does not take up lots of space. It has a CLI and a GUI..

Language: Python - Size: 16.6 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

mallorbc/whisper_mic

Project that allows one to use a microphone with OpenAI whisper.

Language: Python - Size: 54.7 KB - Last synced at: 12 days ago - Pushed at: 10 months ago - Stars: 762 - Forks: 167

GURPREETKAURJETHRA/Youtube-Video-Transcribe-Summarizer-LLM-App

YouTube Video Summarization App built using open source LLM and Framework like Llama 2, Haystack, Whisper, and Streamlit. This app smoothly runs on CPU as Llama 2 model is in GGUF format loaded through Llama.cpp.

Language: Python - Size: 3.77 MB - Last synced at: 14 days ago - Pushed at: 12 months ago - Stars: 10 - Forks: 7

mouredev/tggenerator

Generador de logotipos de eSports por IA (con fines académicos durante el evento Tenerife GG)

Language: Kotlin - Size: 31.4 MB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 202 - Forks: 13

shaadclt/Groq-Whisper-Transcription-App

A Streamlit-based web application that transcribes audio files using OpenAI's Whisper API. You can either upload an MP3 file or input a YouTube URL to convert video audio into text within seconds.

Language: Python - Size: 14.6 KB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

Chidwi-commits/host-client-for-whisper-ai

A simple Python host-client setup for audio transcription using OpenAI's Whisper AI model.

Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

carloscdias/whisper-cpp-python

whisper.cpp bindings for python

Language: Python - Size: 79.1 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 89 - Forks: 23

igopalakrishna/EchoMate

EchoMate – An AI mental health companion using GPT-4o, Whisper, and Gradio for empathetic conversations, real-time transcription (95% accuracy), and seamless interaction. Boosts mood scores by 28% and achieves 98% session completion.

Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

danielrosehill/Thought-Pad

Linux desktop application that provides a two-stage process for creating notes from dictated speech (first stage, transcription via Whisper API; second stage light text formatting). Exports to markdown docs.

Language: Python - Size: 49.4 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

francisdiasbr/real-estate-app

✨ Intelligent real estate search platform using AI to understand natural language queries and find properties based on context and meaning.

Language: TypeScript - Size: 420 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

nomadxxxx/symmetrical-fishstick

A small proxy to allow OpenAI Whisper-like requests to Deepgram

Language: Python - Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

bruceunx/video-maestro

A powerful desktop app built with Tauri and ReactJS to manage videos from YouTube or similar platforms. Features include audio-to-text transcription, translation, summarization, and a user-friendly interface. Perfect for creators, researchers, and video enthusiasts!

Language: Rust - Size: 26.9 MB - Last synced at: 24 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

didmar/whisper-api-server

Drop-in replacement for the OpenAI's Whisper API using the same API but running locally

Language: Python - Size: 188 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 11 - Forks: 3

kristofferv98/VoiceProcessingToolkit

The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications

Language: Python - Size: 34.3 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

themanyone/whisper_dictation

Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.

Language: Python - Size: 963 KB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 187 - Forks: 27

redocrepus/arkode

Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT

Language: TypeScript - Size: 97.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

jk-oster/voice-to-text-extension

A web extension to use your voice as input for any webpage

Language: JavaScript - Size: 227 KB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

AznIronMan/pyscribe

PyScribe is a command-line tool to transcribe audio files. It uses `ffmpeg` for audio conversion and `pywhisper` for transcription.

Language: Python - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

alexiskirke/meditation-video-generator-openai

A tool to create guided meditations (like those found on YouTube) using OpenAI.

Language: Python - Size: 74.2 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

leonchanwy/subtitle

用 Open AI 的 Whisper API 轉譯字幕的 Web UI。

Language: Python - Size: 39.5 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

vwkyc/ASSR 📦

sentiment analysis on transcribed speech or text with multilingual capability

Language: JavaScript - Size: 1.57 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

niqifan007/Openai-tts-stt-streamlit

A gui interface for tts (text-to-speech) and stt (speech-to-text) interfaces using the openai api developed by Streamlit, with a history function一个使用Streamlit开发的openai的api接口的tts(文字转语音)和stt(语音转文字)接口的gui界面,带有历史记录功能

Language: Python - Size: 20.5 KB - Last synced at: 27 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

maninhouse/Huh

「Huh(蛤)?」是一個使用 Flask 和 OpenAI API 建立的 LINE 聊天機器人。它可以接收並處理來自 LINE 的語音訊息,並利用 OpenAI 的語音識別技術將語音轉換為文字,同時將文字訊息回傳給用戶。

Language: Python - Size: 52.7 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

moltrus/openai-api-pricing

A simple Python script to track the amount spent based on usage parameters for OpenAI APIs

Language: Python - Size: 4.88 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Youknow2509/Real-Time-Speech-To-Text

Speech To Text in Real-Time

Language: Python - Size: 8.79 KB - Last synced at: 15 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

NoSkhil/speechToText

Quick speech to text integration demo, using OpenAI's Whisper API

Language: TypeScript - Size: 17.6 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lliWcWill/liveTranslation_openai-whisper

Live translation tool utilizing OpenAI's Whisper model for real-time audio transcription/translation with BYOK OpenAI API key for your choice of language.

Language: Python - Size: 179 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 5 - Forks: 2

MO7YW4NG/CYCU-iLearning-Video-Transcription

中原大學 iLearning 影片教材轉錄逐字稿

Language: Python - Size: 27.3 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

goktugcy/noteai

An artificial intelligence supported NodeJS application that allows the audio file to be displayed as pdf after converting it to text with the Whisper tool.

Language: TypeScript - Size: 204 KB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

natehouk/flow-ai-hackathon-2023

YASS.ai - Team Orange's entry to the Flow AI Hackathon 2023

Language: Python - Size: 127 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1

gabrielcdb/RealJarvis

A working Speech to Speech AI assistant that can interact with you, manage your system, and more!

Language: Python - Size: 237 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 12 - Forks: 0

jacintogomez/Whisper-AI-Translation

Multilingual verbal conversation with an AI bot

Language: Jupyter Notebook - Size: 38.1 KB - Last synced at: 22 days ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

march038/MultiMP3-Transcription-Whisper1-API-Regardless-Of-Size

Python script to transcribe all mp3 files from a local directory using the Whisper 1 API, regardless of their size by using batch processing if necessary

Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

marcelpetrick/speech4excellence

Voice transcription prototype with openAI's Whisper and PyQt-UI and Excel output

Language: Python - Size: 352 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

arian0zen/QueryWhisperer

Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.

Language: JavaScript - Size: 3.62 MB - Last synced at: 11 months ago - Pushed at: 12 months ago - Stars: 29 - Forks: 7

allseeteam/whisperx-fastapi Fork of m-bain/whisperX

WhisperX FastAPI integration

Language: Python - Size: 23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Chasesc/whisper-api

Language: Python - Size: 48.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

leotodisco/WLT

Tool che utilizza la tecnologia Whisper per trascrivere le lezioni universitarie.

Language: Python - Size: 39.1 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mochi-neko/Whisper-API-unity

A client library of OpenAI Whisper transcription and translation API for Unity.

Language: C# - Size: 443 KB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 1

7Biscuits/Tube.ai

YouTube Summarizer and Chatbot

Language: TypeScript - Size: 12.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

supershaneski/openai-whisper-talk

openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a Javascript framework based on Vue.js.

Language: JavaScript - Size: 601 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 92 - Forks: 28

DamianB-BitFlipper/async-whisper

Asynchronously transcribe audio files split into chunks in parallel and intelligently join results, yielding nearly identical transcriptions to full audio transcriptions but in a fraction of the time.

Language: Python - Size: 18.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ayushsoni1010/textify

🎙️Seamlessly transcribing the world, one spoken word at a time, in any language you desire.

Language: TypeScript - Size: 374 KB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

Itz-fork/Vrappy

Summarize videos using AI

Language: TypeScript - Size: 48.8 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

supershaneski/openai-whisper-api

A sample speech transcription app implementing OpenAI Text to Speech API based on Whisper, an automatic speech recognition (ASR) system, built using Next 13, the React framework

Language: JavaScript - Size: 650 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 47 - Forks: 17

Lord-Haji/ChatAudio Fork of Anil-matcha/ChatPDF

Language: Python - Size: 2.94 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

fabianigual/whisper-ai-litio

Whisper AI para transcripción a documentos en formato markdown

Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

jacksparrow124/HM-GPT

Home Manager GPT is a text to speech chat gpt that can be used to control your entire house. ask verbal questions and get verbal answers from google speech recognition.

Language: Python - Size: 106 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

mvanzulli/Meeting_Assistant

A simple and effortless automatic recorder, transcriber, translator and summarizer for meetings: Whisper + ChatGPT

Language: Python - Size: 981 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

interactive-applications/speech-to-clipboard

A simple UI tool written in Python, for recording audio from a microphone and automatically transcribing the recording using OpenAI's Whisper model via OpenAI's API.

Language: Python - Size: 354 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

cedpoilly/parrot

Ced's parrot! Speech-to-text (Whisper API from OpenAI) and text-to-speech (Narakeet API) demo.

Language: Vue - Size: 2.17 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

swfz/gpt-1on1

Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

satoryu/video_description_generator

Language: Python - Size: 4.88 KB - Last synced at: 26 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

codeverlan/Clinical_Transcriber

Uses Whisper AI to transcribe and process audio files, with the output being useful to psychotherapists.

Language: Python - Size: 2.11 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

UBOS-tech/node-red-contrib-speech-to-text-ubos

Learn how to turn audio into text.

Language: JavaScript - Size: 18.6 KB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

crazydevlegend/twspace-discord-stt

Discord bot that downloads and transcribes twitter space audio file

Language: Python - Size: 14.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

iamthejahid/transcript_whisper_flutter

Whisper is an automatic speech recognition (ASR), a product of open ai. This is Offline Whisper ai integration in flutter.

Language: C++ - Size: 91.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

thomasjv799/Whisper-Speech-to-Text-CLI.

A Simple CLI that uses open AI's whisper model to transcribe any audio.

Language: Python - Size: 19.8 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0