GitHub topics: speech-to-text-api
Evil0ctal/Fast-Powerful-Whisper-AI-Services-API
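⚡ A high-performance asynchronous API for automatic speech recognition (ASR) and translation. There is no need to purchase the Whisper API: inference runs on locally hosted Whisper models, with support for multi-GPU concurrency and a design aimed at distributed deployment. It also includes built-in crawlers for social media platforms such as TikTok and Douyin, enabling seamless media processing across multiple social platforms and providing a powerful, scalable solution for automated processing of media content data.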
Language: Python - Size: 1.21 MB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 368 - Forks: 42

rtzr/Awesome-Korean-Speech-Recognition
A list of Korean speech recognition (STT) APIs, with performance benchmarks for each.
Size: 86.9 KB - Last synced at: 9 days ago - Pushed at: 18 days ago - Stars: 403 - Forks: 22

versevo-ai/versevo-ai
Language: Python - Size: 585 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 4

HenestrosaDev/audiotext
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
Language: Python - Size: 80.5 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 194 - Forks: 18

ShouryaKapoor/RealTime_Transcripter_Mini_Project
Automated Transcription System is a Python-based app using OpenAI's Whisper model for real-time audio/video transcription. It features a GUI, monitors a folder for new media files, and supports multiple formats. The app ensures efficient processing, avoids duplicates, and provides logs, making it ideal for seamless speech-to-text conversion.
Language: Python - Size: 73.2 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

StudiYash/DweshaMukt
DweshaMukt leverages BERT and deep learning to detect hate speech in Hinglish, Hindi, and English across text, audio, video, images, GIFs, and YouTube comments. With real-time analysis, emoticon detection, and a Streamlit interface, it aims to foster safer online spaces through advanced NLP techniques.
Language: Jupyter Notebook - Size: 194 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

ascender1729/ClearSpeak
ClearSpeak is a real-time audio transcription application using Google's Speech-to-Text API. It features a Tkinter-based GUI, filters background noise, and provides clear speech transcription.
Language: Python - Size: 11.6 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

uberduck-ai/openduck
Building an open-source interactive AI plush toy.
Language: Python - Size: 3.93 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 1

ilya-filatov-94/Voice-assistent
Voice assistant for desktop
Language: C++ - Size: 56.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

NthnUlmr/DiscordLiveTranscriptionBot
A Discord bot that transcribes your audio in real time using a combination of API calls to other services.
Language: Python - Size: 198 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

alhazenlabs/jarvis
Language: Python - Size: 217 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vikashkhati007/-Speech-To-Text-Recognition-App
Speech To Text Recognition App converts spoken words to written text in real-time using the browser's speech recognition API. The app is built on React and provides users with easy control of speech recognition, manipulation of text, and copying to the clipboard. It is an accessible way to input text for users with disabilities.
Language: JavaScript - Size: 260 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ineed-coffee/seesun
:gift: 시선 (Seesun), a warm service that brightens the world :gift: Final project for the Multicampus deep-learning-based AI engineering course
Language: CSS - Size: 88.6 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 3
