GitHub topics: voice-synthesis
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python - Size: 162 MB - Last synced at: about 20 hours ago - Pushed at: about 1 year ago - Stars: 42,427 - Forks: 5,566

RageAgainstThePixel/ElevenLabs-DotNet
A Non-Official ElevenLabs RESTful API Client for dotnet
Language: C# - Size: 2 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 70 - Forks: 25

denizsafak/abogen
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
Language: Python - Size: 4.08 MB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 3,205 - Forks: 161

hparcells/rtvc
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
Language: TypeScript - Size: 499 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 6

Sincromisor/Sincromisor
Web service platform for live streaming as a cute character with a cute voice and for chatting with cute AI agents (fully operable on-premises)
Language: Python - Size: 19.9 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 20 - Forks: 0

FranciscoTC9999/abogen
🔊 Convert text to speech effortlessly with Abogen, a robust tool that supports multiple operating systems for clear and natural voice outputs.
Language: Python - Size: 2.13 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

soffiee32/OtosakuTTS-iOS
🗣️ Generate natural-sounding speech on iOS devices with this Swift library using on-device text-to-speech synthesis, ensuring privacy and fast performance.
Language: Swift - Size: 18.6 KB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

chdh/klatt-syn
Klatt formant synthesizer
Language: TypeScript - Size: 37.1 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 65 - Forks: 5

ManimCommunity/manim-voiceover
Manim plugin for all things voiceover
Language: Python - Size: 879 KB - Last synced at: 19 days ago - Pushed at: 7 months ago - Stars: 242 - Forks: 60

nipponjo/tts-arabic-pytorch
TTS models for Arabic (Tacotron2, FastPitch)
Language: Jupyter Notebook - Size: 3.23 MB - Last synced at: 20 days ago - Pushed at: 10 months ago - Stars: 117 - Forks: 31

RageAgainstThePixel/com.rest.elevenlabs
A non-official Eleven Labs voice synthesis client for Unity (UPM)
Language: C# - Size: 2.45 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 99 - Forks: 13

Otosaku/OtosakuTTS-iOS
Swift library for offline text-to-speech synthesis on iOS/macOS. Generate natural speech directly on device using CoreML-optimized FastPitch and HiFiGAN models. No internet required, fully private.
Language: Swift - Size: 16.6 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0

chdh/klatt-syn-app
GUI application for the Klatt formant synthesizer package
Language: TypeScript - Size: 33.2 KB - Last synced at: 26 days ago - Pushed at: 27 days ago - Stars: 11 - Forks: 3

AI2AIs/ai2ais-core
AI2AIs Core Engine - Autonomous digital organisms that debate to survive. Real AI characters with evolving personalities, vector memory systems, and adaptive voice synthesis. Features real-time TTS, lip-sync, peer analysis, and life energy mechanics.
Language: Python - Size: 81.6 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

nipponjo/tts_arabic
TTS for Arabic (FastPitch, Mixer-TTS) in the ONNX format
Language: Python - Size: 85.9 KB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 26 - Forks: 6

Secret-Society-Braid/voicevox4j
Java FFI wrapper for VOICEVOX CORE
Language: Java - Size: 153 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

dhruvldrp9/News_Agent
News Agent provides real-time news updates, AI-powered summaries, and voice interaction. Get instant access to global and regional news with intelligent analysis powered by OpenAI.
Language: JavaScript - Size: 438 KB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 1

yuis-ice/text-to-speech
🎤 VoiceFlow - Modern text-to-speech web application with real-time word highlighting, customizable voice settings, and content management. Built with React, TypeScript, and Web Speech API.
Language: TypeScript - Size: 88.9 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Pavan143Kundeti/text-to-podcast-with-subtitles
Convert any text script into a podcast with automatic text-to-speech and subtitle generation. Features a simple web player, MP3/WAV export, and easy subtitle creation for accessible, shareable audio content.
Language: Python - Size: 10.3 MB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

notebook-nexus/chatterbox-tts-colab
Transform any text into natural-sounding speech, clone voices from audio samples, and create professional voiceovers - all running free in Google Colab!
Language: Jupyter Notebook - Size: 1.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 17 - Forks: 2

lifecompanionaac/lifecompanion
LifeCompanion is a free open-source AAC software
Language: Java - Size: 55.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 7

panyanyany/Twocast
AI Podcast Generator for bilingual episodes, Multi Languages, Alternative to NotebookLLM; AI podcast generator with realistic human dialogue, multilingual, multiple voices
Language: TypeScript - Size: 4.33 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 906 - Forks: 77

spokestack/spokestack-ios 📦
Spokestack: give your iOS app a voice interface!
Language: Swift - Size: 9.94 MB - Last synced at: 26 days ago - Pushed at: about 4 years ago - Stars: 45 - Forks: 9

YuzukiTsuru/lessampler
lessampler is a Singing Voice Synthesizer
Language: C++ - Size: 19 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 73 - Forks: 5

YanivHaliwa/gemini-tts-conversation-generator
Language: Python - Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

walidBenterki/auto-trans
Transform video URLs into clean transcripts with auto-trans. Download, transcribe, and copy text in one command. Perfect for researchers and content creators. 🐙📂
Language: Python - Size: 23.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

jonatangulei/Cultural_AI_tutor
Empower children with Cultural AI Tutor, an AI-driven platform offering personalized stories and interactive learning in math, science, and history. 🌍💻
Language: TypeScript - Size: 259 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

wink-wink-wink555/blind_navigation
The Tactile Paving Navigation Assistant System is an AI-powered solution designed to enhance mobility for visually impaired individuals by combining real-time video analysis with voice-guided navigation. All data is processed locally, prioritizing privacy and offline usability.
Language: HTML - Size: 16.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 0

XueJourney/AIA
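AIA is an innovative dual-mode AI conversation system that uses a reasoning model for logical analysis and OpenAI for human-like replies. It supports both GUI and CLI interfaces, speech synthesis, personalized preference settings, and smart prefix control, giving users an AI conversation experience with both depth and warmth.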
Language: Python - Size: 1.2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

nithins7676/Cultural_AI_tutor
🎓 Cultural AI Tutor - AI-Powered Educational Storytelling Platform Interactive learning app that generates culturally relevant stories for children using AI. Features voice narration, math visualization, and personalized quizzes. Built with React, TypeScript, Supabase, and Google Gemini AI. Perfect for children's education with cultural context
Language: TypeScript - Size: 260 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

IlhamSyahputra23/chatterbox-tts-colab
Easily clone voices and convert text to speech with Chatterbox TTS in Google Colab. Start your voice project today! 🐙✨
Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

46nori/FMSynthEnsemble
USB MIDI FM synthesizer that supports voice synthesis using CSM (Composite Sinusoidal Model)
Language: C++ - Size: 26.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

smoke-trees/Voice-synthesis
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that makes it possible to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.
Language: Python - Size: 3.12 MB - Last synced at: about 2 months ago - Pushed at: almost 5 years ago - Stars: 170 - Forks: 46

EasyAI-France/Audiobook-Simplifier
Audiobook Simplifier is a tool that creates audiobooks from text documents or eBooks using TTS (Text-to-Speech) technology.
Language: Python - Size: 70.3 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

gooofy/zerovox
zero-shot realtime TTS system, fully offline, free and open source
Language: Python - Size: 38.9 MB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 41 - Forks: 5

ruputron/rupu_tts
The first official open-source release of Rupu TTS — a lightweight, offline desktop text-to-speech app powered by Coqui TTS. Only made this to learn a bit myself.
Language: Python - Size: 159 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

TSavo/chatterbox-tts-api
High-performance TTS API with voice cloning, emotion control, and synchronous MP3 generation. Built with FastAPI and powered by Chatterbox TTS.
Language: Python - Size: 82 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

zakaton/Pink-Trombone
A programmable version of Neil Thapen's Pink Trombone
Language: JavaScript - Size: 17.1 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 177 - Forks: 30

MatusOllah/gotau
Work-in-progress UTAU-compatible singing voice synthesizer, written in Go
Language: Go - Size: 37.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

john-carroll-sw/coffee-chat-voice-assistant
Coffee Chat Voice Assistant is a voice-driven ordering system powered by Azure OpenAI GPT-4o Realtime API, simulating the experience of ordering coffee with a café barista. It supports natural conversations, live order updates, and real-time transcription, showcasing the power of AI for seamless customer interactions.
Language: Jupyter Notebook - Size: 42.4 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 24 - Forks: 7

wafflecomposite/15.ai-Python-API
Python3 script for interaction with https://fifteen.ai/
Language: Python - Size: 22.5 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 41 - Forks: 12

spokestack/spokestack-android 📦
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Language: Java - Size: 1.25 MB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 74 - Forks: 10

ZDisket/TensorVox
Desktop application for neural speech synthesis written in C++
Language: C++ - Size: 15.5 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 215 - Forks: 20

yas-sim/csm_voice_encode_synthesis_python
Experimental code for CSM voice synthesis + CSM data generation
Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 1

arniery/andys-project
final assignment for the trinity SLP course "speech processing 2: acoustic modelling": cascade and parallel formant synthesis, the end goal being to produce vowels using both methods.
Language: Jupyter Notebook - Size: 664 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ngpepin/TTS
Convert a markdown file into an audio narration using VCTK and Coqui-TT
Language: Shell - Size: 20 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

olaviinha/NeuralTextToAudio
Text prompt steered synthetic audio generators
Language: Jupyter Notebook - Size: 337 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 46 - Forks: 7

jeanjerome/VoiceGenMeeting
CLI tool that generates synthetic meeting audio from a simple text-based transcript, assigning a unique voice to each speaker.
Size: 4.14 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

nipponjo/mixer-tts-pytorch
Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Snigdho8869/text-to-speech-app
A Flask web app that converts text to speech using gTTS.
Language: HTML - Size: 66.4 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Minusree/VoiceAura
VoiceAura is an audio processing pipeline for Singing Voice Conversion using the so-vits-svc-fork framework. It processes your voice, performs source separation, prepares datasets, and includes automatic preprocessing. The pipeline uses tensorboard for training and modifies vocal quality and pitch during inference, producing high-quality outputs.
Language: Jupyter Notebook - Size: 15.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 0

jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Size: 136 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 1,875 - Forks: 237

alexnaughtonjr/Real-Time-Voice-Cloning (fork of CorentinJ/Real-Time-Voice-Cloning)
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Size: 352 MB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 0

tjas/postgrad-ai-nlp2-voice-ui
A Voice User Interface tool for Text-to-Speech and Speech-to-Text, built with Python and the Django framework, to solve the exercise proposed in the "Cognitive Computing 2: Voice User Interface" course.
Language: JavaScript - Size: 871 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 0

MayorX500/VoiceSynth-Agentifai (fork of ddu72/PI)
Informatics Project '24 - Real-Time Voice Synthesis - Agentifai
Language: Python - Size: 35.9 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

berlin0308/NTU-2023Fall-Intro-AI
Language: Jupyter Notebook - Size: 144 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

davep/festival.el
festival.el provides a simple interface into the festival speech synthesis program
Language: Emacs Lisp - Size: 52.7 KB - Last synced at: 5 days ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 2

storbeck/bait
Generate realistic IT security alert voicemails using GPT-4 for scripting and ElevenLabs for AI voice synthesis. A Go-based tool for crafting professional-grade alerts with customizable details and natural-sounding audio.
Language: Go - Size: 1.36 MB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

yuanhao-chen-nyoeghau/klatt-api
Flask app to synthesise a vowel based on formant values. Backend for react-klatt.nyoeghau.com
Language: Python - Size: 9.77 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

EX3exp/MiriVoice
Open-Free TTS Platform For All
Language: C# - Size: 125 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 8 - Forks: 2

PepperoniJoe/BeaconDetector
An iOS app that can detect an iBeacon. This app acts as an example Museum app that will display details of an art exhibit when near the exhibit's beacons.
Language: Swift - Size: 34.8 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 5

Harium/espeak-java
espeak java wrapper
Language: Java - Size: 14.6 KB - Last synced at: 18 days ago - Pushed at: over 4 years ago - Stars: 16 - Forks: 8

madworx/tms5220-atmega
TMS5220 exploration unit
Language: C - Size: 3.84 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

Mohamedhany99/Audio-Splitter-per-seconds-python-
This script takes an audio file (preferably in ".wav" format) as input, splits it into 3-second segments (the length is editable), then creates a folder and saves the split audio files there, numbered in ascending order.
Language: Python - Size: 13.4 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

yuanhao-chen-nyoeghau/react-klatt
Use formant values to synthesise vowels.
Language: TypeScript - Size: 5.42 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

13-4dev/RVC-model-train-Windows-
pipeline for Mangio RVC Fork
Language: Python - Size: 15.6 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

GeneralNuisance0/Arachne-RECLUSE-neo-
Size: 1000 Bytes - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

weizlogy/alpacartaw
generAtes reaL-time subtitles in multiPle lAnguages from voiCe recognition And Reads Them Aloud. (with obs and obs-Websocket)
Language: JavaScript - Size: 298 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

Rennsen/DocuNarrator-AI
DocuNarrator: Narrate Your Life. Inspired by a Twitter post, this project uses AI to create a personal narrator for your life.
Language: Python - Size: 198 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

mehdihosseinimoghadam/Signal-Processing
Signal Processing with Python and Librosa
Language: Jupyter Notebook - Size: 46.6 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 2

jim-schwoebel/nala
🦁 Nala is an agile open-source voice assistant framework (20+ actions).
Language: Python - Size: 40.7 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 35 - Forks: 15

kristoisberg/gonesyntees
Golang client for the voice synthesis service by the Institute of the Estonian Language.
Language: Go - Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 1

Developer-RONNIE/ai-saas
Genius is an innovative AI-SaaS platform. Our platform offers five powerful capabilities: Conversation, Image Generation, Video Generation, Music Generation, and Code Generation. Each feature is crafted to deliver exceptional performance and user satisfaction.
Language: TypeScript - Size: 147 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

sankeer28/DiscordBot-v2
Discord bot made using Python with many features including AI chat, music playback, video downloader, OSINT tools, and more
Language: Python - Size: 299 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Mohamedhany99/Voice-Frequency-Extraction-Signal-Processing-
This script extracts the frequency of the voice detected in an audio file (preferably in ".wav" format)
Language: Python - Size: 94.7 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 1

DanRuta/xVA-Synth
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
Language: JavaScript - Size: 1.14 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 580 - Forks: 54

temptemp3/polly.sh
Wrapper for aws polly in bash
Language: Shell - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

ZTiKnl/sara_old 📦
Sara is a prompt that: listens for commands (keyboard or voice recognition), executes a built-in command or a plugin based on regular expression string matching, then uses text-to-speech to give the answer. Now with vision support through USB webcam (WIP).
Language: JavaScript - Size: 1.66 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Azure-Samples/Cognitive-Services-Voice-Assistant
Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building a client application for your bot or Custom Command service. You will also be able to easily deploy a working Custom Command based Voice Assistant to your own Azure subscription.
Language: C++ - Size: 76.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 92 - Forks: 99

DLehenbauer/c64-sam
Documented 6502 assembly code for the SAM voice synthesizer
Language: Assembly - Size: 69.3 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 1

yas-sim/csm_voice_synthesis_ym2203_python
Experimental code for CSM (composite sinusoidal modeling) voice synthesis with Python
Language: Jupyter Notebook - Size: 3.63 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

lugomio/browser-speech-synthesis
The project uses the browser's speech synthesis feature to transform text typed by the user into voice.
Language: HTML - Size: 21.5 KB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

YuzukiTsuru/SinsyPlus
Singing Voice Synthesis System based on Sinsy
Language: Python - Size: 6.31 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 3

radoslawregula/VoxG
Singing voice synthesizer using GANs
Language: Python - Size: 145 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

CheapCyborg/SpeakEasy
SpeakEasy - Real time translations and text-to-speech
Language: Python - Size: 328 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

colejd/jon-trombone
A poor use case for voice synthesis
Language: JavaScript - Size: 2.88 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

kdffdwsfgdw43331/iidia
Size: 1.95 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

RG-7/RoboSpeaker
This is a simple text-to-speech translator developed using Python
Language: Python - Size: 3.21 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

omolewadavids/RetinopathyChatBot
Image Classification/Natural Language Processing: An AI-enabled conversational chatbot that helps diabetic patients detect diabetic retinopathy and gives information about the disease, including its symptoms, treatments, and ongoing research. This patient-care app also finds the eye hospital nearest to the patient for emergency visits.
Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

SforAiDl/Neural-Voice-Cloning-With-Few-Samples 📦
This repository contains an implementation of "Neural Voice Cloning With Few Samples"
Language: Python - Size: 42.3 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 415 - Forks: 121

hujinsen/pytorch-StarGAN-VC
Fully reproduces the StarGAN-VC paper. Stable training and better audio quality.
Language: Python - Size: 79 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 235 - Forks: 59

thedigitalchief/voice-command-assistant
Powerful assistant performing powerful automated tasks from user’s voice inputs. Developed using machine learning and speech synthesis Python frameworks.
Language: Python - Size: 31.3 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

JollyToday/GhostCut-auto_video_translation
Automatic video translation and dubbing: the video translator can automatically translate hard subtitles in videos, translate video subtitles and restore their original style, and remove any video text or subtitles.
Language: Python - Size: 101 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 43 - Forks: 8

intunist/nnsvs-japanese-plus
Custom HED and Table for Intunist Japanese
Size: 192 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 1

brycehowitson/SSML-prosody-library
A collection of pre-built speech synthesis settings used to convey emotion
Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 11 - Forks: 3

SethKitchen/SethVoice
Making an AI voice from my speaking
Language: Jupyter Notebook - Size: 268 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

kazukiotsuka/FFTNet
Implementation of Zeyu et al., "FFTNET: A REAL-TIME SPEAKER-DEPENDENT NEURAL VOCODER"
Language: Jupyter Notebook - Size: 27.3 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

PepperoniJoe/Voices
An iOS app that reads any typed text using one of many voices.
Language: Swift - Size: 17.2 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

shun60s/Vocal-Tube-Model
A very simple vocal tract model (a few-tube model) that generates vowel sounds.
Language: Python - Size: 477 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 16 - Forks: 3

eros71-dev/mario-voice-dataset
A dataset for Mario's voice (Charles Martinet), from the Super Mario franchise. More info here: https://uberduck.ai/about
Size: 21.7 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1
