An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: voice-synthesis

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python - Size: 162 MB - Last synced at: about 20 hours ago - Pushed at: about 1 year ago - Stars: 42,427 - Forks: 5,566

RageAgainstThePixel/ElevenLabs-DotNet

A Non-Official ElevenLabs RESTful API Client for dotnet

Language: C# - Size: 2 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 70 - Forks: 25

denizsafak/abogen

Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

Language: Python - Size: 4.08 MB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 3,205 - Forks: 161

hparcells/rtvc

💬 "Realtime" voice transcription and cloning using ElevenLabs's API.

Language: TypeScript - Size: 499 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 6

Sincromisor/Sincromisor

かわいいキャラと声になってライブ配信・かわいいAIエージェントとおしゃべりWebサービス基盤(全部オンプレ運用可能)

Language: Python - Size: 19.9 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 20 - Forks: 0

FranciscoTC9999/abogen

🔊 Convert text to speech effortlessly with Abogen, a robust tool that supports multiple operating systems for clear and natural voice outputs.

Language: Python - Size: 2.13 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

soffiee32/OtosakuTTS-iOS

🗣️ Generate natural-sounding speech on iOS devices with this Swift library using on-device text-to-speech synthesis, ensuring privacy and fast performance.

Language: Swift - Size: 18.6 KB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

chdh/klatt-syn

Klatt formant synthesizer

Language: TypeScript - Size: 37.1 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 65 - Forks: 5

ManimCommunity/manim-voiceover

Manim plugin for all things voiceover

Language: Python - Size: 879 KB - Last synced at: 19 days ago - Pushed at: 7 months ago - Stars: 242 - Forks: 60

nipponjo/tts-arabic-pytorch

TTS models for Arabic (Tacotron2, FastPitch)

Language: Jupyter Notebook - Size: 3.23 MB - Last synced at: 20 days ago - Pushed at: 10 months ago - Stars: 117 - Forks: 31

RageAgainstThePixel/com.rest.elevenlabs

A non-official Eleven Labs voice synthesis client for Unity (UPM)

Language: C# - Size: 2.45 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 99 - Forks: 13

Otosaku/OtosakuTTS-iOS

Swift library for offline text-to-speech synthesis on iOS/macOS. Generate natural speech directly on device using CoreML-optimized FastPitch and HiFiGAN models. No internet required, fully private.

Language: Swift - Size: 16.6 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0

chdh/klatt-syn-app

GUI applikation for the Klatt formant synthesizer package

Language: TypeScript - Size: 33.2 KB - Last synced at: 26 days ago - Pushed at: 27 days ago - Stars: 11 - Forks: 3

AI2AIs/ai2ais-core

AI2AIs Core Engine - Autonomous digital organisms that debate to survive. Real AI characters with evolving personalities, vector memory systems, and adaptive voice synthesis. Features real-time TTS, lip-sync, peer analysis, and life energy mechanics.

Language: Python - Size: 81.6 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

nipponjo/tts_arabic

TTS for Arabic (FastPitch, Mixer-TTS) in the ONNX format

Language: Python - Size: 85.9 KB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 26 - Forks: 6

Secret-Society-Braid/voicevox4j

Java FFI wrapper for VOICEVOX CORE

Language: Java - Size: 153 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

dhruvldrp9/News_Agent

News Agent provides real-time news updates, AI-powered summaries, and voice interaction. Get instant access to global and regional news with intelligent analysis powered by OpenAI.

Language: JavaScript - Size: 438 KB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 1

yuis-ice/text-to-speech

🎤 VoiceFlow - Modern text-to-speech web application with real-time word highlighting, customizable voice settings, and content management. Built with React, TypeScript, and Web Speech API.

Language: TypeScript - Size: 88.9 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Pavan143Kundeti/text-to-podcast-with-subtitles

Convert any text script into a podcast with automatic text-to-speech and subtitle generation. Features a simple web player, MP3/WAV export, and easy subtitle creation for accessible, shareable audio content.

Language: Python - Size: 10.3 MB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

notebook-nexus/chatterbox-tts-colab

Transform any text into natural-sounding speech, clone voices from audio samples, and create professional voiceovers - all running free in Google Colab!

Language: Jupyter Notebook - Size: 1.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 17 - Forks: 2

lifecompanionaac/lifecompanion

LifeCompanion is a free open-source AAC software

Language: Java - Size: 55.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 7

panyanyany/Twocast

AI Podcast Generator for bilingual episodes, Multi Languages, Alternative to NotebookLLM;真人对话AI播客生成器,多语言,多音色

Language: TypeScript - Size: 4.33 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 906 - Forks: 77

spokestack/spokestack-ios 📦

Spokestack: give your iOS app a voice interface!

Language: Swift - Size: 9.94 MB - Last synced at: 26 days ago - Pushed at: about 4 years ago - Stars: 45 - Forks: 9

YuzukiTsuru/lessampler

lessampler is a Singing Voice Synthesizer

Language: C++ - Size: 19 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 73 - Forks: 5

YanivHaliwa/gemini-tts-conversation-generator

Language: Python - Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

walidBenterki/auto-trans

Transform video URLs into clean transcripts with auto-trans. Download, transcribe, and copy text in one command. Perfect for researchers and content creators. 🐙📂

Language: Python - Size: 23.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

jonatangulei/Cultural_AI_tutor

Empower children with Cultural AI Tutor, an AI-driven platform offering personalized stories and interactive learning in math, science, and history. 🌍💻

Language: TypeScript - Size: 259 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

wink-wink-wink555/blind_navigation

The Tactile Paving Navigation Assistant System is an AI-powered solution designed to enhance mobility for visually impaired individuals by combining real-time video analysis with voice-guided navigation. All data is processed locally, prioritizing privacy and offline usability.

Language: HTML - Size: 16.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 0

XueJourney/AIA

AIA是一个创新的双模式AI对话系统,通过推理模型进行逻辑分析,OpenAI提供人性化回复。支持GUI/CLI双界面、语音合成、个性化偏好设置和智能前缀控制。为用户提供既有深度又有温度的AI交流体验。

Language: Python - Size: 1.2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

nithins7676/Cultural_AI_tutor

🎓 Cultural AI Tutor - AI-Powered Educational Storytelling Platform Interactive learning app that generates culturally relevant stories for children using AI. Features voice narration, math visualization, and personalized quizzes. Built with React, TypeScript, Supabase, and Google Gemini AI. Perfect for children's education with cultural context

Language: TypeScript - Size: 260 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

IlhamSyahputra23/chatterbox-tts-colab

Easily clone voices and convert text to speech with Chatterbox TTS in Google Colab. Start your voice project today! 🐙✨

Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

46nori/FMSynthEnsemble

USB MIDI FM synthesizer that supports voice synthesis using CSM (Composite Sinusoidal Model

Language: C++ - Size: 26.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

smoke-trees/Voice-synthesis

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.

Language: Python - Size: 3.12 MB - Last synced at: about 2 months ago - Pushed at: almost 5 years ago - Stars: 170 - Forks: 46

EasyAI-France/Audiobook-Simplifier

Audiobook Simplifier is a tool that creates audiobooks from text documents or eBooks using TTS (Text-to-Speech) technology.

Language: Python - Size: 70.3 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

gooofy/zerovox

zero-shot realtime TTS system, fully offline, free and open source

Language: Python - Size: 38.9 MB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 41 - Forks: 5

ruputron/rupu_tts

The first official open-source release of Rupu TTS — a lightweight, offline desktop text-to-speech app powered by Coqui TTS. Only made this to learn a bit myself.

Language: Python - Size: 159 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

TSavo/chatterbox-tts-api

High-performance TTS API with voice cloning, emotion control, and synchronous MP3 generation. Built with FastAPI and powered by Chatterbox TTS.

Language: Python - Size: 82 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

zakaton/Pink-Trombone

A programmable version of Neil Thapen's Pink Trombone

Language: JavaScript - Size: 17.1 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 177 - Forks: 30

MatusOllah/gotau

Work-in-progress UTAU-compatible singing voice synthesizer, written in Go

Language: Go - Size: 37.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

john-carroll-sw/coffee-chat-voice-assistant

Coffee Chat Voice Assistant is a voice-driven ordering system powered by Azure OpenAI GPT-4o Realtime API, simulating the experience of ordering coffee with a café barista. It supports natural conversations, live order updates, and real-time transcription, showcasing the power of AI for seamless customer interactions.

Language: Jupyter Notebook - Size: 42.4 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 24 - Forks: 7

wafflecomposite/15.ai-Python-API

Python3 script for interaction with https://fifteen.ai/

Language: Python - Size: 22.5 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 41 - Forks: 12

spokestack/spokestack-android 📦

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Language: Java - Size: 1.25 MB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 74 - Forks: 10

ZDisket/TensorVox

Desktop application for neural speech synthesis written in C++

Language: C++ - Size: 15.5 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 215 - Forks: 20

yas-sim/csm_voice_encode_synthesis_python

Expermental code for CSM voice synthesis + CSM data generation

Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 1

arniery/andys-project

final assignment for the trinity SLP course "speech processing 2: acoustic modelling": cascade and parallel formant synthesis, the end goal being to produce vowels using both methods.

Language: Jupyter Notebook - Size: 664 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ngpepin/TTS

Convert a markdown file into an audio narration using VCTK and Coqui-TT

Language: Shell - Size: 20 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

olaviinha/NeuralTextToAudio

Text prompt steered synthetic audio generators

Language: Jupyter Notebook - Size: 337 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 46 - Forks: 7

jeanjerome/VoiceGenMeeting

CLI tool that generates synthetic meeting audio from a simple text-based transcript, assigning a unique voice to each speaker.

Size: 4.14 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

nipponjo/mixer-tts-pytorch

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Snigdho8869/text-to-speech-app

A Flask web app that converts text to speech using gTTS.

Language: HTML - Size: 66.4 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Minusree/VoiceAura

VoiceAura is an audio processing pipeline for Singing Voice Conversion using the so-vits-svc-fork framework. It processes your voice, performs source separation, prepares datasets, and includes automatic preprocessing. The pipeline uses tensorboard for training and modifies vocal quality and pitch during inference, producing high-quality outputs.

Language: Jupyter Notebook - Size: 15.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 0

jim-schwoebel/voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Size: 136 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 1,875 - Forks: 237

alexnaughtonjr/Real-Time-Voice-Cloning Fork of CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Size: 352 MB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 0

tjas/postgrad-ai-nlp2-voice-ui

A Voice User Interface tool for Text-to-Speech and Speech-to-Text, built with Python and Django Framework, to solve the proposed exercise in "Cognitive Computing 2: Voice User Interface" discipline.

Language: JavaScript - Size: 871 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 0

MayorX500/VoiceSynth-Agentifai Fork of ddu72/PI

Projeto Informática '24 - Síntese de Voz em Tempo Real - Agentifai

Language: Python - Size: 35.9 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

berlin0308/NTU-2023Fall-Intro-AI

Language: Jupyter Notebook - Size: 144 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

davep/festival.el

festival.el provides a simple interface into the festival speech synthesis program

Language: Emacs Lisp - Size: 52.7 KB - Last synced at: 5 days ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 2

storbeck/bait

Generate realistic IT security alert voicemails using GPT-4 for scripting and ElevenLabs for AI voice synthesis. A Go-based tool for crafting professional-grade alerts with customizable details and natural-sounding audio.

Language: Go - Size: 1.36 MB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

yuanhao-chen-nyoeghau/klatt-api

Flask app to synthesise a vowel based on formant values. Backend for react-klatt.nyoeghau.com

Language: Python - Size: 9.77 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

EX3exp/MiriVoice

Open-Free TTS Platform For All

Language: C# - Size: 125 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 8 - Forks: 2

PepperoniJoe/BeaconDetector

An iOS app that can detect an iBeacon. This app acts as an example Museum app that will display details of an art exhibit when near the exhibit's beacons.

Language: Swift - Size: 34.8 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 5

Harium/espeak-java

espeak java wrapper

Language: Java - Size: 14.6 KB - Last synced at: 18 days ago - Pushed at: over 4 years ago - Stars: 16 - Forks: 8

madworx/tms5220-atmega

TMS5220 exploration unit

Language: C - Size: 3.84 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

Mohamedhany99/Audio-Splitter-per-seconds-python-

This Script takes an audio file (preferred ".wav" filetype) as input and split it for each 3 seconds (editable) then creates a folder and input the splitted audio files and number it in an ascending way.

Language: Python - Size: 13.4 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

yuanhao-chen-nyoeghau/react-klatt

Use formant values to synthesise vowels.

Language: TypeScript - Size: 5.42 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

13-4dev/RVC-model-train-Windows-

pipeline for Mangio RVC Fork

Language: Python - Size: 15.6 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

GeneralNuisance0/Arachne-RECLUSE-neo-

Size: 1000 Bytes - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

weizlogy/alpacartaw

generAtes reaL-time subtitles in multiPle lAnguages ​​from voiCe recognition And Reads Them Aloud. (with obs and obs-Websocket

Language: JavaScript - Size: 298 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

Rennsen/DocuNarrator-AI

DocuNarrator: Narrate Your Life. Inspired by a Twitter post, this project uses AI to create a personal narrator for your life.

Language: Python - Size: 198 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

mehdihosseinimoghadam/Signal-Processing

Signal Processing with Python and Librosa

Language: Jupyter Notebook - Size: 46.6 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 2

jim-schwoebel/nala

🦁 Nala is an agile open-source voice assistant framework (20+ actions).

Language: Python - Size: 40.7 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 35 - Forks: 15

kristoisberg/gonesyntees

Golang client for the voice synthesis service by the Institute of the Estonian Language.

Language: Go - Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 1

Developer-RONNIE/ai-saas

Genius is an innovative AI-SaaS platform. Our platform offers five powerful capabilities: Conversation, Image Generation, Video Generation, Music Generation, and Code Generation. Each feature is crafted to deliver exceptional performance and user satisfaction.

Language: TypeScript - Size: 147 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

sankeer28/DiscordBot-v2

Discord bot made using python with many features including AI chat, music playback, video downloader, OSINT tools, and more

Language: Python - Size: 299 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Mohamedhany99/Voice-Frequency-Extraction-Signal-Processing-

This Script is able to extract Frequency of the voice detected in an audio file (preferred in ".wav" filetype)

Language: Python - Size: 94.7 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 1

DanRuta/xVA-Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

Language: JavaScript - Size: 1.14 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 580 - Forks: 54

temptemp3/polly.sh

Wrapper for aws polly in bash

Language: Shell - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

ZTiKnl/sara_old 📦

Sara is a prompt that: listens for commands (keyboard or voice recognition), executes a built in command or a plugin based on regular expression string matching, then uses text-to-speech give the answer. Now with vision support through USB webcam (WIP).

Language: JavaScript - Size: 1.66 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Azure-Samples/Cognitive-Services-Voice-Assistant

Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building client application for your bot or Custom Command service. You will also be able to easily deploy a working Custom Command based Voice Assistant to your own Azure subscription

Language: C++ - Size: 76.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 92 - Forks: 99

DLehenbauer/c64-sam

Documented 6502 assembly code for the SAM voice synthesizer

Language: Assembly - Size: 69.3 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 1

yas-sim/csm_voice_synthesis_ym2203_python

An experimental code of CSM (composite sinusoidal modeling) voice synthesis with Python

Language: Jupyter Notebook - Size: 3.63 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

lugomio/browser-speech-synthesis

The project uses the browser's speech synthesis feature to transform text typed by the user into voice.

Language: HTML - Size: 21.5 KB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

YuzukiTsuru/SinsyPlus

Singing Voice Synthesis System based on Sinsy

Language: Python - Size: 6.31 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 3

radoslawregula/VoxG

Singing voice synthesizer using GANs

Language: Python - Size: 145 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

CheapCyborg/SpeakEasy

SpeakEasy - Real time translations and text-to-speech

Language: Python - Size: 328 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

colejd/jon-trombone

A poor use case for voice synthesis

Language: JavaScript - Size: 2.88 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

kdffdwsfgdw43331/iidia

Size: 1.95 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

RG-7/RoboSpeaker

This is a simple text to speech translator developed using python

Language: Python - Size: 3.21 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

omolewadavids/RetinopathyChatBot

Image Classification/Natural Language Processing: An AI-enabled conversational chatbot that helps diabetic patients in detecting diabetic retinopathy, give informations about the symptoms, treatments, researches going on, all sort of information about the disease. This patient-care app also find the nearest eye hospital near the patient for emergency visits

Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

SforAiDl/Neural-Voice-Cloning-With-Few-Samples 📦

This repository has implementation for "Neural Voice Cloning With Few Samples"

Language: Python - Size: 42.3 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 415 - Forks: 121

hujinsen/pytorch-StarGAN-VC

Fully reproduce the paper of StarGAN-VC. Stable training and Better audio quality .

Language: Python - Size: 79 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 235 - Forks: 59

thedigitalchief/voice-command-assistant

Powerful assistant performing powerful automated tasks from user’s voice inputs. Developed using machine learning and speech synthesis Python frameworks.

Language: Python - Size: 31.3 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

JollyToday/GhostCut-auto_video_translation

auto video translation-video translator can auto translate video hard subtitles, auto video translation and dubbing, remove any video text, auto remove video subtitles/text. 自动视频翻译配音,自动翻译视频字幕和回填样式,自动硬字幕翻译。

Language: Python - Size: 101 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 43 - Forks: 8

intunist/nnsvs-japanese-plus

Custom HED and Table for Intunist Japanese

Size: 192 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 1

brycehowitson/SSML-prosody-library

A collection of pre-built speech synthesis settings used to convey emotion

Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 11 - Forks: 3

SethKitchen/SethVoice

Making an AI voice from my speaking

Language: Jupyter Notebook - Size: 268 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

kazukiotsuka/FFTNet

implementation of Zeyu et al.「FFTNET: A REAL-TIME SPEAKER-DEPENDENT NEURAL VOCODER」

Language: Jupyter Notebook - Size: 27.3 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

PepperoniJoe/Voices

An iOS app that reads any typed text using one of many voices.

Language: Swift - Size: 17.2 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

shun60s/Vocal-Tube-Model

a very simple vocal tract model, few tube model. generate vowel sound by it

Language: Python - Size: 477 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 16 - Forks: 3

eros71-dev/mario-voice-dataset

A dataset for Mario's voice (Charles Martinet), from the Super Mario franchise. More info here: https://uberduck.ai/about

Size: 21.7 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1