An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: xttsv2

Degon3399/XTTS_V2

This repository offers a framework for fine-tuning the XTTS_V2 model, focusing on multilingual text-to-speech applications. It includes tools for both full model fine-tuning and LoRA fine-tuning, along with inference scripts for easy speech synthesis. 🐙🌐

Language: Python - Size: 269 KB - Last synced at: about 3 hours ago - Pushed at: about 6 hours ago - Stars: 1 - Forks: 1

Mohamedfat7i/local-voice-cloning-app

🔊 Clone voices easily with this lightweight Python app that synthesizes audio using a simple voice-cloning workflow.

Size: 118 KB - Last synced at: about 14 hours ago - Pushed at: about 15 hours ago - Stars: 0 - Forks: 0

daswer123/xtts-api-server

A simple FastAPI Server to run XTTSv2

Language: Python - Size: 2.26 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 536 - Forks: 137

camdenFletcher03/XTTS_GUI

This program is designed to provide a graphical user interface for the xtts_api_server project: https://github.com/daswer123/xtts-api-server

Language: Python - Size: 51.8 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

toutia/inference_server

Inference server based on Nvidia Riva

Language: Shell - Size: 782 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

noco-ai/spellbook-docker

AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models

Language: Shell - Size: 2.39 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 163 - Forks: 13

art-from-the-machine/Mantella

Mantella is a Skyrim and Fallout 4 mod which allows you to naturally speak to NPCs using Whisper (speech-to-text), LLMs (text generation), and Piper / xVASynth / XTTS (text-to-speech).

Language: Python - Size: 187 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 286 - Forks: 72

Arizoonaa/LipIt_README

⭐SSAFY 12th cohort specialization project, 2nd place award⭐

Size: 123 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 3

AIFSH/ComfyUI-XTTS

A custom ComfyUI node for coqui-ai/TTS's XTTS module. Supports voice cloning and TTS in 17 languages.

Language: Python - Size: 847 KB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 60 - Forks: 16

jpoll962/coqui-ai-TTS Fork of idiap/coqui-ai-TTS

My fork of idiap's coqui-ai-TTS repo - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python - Size: 134 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

veralvx/xtts-gradio Fork of coqui-ai/TTS

Run XTTS within Docker/Podman for voice fine-tuning in a Web UI

Language: Python - Size: 133 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

astramind-ai/Auralis

A Fast TTS Engine

Language: Python - Size: 2 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 517 - Forks: 38

gokhaneraslan/XTTS_V2

Training XTTS V2 and PEFT LORA Text-to-Speech (TTS)

Language: Python - Size: 275 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

daswer123/xtts-webui

Webui for using XTTS and for finetuning it

Language: Python - Size: 2.76 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 808 - Forks: 158

overcrash66/OpenTranslator

Open Translator: a speech-to-speech and speech-to-text translator with voice cloning and other cool features

Language: Python - Size: 7.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 8 - Forks: 3

KickerMix/Discord-Local-LLM-VoiceChat-Bot

Saya Voice Assistant, an AI voice bot for Discord: listens, detects keywords, chats via LM Studio, and replies with TTS or voice cloning.

Language: Python - Size: 2.79 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

gokhaneraslan/tts-dataset-generator

With this tool you can create a custom TTS dataset from video or audio.

Language: Python - Size: 65.4 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

lukaszliniewicz/Pandrator

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

Language: Python - Size: 8.11 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 450 - Forks: 35

asiff00/Training-TTS

Train and fine-tune text-to-speech models for Bengali and many other languages!

Language: Jupyter Notebook - Size: 140 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 2

lukaszliniewicz/easy_xtts_trainer

A command line utility to easily finetune XTTS models in a fully automated way. Developed for Pandrator.

Language: Python - Size: 368 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

lef-fan/aria

A local and uncensored AI entity.

Language: Python - Size: 7.7 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 61 - Forks: 14

Zuellni/XTTS-Server 📦

XTTS server for SillyTavern.

Language: Python - Size: 44.9 KB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

0x41337/xtts2-ui Fork of BoltzmannEntropy/xtts2-ui

A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech

Language: Python - Size: 5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

tuanh123789/Train_Hifigan_XTTS

An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS.

Language: Python - Size: 268 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 59 - Forks: 21

MrZeroX1/Renan-Model Fork of Haurrus/xtts-trainer-no-ui-auto

This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script uses custom datasets and CUDA for accelerated training.

Language: Python - Size: 533 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SilkReyn/MAS-xttsClient

Submod for Monika-After-Story that generates voice for Monika's dialogue by interfacing with XTTS-Server-API speech synthesis from daswer123

Language: Ren'Py - Size: 1.69 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

pbanuru/xtts2-ui Fork of BoltzmannEntropy/xtts2-ui

A User Interface for XTTS-2 Text-Based Voice Cloning with 10 seconds of speech

Language: Python - Size: 5 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

omenius/epub2mp3

Converts epub e-book files to mp3 audiobook files.

Language: Python - Size: 74.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

nsourlos/voice_cloning_tools

Various tools to clone a voice

Language: Jupyter Notebook - Size: 561 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0