An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: vits

fishaudio/fish-speech

SOTA Open Source TTS

Language: Python - Size: 18.5 MB - Last synced at: about 7 hours ago - Pushed at: 5 days ago - Stars: 22,878 - Forks: 1,884

PhamHuynhAnh16/Vietnamese-RVC

Dự án công cụ chuyển đổi giọng nói dành cho người Việt

Language: Python - Size: 15.9 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 14 - Forks: 5

JackismyShephard/ultimate-rvc

An app for creating audio-based content such as song covers and speech using Retrieval-based Voice Conversion.

Language: Python - Size: 7.74 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 147 - Forks: 34

XxAZVDxX/Local-Live2D-AI-Girlfriend-iOS

Let’s start chatting with your Live2D or VRM girlfriend in iOS (Support Local LLM, API, Apple Intelligent)

Size: 17.5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

Language: Python - Size: 41.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2,558 - Forks: 428

k2-fsa/sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, support 12 programming languages

Language: C++ - Size: 10 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 7,260 - Forks: 847

Mobile-Artificial-Intelligence/babylon.cpp

Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.

Language: Python - Size: 324 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 23 - Forks: 3

fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

Language: Python - Size: 7.95 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 8,552 - Forks: 1,223

yeyupiaoling/VITS-Pytorch

本项目是基于Pytorch的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复杂的流程,直接一键训练和生成,大大降低了学习门槛。

Language: Python - Size: 2.89 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 53 - Forks: 9

voicepaw/so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language: Python - Size: 20.2 MB - Last synced at: 6 days ago - Pushed at: 21 days ago - Stars: 9,102 - Forks: 1,219

PlayVoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language: Python - Size: 41.3 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 2,813 - Forks: 925

High-Logic/Genie

GPT-SoVITS ONNX Inference Engine & Model Converter

Language: Python - Size: 770 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 216 - Forks: 10

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language: Python - Size: 13.9 MB - Last synced at: 8 days ago - Pushed at: 10 months ago - Stars: 31,713 - Forks: 4,453

Aivis-Project/AivisSpeech-Engine

AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine

Language: Python - Size: 335 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 125 - Forks: 16

codename0og/codename-rvc-fork-3 📦

Codename's rvc fork version 3, based on Applio.

Language: Python - Size: 4.38 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 37 - Forks: 4

LlmKira/VitsServer

🌻 VITS ONNX TTS server designed for fast inference 🔥

Language: Python - Size: 2.18 MB - Last synced at: 2 days ago - Pushed at: 7 months ago - Stars: 128 - Forks: 7

changzy00/pytorch-attention

🦖Pytorch implementation of popular Attention Mechanisms, Vision Transformers, MLP-Like models and CNNs.🔥🔥🔥

Language: Python - Size: 3.5 MB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 502 - Forks: 49

thewh1teagle/piper-rs

Use piper TTS models in Rust

Language: Rust - Size: 162 KB - Last synced at: about 7 hours ago - Pushed at: 9 months ago - Stars: 32 - Forks: 18

setiawand/coqui-tts-id

TTS (Text-to-Speech) Bahasa Indonesia menggunakan Coqui TTS dengan model VITS multi-speaker. Mendukung normalisasi teks khusus Indonesia dan 80+ speaker voice.

Language: Python - Size: 83 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

blaisewf/rvc-cli 📦

🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!

Language: Python - Size: 3.26 MB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 214 - Forks: 48

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python - Size: 13.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 49,530 - Forks: 5,433

Arizoonaa/LipIt_README

⭐SSAFY 12기 특화 프로젝트 2등 수상⭐

Size: 123 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 3

Aivis-Project/AivisSpeech

AivisSpeech: AI Voice Imitation System - Text to Speech Software

Language: TypeScript - Size: 59.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 378 - Forks: 17

Ikaros-521/AI-Vtuber Fork of sandboxdream/AI-Vtuber

AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。

Language: Python - Size: 820 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3,970 - Forks: 603

Voine/Bert-VITS2-MNN

TTS System Bert-VITS2 Android Ver, powered by alibaba-MNN engine.

Language: Kotlin - Size: 38.9 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 76 - Forks: 8

Hexanol777/Kikiyomu

聞き読む. real-time text-to-speech tool for VNs

Language: Python - Size: 58.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

PlayVoice/vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Language: Python - Size: 2.97 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1,205 - Forks: 177

yukiarimo/hanasu

Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture

Language: Python - Size: 5.64 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 31 - Forks: 5

Artrajz/vits-simple-api

A simple VITS HTTP API, developed by extending Moegoe with additional features.

Language: Python - Size: 14.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 984 - Forks: 130

huakunyang/SummerTTS

SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目,可以本地运行不需要网络,而且没有额外的依赖,一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synthesis(TTS) project that has almost no dependency and could be easily used for Chinese TTS with just one key build out

Language: C++ - Size: 310 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 466 - Forks: 86

Convbased/Convbased-Studio

A higher quality RVC pretrained model to accelerate your training process.

Size: 99.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 14 - Forks: 1

karim23657/Persian-tts-coqui

Persian/Farsi text to speech(TTS) training using coqui tts

Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 157 - Forks: 19

btseee/mongol-tts

Lightweight Mongolian Text-To-Speech with Vits and CoquiTTS

Language: Python - Size: 655 KB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

soulteary/simple-image-search-engine

图片搜索引擎,很简单。三步构建属于你自己的图片搜索引擎,掌握向量数据库和以图搜图、文本搜索图片。

Language: Python - Size: 200 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 145 - Forks: 42

PlayVoice/VI-Speaker

Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.

Language: Python - Size: 62.5 KB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 30 - Forks: 4

tsukumijima/Aivis

開発休止中ですが、将来的に Aivis-Project/AivisBuilder として大幅リニューアル予定のリポジトリです

Language: Python - Size: 662 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 150 - Forks: 13

SUC-DriverOld/so-vits-svc-Deployment-Documents

So-VITS-SVC 本地部署使用帮助文档,提供Colab笔记本 So-VITS-SVC Local Deployment Document and provide Colab notebook

Language: Jupyter Notebook - Size: 349 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 718 - Forks: 108

Rtwotwo/Visual-Locator

Visual Locator is a method used to carry out absolute drone visual positioning in the case of GPS rejection. The software mainly includes the retrieval and registration part of the model and the data set production part.

Language: Python - Size: 323 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

legekka/GanyuTTS

A small VITS+SOVITS/RVC TTS API

Language: Python - Size: 163 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 35 - Forks: 6

34j/awesome-vits 📦

List of repositories relevant to VITS.

Size: 5.86 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 36 - Forks: 1

PriesiaMioShirakana/DragonianVoice

多个SVC/TTS的C++推理库

Language: C - Size: 101 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1,077 - Forks: 126

SadeghKrmi/pertts-streamlit

Persian text-to-speech streamlit interface

Language: Python - Size: 266 MB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 41 - Forks: 5

QuantiusBenignus/voluble

Let your GNOME desktop speak to you. Reads your desktop notifications or selected text out-loud with human-like voice using Piper. Uses a local LLM to summarize selected text.

Language: JavaScript - Size: 231 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 9 - Forks: 0

IRedDragonICY/vixevia

An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.

Language: Python - Size: 42.5 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 40 - Forks: 4

open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python - Size: 127 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 9,091 - Forks: 719

qiaolinwang/VITS Fork of AlexandaJerry/vits-mandarin-biaobei

Implementation of the VITS model

Language: Jupyter Notebook - Size: 3.1 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 402 - Forks: 76

Voine/ChatWaifu_Mobile

移动版二次元 AI 老婆聊天器

Language: C++ - Size: 415 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 1,297 - Forks: 142

seanghay/KLEA

An open-source Khmer Word to Speech Model. Just single word not sentence!

Language: Python - Size: 610 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 3

PlayVoice/lora-svc

singing voice change based on whisper, and lora for singing voice clone

Language: Python - Size: 17.6 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 636 - Forks: 78

reshalfahsi/AI-Cover-Song

Cover Song Powered by SoftVC VITS

Language: Jupyter Notebook - Size: 10.7 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 8

Rtwotwo/MMChat

MMChat aims to combine current cutting-edge technologies and published big language models to build a dynamic and interactive multi-modal platform, covering vision, language, 3D, text processing and other aspects of processing.

Language: Python - Size: 84 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Bebra777228/PolGen-RVC

Преобразование голоса на основе VITS. Ориентировано на простоту, качество и производительность.

Language: Python - Size: 7.44 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 33 - Forks: 5

Panzer-Jack/Cyber_AI-Waife

Cyber AI-Waife | 这是一个有灵魂的赛博女朋友 | Web-Live2D with LLM and VITS

Size: 55.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 26 - Forks: 0

Panzer-Jack/easy-live2d-ai

赛博老婆随意链接 | 一个集成了 Web 端 Live2D 动画、LLM 智能对话与 VITS 语音合成的 通用SDK | A universal SDK that integrates Web-based Live2D animation, LLM intelligent dialogue, and VITS speech synthesis.

Size: 6.84 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

svjack/Genshin-Impact-Fan-Video

一个《原神》AI驱动视频项目,利用LLM API生成角色互动文案,VITS技术进行语音合成,并结合先进的文生图和视频合成技术,创造出游戏角色之间有趣的场景。最终产出为短视频。

Language: Python - Size: 4.92 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 14 - Forks: 4

gokhaneraslan/tts-dataset-generator

With this tool you can create custom TTS dataset from video or audio.

Language: Python - Size: 65.4 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

UFOAlastor/AI-Waifu-Project-LaIN

一个拥有长期记忆, 表情动作, 语音对话/打断/声纹识别, FunctionCall, 多模型支持的AI Waifu客户端.

Language: Python - Size: 62.3 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 13 - Forks: 0

Eerrly/VITSAIChatVtube

使用Torch VITS语音合成,结合OpenAI ChatGPT进行互动

Language: Python - Size: 31.9 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 34 - Forks: 3

sayakpaul/probing-vits

Probing the representations of Vision Transformers.

Language: Jupyter Notebook - Size: 33.3 MB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 324 - Forks: 20

slegroux/nimrod

minimal deep learning framework

Language: Jupyter Notebook - Size: 119 MB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

PlayVoice/Grad-SVC

Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei

Language: Python - Size: 2.25 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 144 - Forks: 14

PlayVoice/VI-SVS

Singing Voice Synthesis based on VITS, different from VISinger

Language: Python - Size: 2.14 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 190 - Forks: 32

mahshid1378/so-vits-svc-fork

realtime support, improved interface and more features.

Language: Python - Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

seanghay/vits.cpp

VITS Inference using ONNX Runtime on C++

Language: C++ - Size: 12.7 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 1

zwssunny/pingo

pingo智能演示平台

Language: Python - Size: 1.55 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

AkiKurisu/VirtualHuman-Unity 📦

VirtualHuman is a Unity Plugin to use LLM&&VITS easily

Language: C# - Size: 3.13 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 8

newreport/vtbai

ai live,ai vtb,bilibili live with chatgpt,基于chatgpt后端进行ai直播,配合obs和vts,需要图形化界面

Language: Python - Size: 1.96 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 109 - Forks: 23

hcy71o/SC-VITS Fork of jaywalnut310/vits

VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.

Language: Python - Size: 25.3 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 36 - Forks: 2

rotten-work/vits-mandarin-windows Fork of jaywalnut310/vits

VITS for Mandarin. Support Windows and Linux, low-end and high-end hardwares

Language: Jupyter Notebook - Size: 5.69 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 110 - Forks: 13

Richardn2002/shizuka-app

Data curation, training and deployment of VITS model(s) of 好本静, Yoshimoto Shizuka, from 君のことが大大大大大好きな100人.

Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 4 - Forks: 0

Aqirito/A.L.I.C.E

A.L.I.C.E (Artificial Labile Intelligence Cybernated Existence). A REST API of A.I companion for creating more complex system

Language: Python - Size: 22.2 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 7 - Forks: 0

svc-develop-team/so-vits-svc 📦

SoftVC VITS Singing Voice Conversion

Language: Python - Size: 10.6 MB - Last synced at: 8 months ago - Pushed at: almost 2 years ago - Stars: 26,385 - Forks: 4,900

Akegarasu/vits-webui

VITS web UI

Language: Python - Size: 158 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 43 - Forks: 7

chinosk6/umamusume-voice-text-extractor

Extract the voice and corresponding text

Language: C# - Size: 2.55 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 74 - Forks: 9

al3xkras/sovits-svc-tools-docker

A unified docker environment combining SoVITS SVC fork, UVR5, audio-separator and pyannote.audio.

Language: Python - Size: 11.7 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

yuanhao-chen-nyoeghau/shanghainese-tts

Shanghainese TTS

Language: Jupyter Notebook - Size: 1.98 GB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 21 - Forks: 5

blycr/GPT-SoVITS-FemV_v2

根据GPT-SoVITS项目制作的Female V的SoVITS模型

Size: 1.12 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

vtuber-plan/vcvits

Non Parallel Voice Conversion based on VITS

Language: Python - Size: 8.52 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 3

FENRlR/MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

Language: Python - Size: 3.11 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 117 - Forks: 28

PlayVoice/VI-SVC

VI-SVC model is just VITS without MAS and DurationPredictor.

Language: Python - Size: 16.5 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 0

yeyupiaoling/VITS-PaddlePaddle

本项目是基于PaddlePaddle的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复杂的流程,直接一键训练和生成,大大降低了学习门槛。

Language: Python - Size: 2.98 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

AI-Hobbyist/Models

Acoustic models for SVS/SVC/TTS

Size: 1.67 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 27 - Forks: 2

liupf1122/Multifunctional-chat-software-system

基于socket的网络聊天室;异步爬虫;LLM;VITS

Language: Python - Size: 67.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

ORI-Muchim/One-Click-VITS-Training

VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification config.json + Training, Inference)

Language: Python - Size: 67.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 33 - Forks: 5

ORI-Muchim/PolyLangVITS

Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)

Language: Python - Size: 30.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 68 - Forks: 7

weirdseed/Vits-Android-ncnn

vits Android部署

Language: C++ - Size: 1.92 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 293 - Forks: 49

ORI-Muchim/One-Click-MB-iSTFT-VITS2

MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making config.json + Training, Inference) ONE-CLICK

Language: Python - Size: 28.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 0

blaise-tk/VoRAS

VoRAS: Vocos Retrieval and self-Augmentation for Speech

Language: Python - Size: 2.33 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 2

Mofa-Xingche/Bert-VITS2-2.2-models-jp-6-speaker-tts

Download model link (bert-vits2-2.2)

Size: 25.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

litagin02/vits-japros-webui 📦

日本語TTS(VITS)の学習と音声合成のGradio WebUI

Language: Python - Size: 2.01 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 33 - Forks: 3

yakami129/huggingface-java-sdk

这是一个huggingface语音模型的sdk,可以调用huggingface上的API获取语音文件

Language: Java - Size: 66.4 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

HaomingXR/vits-webui Fork of Artrajz/vits-simple-api 📦

A simple Webui for VITS Inference with API support, built on MoeGoe

Language: Python - Size: 9.92 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Hecate2/sukasuka-vocal-dataset-builder

すかすかアニメボカロデータセット。1st anime vocal dataset. Extract audio (vocal) files from video based on .ass subtitle files; manually label vocal files to characters. Will be used for PITS/VITS/Diffusion text-to-speech/SVC. 根据字幕,从视频里抽取全部语音,然后手动按角色标注。

Language: Python - Size: 1.19 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 35 - Forks: 3

ddPn08/rvc-webui

liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project

Language: Python - Size: 398 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 374 - Forks: 56

thg80/VITS_AIchat

Language: Python - Size: 47.5 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

mobassir94/comprehensive-bangla-tts

Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural voice cloning systems for Bangla for the first time, supporting different SOTA models for Bangla and also Multilingual (Arabic+Bengali) code mixed TTS pipeline.

Language: Jupyter Notebook - Size: 57.4 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 25 - Forks: 4

Northerner1/RVC-WebUI-localization-ru_RU

Русская локализация для Retrieval-based-Voice-Conversion-WebUI / Russian localization for Retrieval-based-Voice-Conversion-WebUI

Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

weirdseed/vits-ncnn-convert-tool

Language: Python - Size: 35.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 45 - Forks: 11

Slebee/ChatGPT-live2d-Desktop

A chatgpt client with support for claude, live2d table favourites and Vits text-to-speech

Language: TypeScript - Size: 15.8 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 0

QuyAnh2005/vits-japanese

Text to Speech for Japanese

Language: Python - Size: 3.76 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0