Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: text-to-audio
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language: HTML - Size: 1.75 MB - Last synced: about 23 hours ago - Pushed: 4 days ago - Stars: 105 - Forks: 5
gitmylo/audio-webui
A webui for different audio related Neural Networks
Language: Python - Size: 703 KB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 930 - Forks: 89
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python - Size: 10.3 MB - Last synced: 4 days ago - Pushed: 7 days ago - Stars: 4,023 - Forks: 333
bnsantoso/sub-to-audio
Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.
Language: Python - Size: 99.6 KB - Last synced: 2 days ago - Pushed: 6 months ago - Stars: 91 - Forks: 9
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Language: Python - Size: 961 KB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 695 - Forks: 103
RhythrosaLabs/soundstorm
Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusiasts. From sample pack creation and algorithmic composition to AI text-to-audio and onscreen ChatGPT, Soundstorm is a sonic powerhouse.
Language: Python - Size: 3.38 MB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 20 - Forks: 6
serp-ai/ai-text-to-audio-latent-diffusion Fork of Harmonai-org/sample-generator
text-to-audio-latent-diffusion
Language: Python - Size: 58.2 MB - Last synced: 15 days ago - Pushed: 9 months ago - Stars: 30 - Forks: 8
lucidrains/nuwa-pytorch
Implementation of NÃœWA, state of the art attention network for text to video synthesis, in Pytorch
Language: Python - Size: 1.78 MB - Last synced: 28 days ago - Pushed: over 1 year ago - Stars: 534 - Forks: 62
AMAAI-Lab/mustango
Mustango: Toward Controllable Text-to-Music Generation
Language: Python - Size: 54.2 MB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 277 - Forks: 22
Kartiksood10/Text-to-Music-Generation-App
Generate Music using natural language prompts using Meta's MusicGen Small Model.
Language: Python - Size: 11.7 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
declare-lab/tango
Hosts a family of diffusion models for text-to-audio generation.
Language: Python - Size: 17.7 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 880 - Forks: 68
Ate329/SentiMusic
A text-to-audio application that turns words and sentiments into melodies.
Language: Python - Size: 3.38 MB - Last synced: 16 days ago - Pushed: about 2 months ago - Stars: 2 - Forks: 0
happylittlecat2333/Auffusion
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
Language: Jupyter Notebook - Size: 23.9 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 106 - Forks: 10
kennethleungty/Text-to-Audio-with-Bark
Exploring Bark, the Open-Source Text-to-Audio Generative Model
Language: Jupyter Notebook - Size: 2.67 MB - Last synced: 3 months ago - Pushed: 8 months ago - Stars: 13 - Forks: 4
vishalnagda1/text-to-speech
Python program to convert text to speech.
Language: Python - Size: 6.84 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 3 - Forks: 16
ilaria-manco/word2wave
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
Language: Python - Size: 1.15 MB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 116 - Forks: 16
inferless/bark
Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying.
Language: Python - Size: 24.4 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 2 - Forks: 4
Consistency-TTA/consistency-tta.github.io
Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Language: HTML - Size: 144 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7 - Forks: 0
PapayaResearch/ctag
Creative Text-to-Audio Generation via Synthesizer Programming @ NeurIPS'23 ML4Audio Workshop
Language: Python - Size: 106 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
darylalim/generate-audio-audiocraft-audiogen
Generate audio from text with AudioCraft AudioGen.
Language: Jupyter Notebook - Size: 7.74 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
anverbogatov/text-to-audio-tool
Simple Java based console tool that transforms your text files to audio files.
Language: Java - Size: 32.2 KB - Last synced: 6 months ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0
artinmohajeri/tkinter-text-to-voice
Language: Python - Size: 49.8 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
ahsplore/TalkitOut-TTS-web-application-python
TalkItOut is a Python and Flask-based web application that can convert text to speech, choose your preferred language for audio output, access a built-in dictionary for word meanings, and even extract text from images, complete with audio generation.
Language: HTML - Size: 9.13 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 4 - Forks: 1
keonlee9420/WaveGrad2
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Language: Python - Size: 18 MB - Last synced: 7 months ago - Pushed: almost 3 years ago - Stars: 64 - Forks: 14
camenduru/audioldm-colab
AudioLDM text to audio colab
Language: Jupyter Notebook - Size: 24.4 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 15 - Forks: 1
mohaimenulislamshawon/text-to-voice-speech-converter
The program is created based on google text to speech or voice converter machine. You can convert top 20 languages with this convert. I have made this for the educational & experimental perpose.
Language: HTML - Size: 12.7 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0
ivan-guerra/morse
A text to Morse code translator
Language: C++ - Size: 80.1 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0
Ajmal112/texttospeech
The Text-to-Speech website is a testing API project that enables users to effortlessly convert text or sentences into MP3 audio files. With its user-friendly interface, users can simply input their desired text, initiate the conversion process, and obtain an audio file in seconds, facilitating convenient access to spoken content from written text.
Language: HTML - Size: 5.86 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
OpenGenus/audio
Tool to convert an article to a summarized audio version [developed by OG Intern Ambarish Deb]
Language: Python - Size: 7.1 MB - Last synced: 9 days ago - Pushed: 11 months ago - Stars: 0 - Forks: 1
brayanjeshua/chatgpt-to-speech
CHATGPT Text-to-Speech Application
Language: JavaScript - Size: 9.77 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0
Djmcflush/RaveFussion
A text to audio pipeline using Riffusion (a finetuned stablediffusion model) and using RAVE a audio to audio AutoEncoder.
Language: Python - Size: 8.98 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 4 - Forks: 3
saba99/Text-To-Audio-ChatGPT
Text To Audio (Voice, Music) -Support Chat-GPT
Language: Python - Size: 0 Bytes - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
mython-dev/text-to-audio-converter
Convert Audio to Text using Telebot gTTS
Language: Python - Size: 128 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
techguy940/text-to-speech
Text-to-Speech
Language: Python - Size: 2.93 KB - Last synced: 9 months ago - Pushed: over 4 years ago - Stars: 1 - Forks: 1