Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: text-to-audio

Repositories

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLM-based multimodal generation (image, video, 3D and audio).

Language: HTML - Size: 1.75 MB - Last synced: about 23 hours ago - Pushed: 4 days ago - Stars: 105 - Forks: 5

gitmylo/audio-webui

A web UI for various audio-related neural networks

Language: Python - Size: 703 KB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 930 - Forks: 89

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python - Size: 10.3 MB - Last synced: 4 days ago - Pushed: 7 days ago - Stars: 4,023 - Forks: 333

bnsantoso/sub-to-audio

Subtitle to audio: generates audio from any subtitle file using Coqui-ai TTS and synchronizes the audio with the subtitle timing.

Language: Python - Size: 99.6 KB - Last synced: 2 days ago - Pushed: 6 months ago - Stars: 91 - Forks: 9

Text-to-Audio/Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Language: Python - Size: 961 KB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 695 - Forks: 103

RhythrosaLabs/soundstorm

Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusiasts. From sample pack creation and algorithmic composition to AI text-to-audio and onscreen ChatGPT, Soundstorm is a sonic powerhouse.

Language: Python - Size: 3.38 MB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 20 - Forks: 6

serp-ai/ai-text-to-audio-latent-diffusion (fork of Harmonai-org/sample-generator)

text-to-audio-latent-diffusion

Language: Python - Size: 58.2 MB - Last synced: 15 days ago - Pushed: 9 months ago - Stars: 30 - Forks: 8

lucidrains/nuwa-pytorch

Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch

Language: Python - Size: 1.78 MB - Last synced: 28 days ago - Pushed: over 1 year ago - Stars: 534 - Forks: 62

AMAAI-Lab/mustango

Mustango: Toward Controllable Text-to-Music Generation

Language: Python - Size: 54.2 MB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 277 - Forks: 22

Kartiksood10/Text-to-Music-Generation-App

Generate music from natural language prompts using Meta's MusicGen Small model.

Language: Python - Size: 11.7 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

declare-lab/tango

Hosts a family of diffusion models for text-to-audio generation.

Language: Python - Size: 17.7 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 880 - Forks: 68

Ate329/SentiMusic

A text-to-audio application that turns words and sentiments into melodies.

Language: Python - Size: 3.38 MB - Last synced: 16 days ago - Pushed: about 2 months ago - Stars: 2 - Forks: 0

happylittlecat2333/Auffusion

Official code and models for the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Language: Jupyter Notebook - Size: 23.9 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 106 - Forks: 10

kennethleungty/Text-to-Audio-with-Bark

Exploring Bark, the Open-Source Text-to-Audio Generative Model

Language: Jupyter Notebook - Size: 2.67 MB - Last synced: 3 months ago - Pushed: 8 months ago - Stars: 13 - Forks: 4

vishalnagda1/text-to-speech

Python program to convert text to speech.

Language: Python - Size: 6.84 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 3 - Forks: 16

ilaria-manco/word2wave

Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.

Language: Python - Size: 1.15 MB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 116 - Forks: 16

inferless/bark

Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying.

Language: Python - Size: 24.4 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 2 - Forks: 4

Consistency-TTA/consistency-tta.github.io

Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation

Language: HTML - Size: 144 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7 - Forks: 0

PapayaResearch/ctag

Creative Text-to-Audio Generation via Synthesizer Programming @ NeurIPS'23 ML4Audio Workshop

Language: Python - Size: 106 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

darylalim/generate-audio-audiocraft-audiogen

Generate audio from text with AudioCraft AudioGen.

Language: Jupyter Notebook - Size: 7.74 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

anverbogatov/text-to-audio-tool

A simple Java-based console tool that converts text files to audio files.

Language: Java - Size: 32.2 KB - Last synced: 6 months ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

artinmohajeri/tkinter-text-to-voice

Language: Python - Size: 49.8 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

ahsplore/TalkitOut-TTS-web-application-python

TalkItOut is a Python and Flask-based web application that converts text to speech, lets you choose the language of the audio output, provides a built-in dictionary for word meanings, and extracts text from images with audio generation.

Language: HTML - Size: 9.13 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 4 - Forks: 1

keonlee9420/WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Language: Python - Size: 18 MB - Last synced: 7 months ago - Pushed: almost 3 years ago - Stars: 64 - Forks: 14

camenduru/audioldm-colab

AudioLDM text to audio colab

Language: Jupyter Notebook - Size: 24.4 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 15 - Forks: 1

mohaimenulislamshawon/text-to-voice-speech-converter

This program is based on Google Text-to-Speech. It can convert text in 20 of the most widely used languages. I made it for educational and experimental purposes.

Language: HTML - Size: 12.7 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0

ivan-guerra/morse

A text to Morse code translator

Language: C++ - Size: 80.1 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0

Ajmal112/texttospeech

The Text-to-Speech website is a testing API project that enables users to effortlessly convert text or sentences into MP3 audio files. With its user-friendly interface, users can simply input their desired text, initiate the conversion process, and obtain an audio file in seconds, facilitating convenient access to spoken content from written text.

Language: HTML - Size: 5.86 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0