An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: text-to-sound

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

Language: HTML - Size: 12.7 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 472 - Forks: 26

zhenye234/xcodec

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Language: Python - Size: 1.77 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 200 - Forks: 12

kennethleungty/Text-to-Audio-with-Bark

Exploring Bark, the Open-Source Text-to-Audio Generative Model

Language: Jupyter Notebook - Size: 2.67 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 4

ericpesto/ai-sample-generator

Create .wav audio samples with text-to-sound generative AI

Language: Python - Size: 2.83 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 3