GitHub / YingqingHe / Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Stars: 455
Forks: 26
Open issues: 0
License: None
Language: HTML
Size: 12.7 MB
Dependencies parsed at: Pending
Created at: over 1 year ago
Updated at: 6 days ago
Pushed at: 16 days ago
Last synced at: 6 days ago
Topics: aigc, large-language-models, large-vision-language-models, llm, lvlm, mllm, multimodal-generation, multimodal-large-language-models, multimodal-models, multimodality, text-to-3d, text-to-audio, text-to-image, text-to-music, text-to-sound, text-to-speech, text-to-video