Topic: "diffusion-models"
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Language: HTML - Size: 1.84 MB - Last synced at: 11 days ago - Pushed at: 9 months ago - Stars: 11,638 - Forks: 972

Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Language: Python - Size: 71.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9,694 - Forks: 833

Tencent/Hunyuan3D-2
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Language: Python - Size: 79.2 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 8,423 - Forks: 706

openvinotoolkit/openvino
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Language: C++ - Size: 844 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8,132 - Forks: 2,573

FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language: Jupyter Notebook - Size: 620 KB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 7,408 - Forks: 463

open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
Language: Jupyter Notebook - Size: 31.3 MB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 7,131 - Forks: 1,079

yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language: Python - Size: 131 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 5,620 - Forks: 529

Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Language: Python - Size: 10.2 MB - Last synced at: 13 days ago - Pushed at: 9 months ago - Stars: 4,985 - Forks: 424

showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, and various other applications.
Size: 681 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4,299 - Forks: 253

bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
Language: Python - Size: 9.06 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 3,589 - Forks: 526

Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python - Size: 126 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 3,339 - Forks: 293

YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
Size: 272 KB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 3,157 - Forks: 263

ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Language: Python - Size: 62.5 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 3,099 - Forks: 275

zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Size: 197 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 3,024 - Forks: 512

TingsongYu/PyTorch-Tutorial-2nd
《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。
Language: Jupyter Notebook - Size: 84.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2,957 - Forks: 324

deepseek-ai/DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Language: Python - Size: 66.3 MB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 2,901 - Forks: 346

jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Language: Python - Size: 929 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2,612 - Forks: 258

Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Language: Python - Size: 122 MB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 2,308 - Forks: 200

bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
Language: Python - Size: 9.85 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2,226 - Forks: 213

Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language: Python - Size: 58 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 2,180 - Forks: 91

andreas128/RePaint
Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022
Language: Python - Size: 91.8 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 2,071 - Forks: 168

ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
Size: 162 MB - Last synced at: 5 days ago - Pushed at: 23 days ago - Stars: 2,067 - Forks: 106

open-mmlab/mmgeneration
MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
Language: Python - Size: 26.6 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 1,970 - Forks: 232

adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
Language: Python - Size: 60.5 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 1,930 - Forks: 141

yang-song/score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Language: Jupyter Notebook - Size: 3.93 MB - Last synced at: 8 days ago - Pushed at: 9 months ago - Stars: 1,891 - Forks: 327

SUDO-AI-3D/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
Language: Python - Size: 2.38 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 1,876 - Forks: 126

siliconflow/onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
Language: Jupyter Notebook - Size: 114 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 1,863 - Forks: 125

junshutang/Make-It-3D
[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Language: Python - Size: 178 MB - Last synced at: 15 days ago - Pushed at: 10 months ago - Stars: 1,841 - Forks: 128

amirhossein-kz/Awesome-Diffusion-Models-in-Medical-Imaging
Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)
Size: 628 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 1,763 - Forks: 156

eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Language: Python - Size: 46.9 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 1,739 - Forks: 123

FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language: Python - Size: 5.35 MB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 1,708 - Forks: 75

wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
Size: 617 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 1,664 - Forks: 74

LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
Language: Python - Size: 63.5 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 1,655 - Forks: 125

hymie122/RAG-Survey
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Size: 6.49 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 1,601 - Forks: 110

guochengqian/Magic123
[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
Language: Jupyter Notebook - Size: 113 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 1,574 - Forks: 96

TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Language: Python - Size: 37 MB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 1,548 - Forks: 127

MaximeVandegar/Papers-in-100-Lines-of-Code
Implementation of papers in 100 lines of code.
Language: Python - Size: 20.5 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 1,479 - Forks: 156

yang-song/score_sde
Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Language: Jupyter Notebook - Size: 4.35 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 1,476 - Forks: 206

mit-han-lab/nunchaku
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Language: Cuda - Size: 85.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,425 - Forks: 84

menyifang/MIMO
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
Size: 13.5 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1,412 - Forks: 57

THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Language: Python - Size: 4.18 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 1,369 - Forks: 71

ThuCCSLab/Awesome-LM-SSP
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
Size: 2.3 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,362 - Forks: 87

bloc97/CrossAttentionControl
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
Language: Jupyter Notebook - Size: 62.3 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 1,331 - Forks: 88

PKU-YuanGroup/MagicTime
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python - Size: 1.08 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1,311 - Forks: 125

Zheng-Chong/CatVTON
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
Language: Python - Size: 16.3 MB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 1,264 - Forks: 154

wyhuai/DDNM
[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model
Language: Python - Size: 14 MB - Last synced at: 15 days ago - Pushed at: 12 months ago - Stars: 1,246 - Forks: 92

davidADSP/Generative_Deep_Learning_2nd_Edition
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: 15 days ago - Pushed at: 11 months ago - Stars: 1,245 - Forks: 485

gcorso/DiffDock
Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
Language: Python - Size: 211 MB - Last synced at: 10 days ago - Pushed at: 8 months ago - Stars: 1,219 - Forks: 296

Yujun-Shi/DragDiffusion
[CVPR2024, Highlight] Official code for DragDiffusion
Language: Python - Size: 25.3 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 1,207 - Forks: 91

Tencent/HunyuanVideo-I2V
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Language: Python - Size: 146 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1,206 - Forks: 95

muzishen/IMAGDressing
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.
Language: Python - Size: 46.4 MB - Last synced at: 26 days ago - Pushed at: about 1 month ago - Stars: 1,201 - Forks: 104

Fantasy-Studio/Paint-by-Example
Paint by Example: Exemplar-based Image Editing with Diffusion Models
Language: Python - Size: 18 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 1,170 - Forks: 99

declare-lab/tango
A family of diffusion models for text-to-audio generation.
Language: Python - Size: 19.5 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 1,086 - Forks: 88

opendilab/awesome-diffusion-model-in-rl
A curated list of Diffusion Model in RL resources (continually updated)
Size: 634 KB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 1,084 - Forks: 59

a-r-r-o-w/finetrainers
Memory-optimized training library for diffusion models
Language: Python - Size: 54.1 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,060 - Forks: 116

OpenBMB/VisCPM
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
Language: Python - Size: 7.38 MB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 1,056 - Forks: 93

lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
Language: Python - Size: 8.85 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 1,051 - Forks: 73

Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python - Size: 4.45 MB - Last synced at: 2 days ago - Pushed at: 6 days ago - Stars: 1,027 - Forks: 82

PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
Size: 3.04 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 1,027 - Forks: 27

omerbt/MultiDiffusion
Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
Language: Jupyter Notebook - Size: 6.93 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 1,027 - Forks: 59

wladradchenko/wunjo.wladradchenko.ru
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
Language: Python - Size: 263 MB - Last synced at: 5 days ago - Pushed at: 22 days ago - Stars: 1,006 - Forks: 102

liuyuan-pal/SyncDreamer
[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
Language: Python - Size: 81.6 MB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 958 - Forks: 43

open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
Language: Python - Size: 79 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 956 - Forks: 75

showlab/MotionDirector
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Language: Python - Size: 177 MB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 955 - Forks: 55

3DTopia/3DTopia-XL
[CVPR 2025] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion
Language: Python - Size: 72.3 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 949 - Forks: 35

cure-lab/MagicDrive
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
Language: Python - Size: 23.5 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 937 - Forks: 48

phizaz/diffae
Official implementation of Diffusion Autoencoders
Language: Jupyter Notebook - Size: 10.9 MB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 910 - Forks: 144

radames/Real-Time-Latent-Consistency-Model
App showcasing multiple real-time diffusion models pipelines with Diffusers
Language: Python - Size: 396 KB - Last synced at: 4 days ago - Pushed at: 29 days ago - Stars: 888 - Forks: 104

NVlabs/ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
Language: Python - Size: 16.4 MB - Last synced at: 26 days ago - Pushed at: 10 months ago - Stars: 888 - Forks: 49

horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
Language: Python - Size: 102 MB - Last synced at: 11 days ago - Pushed at: 10 months ago - Stars: 883 - Forks: 43

UCSC-VLAA/story-adapter
A Training-free Iterative Framework for Long Story Visualization
Language: Python - Size: 280 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 877 - Forks: 123

eric-ai-lab/MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
Language: Python - Size: 61.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 854 - Forks: 52

ali-vilab/videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
Language: Python - Size: 33.3 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 843 - Forks: 77

hkproj/pytorch-stable-diffusion
Stable Diffusion implemented from scratch in PyTorch
Language: Jupyter Notebook - Size: 1.08 MB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 830 - Forks: 171

shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language: Jupyter Notebook - Size: 57.6 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 819 - Forks: 104

showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language: Python - Size: 162 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 804 - Forks: 36

mbreuss/diffusion-literature-for-robotics
Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.
Size: 220 KB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 803 - Forks: 41

songweige/rich-text-to-image
Rich-Text-to-Image Generation
Language: Python - Size: 41.4 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 786 - Forks: 67

lixinustc/Awesome-diffusion-model-for-image-processing
one summary of diffusion-based image processing, including restoration, enhancement, coding, quality assessment
Size: 1.16 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 758 - Forks: 57

hustvl/GaussianDreamer
[CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models
Language: Python - Size: 74.7 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 753 - Forks: 39

Boese0601/MagicDance
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
Language: Python - Size: 131 MB - Last synced at: 26 days ago - Pushed at: 10 months ago - Stars: 746 - Forks: 64

lzzcd001/MeshDiffusion
Official implementation of "MeshDiffusion: Score-based Generative 3D Mesh Modeling" (ICLR 2023 Spotlight)
Language: Python - Size: 17.9 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 746 - Forks: 40

dome272/Paella
Official Implementation of Paella https://arxiv.org/abs/2211.07292v2
Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 744 - Forks: 54

MyNiuuu/MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
Language: Python - Size: 117 MB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 732 - Forks: 46

vicgalle/stable-diffusion-aesthetic-gradients
Personalization for Stable Diffusion via Aesthetic Gradients 🎨
Language: Jupyter Notebook - Size: 92.5 MB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 730 - Forks: 62

yuval-alaluf/Attend-and-Excite
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
Language: Jupyter Notebook - Size: 103 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 729 - Forks: 62

ayaanzhaque/instruct-nerf2nerf
Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions (ICCV 2023)
Language: Python - Size: 3.89 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 719 - Forks: 58

ali-vilab/TeaCache
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Language: Python - Size: 22.5 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 686 - Forks: 26

OpenTexture/Paint3D
[CVPR 2024] Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models, a no lighting baked texture generative model
Language: Python - Size: 38.3 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 685 - Forks: 32

ExponentialML/Text-To-Video-Finetuning 📦
Finetune ModelScope's Text To Video model using Diffusers 🧨
Language: Python - Size: 1.82 MB - Last synced at: about 6 hours ago - Pushed at: over 1 year ago - Stars: 685 - Forks: 109

zubair-irshad/Awesome-Robotics-3D
A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites
Size: 730 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 680 - Forks: 35

mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Language: Python - Size: 2.42 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 675 - Forks: 32

PKU-YuanGroup/ConsisID
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Language: Python - Size: 13.3 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 667 - Forks: 33

Project-MONAI/GenerativeModels 📦
MONAI Generative Models makes it easy to train, evaluate, and deploy generative models and related applications
Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 653 - Forks: 90

Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Language: Python - Size: 961 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 646 - Forks: 87

teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
Language: Jupyter Notebook - Size: 35.8 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 646 - Forks: 66

thu-ml/CRM
[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.
Language: Python - Size: 4.38 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 641 - Forks: 54

ChenWu98/cycle-diffusion
[ICCV 2023] A latent space for stochastic diffusion models
Language: Python - Size: 52.5 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 617 - Forks: 36

mikonvergence/DiffusionFastForward
DiffusionFastForward: a free course and experimental framework for diffusion-based generative models
Language: Jupyter Notebook - Size: 4.09 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 601 - Forks: 54

omriav/blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
Language: Jupyter Notebook - Size: 9.84 MB - Last synced at: 26 days ago - Pushed at: 11 months ago - Stars: 594 - Forks: 37
