Topic: "diffusion"
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language: Python - Size: 34.9 MB - Last synced at: 14 days ago - Pushed at: 18 days ago - Stars: 159,144 - Forks: 29,579
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Language: Python - Size: 86.1 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 32,191 - Forks: 6,630
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language: Python - Size: 23.7 MB - Last synced at: 15 days ago - Pushed at: 18 days ago - Stars: 20,310 - Forks: 2,131
datawhalechina/leedl-tutorial
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
Language: Jupyter Notebook - Size: 295 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 15,209 - Forks: 3,037
easydiffusion/easydiffusion
An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.
Language: JavaScript - Size: 57.3 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 10,169 - Forks: 847
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language: Jupyter Notebook - Size: 177 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 7,434 - Forks: 490
open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
Language: Jupyter Notebook - Size: 31.3 MB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 7,344 - Forks: 1,098
leejet/stable-diffusion.cpp
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++
Language: C++ - Size: 56.8 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 4,994 - Forks: 483
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Language: Python - Size: 727 MB - Last synced at: 13 days ago - Pushed at: 15 days ago - Stars: 4,838 - Forks: 321
transformerlab/transformerlab-app
Open Source Machine Learning Research Platform designed for frontier AI/ML workflows. Local, on-prem, or in the cloud. Open source.
Language: Python - Size: 19.1 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 4,726 - Forks: 493
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Language: Jupyter Notebook - Size: 399 MB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 4,284 - Forks: 365
datawhalechina/tiny-universe
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
Language: Jupyter Notebook - Size: 22.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4,095 - Forks: 410
riffusion/riffusion-hobby
Stable diffusion for real-time music generation
Language: Python - Size: 8.06 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 3,859 - Forks: 466
jina-ai/discoart
🪩 Create Disco Diffusion artworks in one line
Language: Python - Size: 29 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 3,835 - Forks: 246
pollinations/pollinations
Your Friendly Open-Source Gen-AI Platform
Language: TypeScript - Size: 512 MB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 3,485 - Forks: 478
openvpi/DiffSinger Fork of MoonInTheRiver/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Language: Python - Size: 66.1 MB - Last synced at: 9 days ago - Pushed at: 11 days ago - Stars: 3,041 - Forks: 321
williamyang1991/Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Language: Jupyter Notebook - Size: 8.89 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 2,982 - Forks: 200
PlayVoice/whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
Language: Python - Size: 41.3 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 2,813 - Forks: 925
ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
Language: Jupyter Notebook - Size: 37.3 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 2,810 - Forks: 313
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language: Python - Size: 230 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 2,781 - Forks: 299
prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Language: Python - Size: 9.44 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2,736 - Forks: 172
riffusion/riffusion-app-hobby
Stable diffusion for real-time music generation (web app)
Language: TypeScript - Size: 30.6 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 2,671 - Forks: 209
nunchaku-tech/ComfyUI-nunchaku
ComfyUI Plugin of Nunchaku
Language: Python - Size: 2.97 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 2,648 - Forks: 132
bytedance/InfiniteYou
🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Language: Python - Size: 13.5 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 2,591 - Forks: 285
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
Size: 162 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 2,240 - Forks: 111
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language: Python - Size: 58 MB - Last synced at: 8 months ago - Pushed at: 11 months ago - Stars: 2,187 - Forks: 91
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
Size: 563 KB - Last synced at: 7 days ago - Pushed at: 11 days ago - Stars: 2,096 - Forks: 97
amirhossein-kz/Awesome-Diffusion-Models-in-Medical-Imaging
Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)
Size: 1.18 MB - Last synced at: 34 minutes ago - Pushed at: about 2 months ago - Stars: 2,021 - Forks: 170
River-Zhang/ICEdit
[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt release! Only 4GB VRAM is enough to run!
Language: Python - Size: 29 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1,934 - Forks: 108
rupeshs/fastsdcpu
Fast stable diffusion on CPU and AI PC
Language: Python - Size: 18.8 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 1,917 - Forks: 170
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language: Python - Size: 5.35 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 1,917 - Forks: 94
vllm-project/vllm-omni
A framework for efficient model inference with omni-modality models
Language: Python - Size: 4.88 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1,831 - Forks: 226
NVIDIA/Cosmos-Tokenizer 📦
A suite of image and video neural tokenizers
Language: Jupyter Notebook - Size: 16.5 MB - Last synced at: 9 days ago - Pushed at: 11 months ago - Stars: 1,695 - Forks: 85
varunshenoy/opendream
An extensible, easy-to-use, and portable diffusion web UI 👨🎨
Language: JavaScript - Size: 32.7 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 1,669 - Forks: 71
IntelLabs/fastRAG
Efficient Retrieval Augmentation and Generation Framework
Language: Python - Size: 20.4 MB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 1,657 - Forks: 154
TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Language: Python - Size: 37 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 1,655 - Forks: 137
Maks-s/sd-akashic
A compendium of informations regarding Stable Diffusion (SD)
Size: 106 MB - Last synced at: 8 months ago - Pushed at: almost 3 years ago - Stars: 1,645 - Forks: 83
Fantasy-AMAP/fantasy-talking
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Language: Python - Size: 128 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1,496 - Forks: 116
huggingface/finetrainers
Scalable and memory-optimized training of diffusion models
Language: Python - Size: 54.2 MB - Last synced at: 20 days ago - Pushed at: 7 months ago - Stars: 1,310 - Forks: 144
tin2tin/Pallaidium
PALLAIDIUM — a generative AI movie studio, seamlessly integrated into the Blender Video Editor (VSE), enabling end-to-end production from script to screen and back.
Language: Python - Size: 35.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,286 - Forks: 116
bytedance/UNO
[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
Language: Python - Size: 39.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1,275 - Forks: 77
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
Language: Python - Size: 67.6 MB - Last synced at: 8 months ago - Pushed at: 11 months ago - Stars: 1,265 - Forks: 150
Uminosachi/sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Language: Python - Size: 3.44 MB - Last synced at: 8 months ago - Pushed at: about 1 year ago - Stars: 1,245 - Forks: 113
declare-lab/tango
A family of diffusion models for text-to-audio generation.
Language: Python - Size: 19.3 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1,182 - Forks: 103
THU-LYJ-Lab/T3Bench
T3Bench: Benchmarking Current Progress in Text-to-3D Generation
Language: Python - Size: 11.6 MB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 1,098 - Forks: 10
cloneofsimo/minDiffusion
Self-contained, minimalistic implementation of diffusion models with Pytorch.
Language: Python - Size: 3.51 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 1,094 - Forks: 140
EdVince/Stable-Diffusion-NCNN
Stable Diffusion in NCNN with c++, supported txt2img and img2img
Language: C++ - Size: 151 MB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 1,033 - Forks: 100
AspirinCode/papers-for-molecular-design-using-DL
List of Molecular and Material design using Generative AI and Deep Learning
Size: 4.91 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 892 - Forks: 114
LeCAR-Lab/dial-mpc
Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged robot full-order torque-level control with both precision and agility in a training-free manner.
Language: Python - Size: 269 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 841 - Forks: 89
Tencent-Hunyuan/MixGRPO
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
Language: Python - Size: 4.64 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 824 - Forks: 42
PKU-YuanGroup/UniWorld
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
Language: Python - Size: 88.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 809 - Forks: 24
sail-sg/Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Language: Python - Size: 1.31 MB - Last synced at: 27 days ago - Pushed at: 7 months ago - Stars: 804 - Forks: 69
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
Language: Python - Size: 3.37 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 796 - Forks: 71
PKU-YuanGroup/ConsisID
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Language: Python - Size: 13.3 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 786 - Forks: 44
castorini/daam
Diffusion attentive attribution maps for interpreting Stable Diffusion.
Language: Jupyter Notebook - Size: 2.15 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 774 - Forks: 68
williamyang1991/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Language: Jupyter Notebook - Size: 9.97 MB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 762 - Forks: 75
thu-ml/DiT-Extrapolation
Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) and "UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers"
Language: Python - Size: 139 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 754 - Forks: 72
ChaofanTao/Autoregressive-Models-in-Vision-Survey
[TMLR 2025🔥] A survey for the autoregressive models in vision.
Size: 8.61 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 748 - Forks: 20
fboulnois/stable-diffusion-docker
Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.
Language: Python - Size: 666 KB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 746 - Forks: 132
fishaudio/fish-diffusion
An easy to understand TTS / SVS / SVC framework
Language: Python - Size: 61.4 MB - Last synced at: 7 months ago - Pushed at: 10 months ago - Stars: 714 - Forks: 97
kandinskylab/kandinsky-5
Kandinsky 5.0: A family of diffusion models for Video & Image generation
Language: Python - Size: 398 MB - Last synced at: 3 days ago - Pushed at: 7 days ago - Stars: 664 - Forks: 42
cloneofsimo/paint-with-words-sd
Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation map.
Language: Jupyter Notebook - Size: 41.3 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 644 - Forks: 49
SkyWorkAIGC/SkyPaint-AI-Diffusion
基于Stable Diffusion优化的AI绘画模型。支持输入中英文文本,可生成多种现代艺术风格的高质量图像。| An optimized text-to-image model based on Stable Diffusion. Both Chinese and English text inputs are available to generate images. The model can generate high-quality images in several modern art styles.
Size: 7.74 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 630 - Forks: 37
omriav/blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
Language: Jupyter Notebook - Size: 9.84 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 594 - Forks: 37
yuanchenyang/smalldiffusion
Simple and readable code for training and sampling from diffusion models
Language: Python - Size: 1.6 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 590 - Forks: 45
kjsman/stable-diffusion-pytorch
Yet another PyTorch implementation of Stable Diffusion (probably easy to read)
Language: Python - Size: 25.4 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 590 - Forks: 63
some9000/StylePile
A prompt generation helper script for AUTOMATIC1111/stable-diffusion-webui and compatible forks
Language: Python - Size: 184 MB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 581 - Forks: 43
omriav/blended-diffusion
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
Language: Jupyter Notebook - Size: 42.4 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 574 - Forks: 43
microsoft/foldingdiff
Diffusion models of protein structure; trigonometry and attention are all you need!
Language: Jupyter Notebook - Size: 228 MB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 560 - Forks: 72
prs-eth/RollingDepth
[CVPR 2025] RollingDepth: Video Depth without Video Models
Language: Python - Size: 6.14 MB - Last synced at: 6 months ago - Pushed at: 10 months ago - Stars: 556 - Forks: 20
bahjat-kawar/ddrm
[NeurIPS 2022] Denoising Diffusion Restoration Models -- Official Code Repository
Language: Python - Size: 3.14 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 506 - Forks: 51
dromara/Omega-AI
Omega-AI:基于java打造的深度学习框架,帮助你快速搭建神经网络,实现模型推理与训练,引擎支持自动求导,多线程与GPU运算,GPU支持CUDA,CUDNN。
Language: Java - Size: 260 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 497 - Forks: 63
xlite-dev/Awesome-DiT-Inference
📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉
Language: Python - Size: 230 KB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 481 - Forks: 24
csslc/CCSR
Official codes of CCSRv2 and CCSRv1: Improving the Stability and Efficiency of Diffusion Models for Content Consistent Super-Resolution
Language: Python - Size: 51.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 472 - Forks: 38
naver-ai/StyleKeeper
Official Pytorch implementation of "StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance"
Language: Python - Size: 36.8 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 469 - Forks: 34
afiaka87/clip-guided-diffusion
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
Language: Python - Size: 49.9 MB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 462 - Forks: 60
svg-project/Sparse-VideoGen
[ICML2025] Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Language: Python - Size: 4.51 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 438 - Forks: 24
AILab-CVC/FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
Language: Python - Size: 81.7 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 420 - Forks: 25
HaozheLiu-ST/T-GATE
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
Language: Python - Size: 54.8 MB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 415 - Forks: 24
Auto1111SDK/Auto1111SDK
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
Language: Python - Size: 10.6 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 403 - Forks: 28
scenediffuser/Scene-Diffuser
Official implementation of CVPR23 paper "Diffusion-based Generation, Optimization, and Planning in 3D Scenes"
Language: Python - Size: 1.77 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 394 - Forks: 26
OliBomby/Mapperatorinator
An AI framework for generating and modding osu! beatmaps for all gamemodes from spectrogram inputs.
Language: Python - Size: 3.61 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 384 - Forks: 44
yeungchenwa/FontDiffuser
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
Language: Python - Size: 16.4 MB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 360 - Forks: 34
kxhit/EscherNet
[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis
Language: Python - Size: 37.9 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 359 - Forks: 20
huggingface/open-muse
Open reproduction of MUSE for fast text2image generation.
Language: Python - Size: 4.33 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 356 - Forks: 32
ximinng/SVGDreamer
[CVPR 2024] Official implementation for "SVGDreamer: Text Guided SVG Generation with Diffusion Model" https://arxiv.org/abs/2312.16476
Language: Python - Size: 34.4 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 355 - Forks: 36
sanweiliti/RoHM
The official PyTorch code for RoHM: Robust Human Motion Reconstruction via Diffusion.
Language: Python - Size: 395 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 351 - Forks: 19
jychoi118/ilvr_adm
ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models (ICCV 2021 Oral)
Language: Python - Size: 313 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 344 - Forks: 46
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language: Python - Size: 121 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 337 - Forks: 45
HorizonWind2004/reconstruction-alignment
Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.
Language: Python - Size: 83.5 MB - Last synced at: 16 days ago - Pushed at: 20 days ago - Stars: 335 - Forks: 11
VisualComputingInstitute/diffusion-e2e-ft
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think. Accepted to WACV 2025 and NeurIPS AFM Workshop.
Language: Python - Size: 8.82 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 330 - Forks: 4
jabir-zheng/TCD
Official Repository of the paper "Trajectory Consistency Distillation"
Language: Python - Size: 100 MB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 327 - Forks: 13
woctezuma/stable-diffusion-colab
Colab notebook for Stable Diffusion Hyper-SDXL.
Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: 7 months ago - Pushed at: 9 months ago - Stars: 325 - Forks: 81
p1atdev/LECO
Low-rank adaptation for Erasing COncepts from diffusion models.
Language: Jupyter Notebook - Size: 3.82 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 322 - Forks: 29
robotgradient/grasp_diffusion
Pytorch implementation of diffusion models on Lie Groups for 6D grasp pose generation https://sites.google.com/view/se3dif/home
Language: Jupyter Notebook - Size: 27.9 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 313 - Forks: 36
lunarring/latentblending
Create butter-smooth transitions between prompts, powered by stable diffusion
Language: Python - Size: 8.71 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 312 - Forks: 24
RehgLab/RAVE
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]
Language: Python - Size: 157 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 310 - Forks: 20
diffusion-classifier/diffusion-classifier
Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training
Language: Python - Size: 736 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 308 - Forks: 16
nianticlabs/diffusionerf
[CVPR 2023] DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models
Language: Python - Size: 2.43 MB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 304 - Forks: 17
kwonminki/One-sentence_Diffusion_summary
The repo for studying and sharing diffusion models.
Size: 374 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 304 - Forks: 25