Topic: "diffusion"
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language: Python - Size: 34.7 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 151,240 - Forks: 28,144

huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language: Python - Size: 61.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 28,572 - Forks: 5,856

huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language: Python - Size: 15.9 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 18,115 - Forks: 1,819

datawhalechina/leedl-tutorial
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
Language: Jupyter Notebook - Size: 294 MB - Last synced at: 13 days ago - Pushed at: 18 days ago - Stars: 14,905 - Forks: 3,005

easydiffusion/easydiffusion
An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.
Language: JavaScript - Size: 57.2 MB - Last synced at: 11 days ago - Pushed at: 19 days ago - Stars: 9,878 - Forks: 815

cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language: Jupyter Notebook - Size: 177 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 7,304 - Forks: 491

open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
Language: Jupyter Notebook - Size: 31.3 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 7,131 - Forks: 1,079

NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Language: Python - Size: 248 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4,003 - Forks: 252

leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
Language: C++ - Size: 21.3 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 3,997 - Forks: 363

VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Language: Jupyter Notebook - Size: 417 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 3,912 - Forks: 335

jina-ai/discoart
🪩 Create Disco Diffusion artworks in one line
Language: Python - Size: 29 MB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 3,843 - Forks: 248

riffusion/riffusion-hobby
Stable diffusion for real-time music generation
Language: Python - Size: 8.06 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 3,633 - Forks: 428

williamyang1991/Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Language: Jupyter Notebook - Size: 8.89 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 2,978 - Forks: 200

openvpi/DiffSinger Fork of MoonInTheRiver/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Language: Python - Size: 66 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,837 - Forks: 297

ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
Language: Jupyter Notebook - Size: 37.3 MB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 2,789 - Forks: 312

PlayVoice/whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
Language: Python - Size: 41.3 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 2,755 - Forks: 920

datawhalechina/tiny-universe
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
Language: Python - Size: 9.59 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 2,690 - Forks: 283

TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language: Python - Size: 230 MB - Last synced at: 10 days ago - Pushed at: 10 months ago - Stars: 2,678 - Forks: 283

riffusion/riffusion-app-hobby
Stable diffusion for real-time music generation (web app)
Language: TypeScript - Size: 30.6 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 2,649 - Forks: 203

prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Language: Python - Size: 9.4 MB - Last synced at: 10 days ago - Pushed at: 28 days ago - Stars: 2,639 - Forks: 159

Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language: Python - Size: 58 MB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 2,180 - Forks: 91

ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
Size: 162 MB - Last synced at: 3 days ago - Pushed at: 21 days ago - Stars: 2,067 - Forks: 106

bytedance/InfiniteYou
🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Language: Python - Size: 13.5 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1,877 - Forks: 133

amirhossein-kz/Awesome-Diffusion-Models-in-Medical-Imaging
Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)
Size: 628 KB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 1,763 - Forks: 156

FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language: Python - Size: 5.35 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 1,708 - Forks: 75

varunshenoy/opendream
An extensible, easy-to-use, and portable diffusion web UI 👨🎨
Language: JavaScript - Size: 32.7 MB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 1,670 - Forks: 72

wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
Size: 617 KB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1,664 - Forks: 74

rupeshs/fastsdcpu
Fast stable diffusion on CPU
Language: Python - Size: 18.1 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1,656 - Forks: 140

Maks-s/sd-akashic
A compendium of informations regarding Stable Diffusion (SD)
Size: 106 MB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 1,640 - Forks: 84

NVIDIA/Cosmos-Tokenizer 📦
A suite of image and video neural tokenizers
Language: Jupyter Notebook - Size: 16.5 MB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 1,614 - Forks: 75

pollinations/pollinations
Free Open-Source Image and Text Generation
Language: JavaScript - Size: 363 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,598 - Forks: 174

awesome-stable-diffusion/awesome-stable-diffusion
Curated list of awesome resources for the Stable Diffusion AI Model.
Size: 414 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 1,552 - Forks: 78

TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Language: Python - Size: 37 MB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 1,548 - Forks: 127

IntelLabs/fastRAG
Efficient Retrieval Augmentation and Generation Framework
Language: Python - Size: 20.4 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 1,513 - Forks: 139

mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
Language: Python - Size: 67.6 MB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 1,268 - Forks: 151

Uminosachi/sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Language: Python - Size: 3.44 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 1,231 - Forks: 110

tin2tin/Pallaidium
PALLAIDIUM - a generative AI movie studio integrated in the Blender Video Editor.
Language: Python - Size: 33.5 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1,107 - Forks: 90

THU-LYJ-Lab/T3Bench
T3Bench: Benchmarking Current Progress in Text-to-3D Generation
Language: Python - Size: 11.6 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 1,098 - Forks: 10

declare-lab/tango
A family of diffusion models for text-to-audio generation.
Language: Python - Size: 19.5 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 1,086 - Forks: 88

a-r-r-o-w/finetrainers
Memory-optimized training library for diffusion models
Language: Python - Size: 54.1 MB - Last synced at: about 1 hour ago - Pushed at: about 12 hours ago - Stars: 1,060 - Forks: 116

EdVince/Stable-Diffusion-NCNN
Stable Diffusion in NCNN with c++, supported txt2img and img2img
Language: C++ - Size: 151 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 1,033 - Forks: 100

cloneofsimo/minDiffusion
Self-contained, minimalistic implementation of diffusion models with Pytorch.
Language: Python - Size: 3.51 MB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 946 - Forks: 128

sail-sg/Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Language: Python - Size: 1.3 MB - Last synced at: 16 days ago - Pushed at: 10 months ago - Stars: 784 - Forks: 67

AspirinCode/papers-for-molecular-design-using-DL
List of Molecular and Material design using Generative AI and Deep Learning
Size: 5.75 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 776 - Forks: 103

fboulnois/stable-diffusion-docker
Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.
Language: Python - Size: 666 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 745 - Forks: 132

fishaudio/fish-diffusion
An easy to understand TTS / SVS / SVC framework
Language: Python - Size: 61.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 682 - Forks: 90

FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
Language: Python - Size: 2.45 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 671 - Forks: 54

PKU-YuanGroup/ConsisID
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Language: Python - Size: 13.3 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 667 - Forks: 33

cloneofsimo/paint-with-words-sd
Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation map.
Language: Jupyter Notebook - Size: 41.3 MB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 642 - Forks: 50

LeCAR-Lab/dial-mpc
Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged robot full-order torque-level control with both precision and agility in a training-free manner.
Language: Python - Size: 268 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 635 - Forks: 71

SkyWorkAIGC/SkyPaint-AI-Diffusion
基于Stable Diffusion优化的AI绘画模型。支持输入中英文文本,可生成多种现代艺术风格的高质量图像。| An optimized text-to-image model based on Stable Diffusion. Both Chinese and English text inputs are available to generate images. The model can generate high-quality images in several modern art styles.
Size: 7.74 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 630 - Forks: 37

castorini/daam
Diffusion attentive attribution maps for interpreting Stable Diffusion.
Language: Jupyter Notebook - Size: 2.15 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 601 - Forks: 61

omriav/blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
Language: Jupyter Notebook - Size: 9.84 MB - Last synced at: 24 days ago - Pushed at: 11 months ago - Stars: 594 - Forks: 37

some9000/StylePile
A prompt generation helper script for AUTOMATIC1111/stable-diffusion-webui and compatible forks
Language: Python - Size: 184 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 575 - Forks: 44

omriav/blended-diffusion
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
Language: Jupyter Notebook - Size: 42.4 MB - Last synced at: 24 days ago - Pushed at: 11 months ago - Stars: 574 - Forks: 43

kjsman/stable-diffusion-pytorch
Yet another PyTorch implementation of Stable Diffusion (probably easy to read)
Language: Python - Size: 25.4 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 554 - Forks: 62

williamyang1991/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Language: Jupyter Notebook - Size: 9.94 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 541 - Forks: 51

microsoft/foldingdiff
Diffusion models of protein structure; trigonometry and attention are all you need!
Language: Jupyter Notebook - Size: 228 MB - Last synced at: about 23 hours ago - Pushed at: over 1 year ago - Stars: 539 - Forks: 64

ChaofanTao/Autoregressive-Models-in-Vision-Survey
[TMLR 2025🔥] A survey for the autoregressive models in vision.
Size: 7.85 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 507 - Forks: 15

bahjat-kawar/ddrm
[NeurIPS 2022] Denoising Diffusion Restoration Models -- Official Code Repository
Language: Python - Size: 3.14 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 506 - Forks: 51

dromara/Omega-AI
Omega-AI:基于java打造的深度学习框架,帮助你快速搭建神经网络,实现模型推理与训练,引擎支持自动求导,多线程与GPU运算,GPU支持CUDA,CUDNN。
Language: Java - Size: 256 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 491 - Forks: 59

csslc/CCSR
Official codes of CCSRv2 and CCSRv1: Improving the Stability and Efficiency of Diffusion Models for Content Consistent Super-Resolution
Language: Python - Size: 51.6 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 472 - Forks: 38

yuanchenyang/smalldiffusion
Simple and readable code for training and sampling from diffusion models
Language: Python - Size: 1.05 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 470 - Forks: 34

prs-eth/RollingDepth
[CVPR 2025] Video Depth without Video Models
Language: Python - Size: 6.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 464 - Forks: 17

afiaka87/clip-guided-diffusion
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
Language: Python - Size: 51.2 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 460 - Forks: 62

Auto1111SDK/Auto1111SDK
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
Language: Python - Size: 10.6 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 403 - Forks: 28

HaozheLiu-ST/T-GATE
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
Language: Python - Size: 54.8 MB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 393 - Forks: 26

naver-ai/Visual-Style-Prompting
Official Pytorch implementation of "Visual Style Prompting with Swapping Self-Attention"
Language: Python - Size: 36.8 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 384 - Forks: 30

yeungchenwa/FontDiffuser
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
Language: Python - Size: 16.4 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 360 - Forks: 34

sanweiliti/RoHM
The official PyTorch code for RoHM: Robust Human Motion Reconstruction via Diffusion.
Language: Python - Size: 395 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 351 - Forks: 19

scenediffuser/Scene-Diffuser
Official implementation of CVPR23 paper "Diffusion-based Generation, Optimization, and Planning in 3D Scenes"
Language: Python - Size: 1.77 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 351 - Forks: 24

huggingface/open-muse
Open reproduction of MUSE for fast text2image generation.
Language: Python - Size: 4.33 MB - Last synced at: 8 days ago - Pushed at: 11 months ago - Stars: 348 - Forks: 29

jychoi118/ilvr_adm
ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models (ICCV 2021 Oral)
Language: Python - Size: 313 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 344 - Forks: 46

kxhit/EscherNet
[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis
Language: Python - Size: 37.9 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 335 - Forks: 19

keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language: Python - Size: 121 MB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 331 - Forks: 45

VisualComputingInstitute/diffusion-e2e-ft
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think. Accepted to WACV 2025 and NeurIPS AFM Workshop.
Language: Python - Size: 8.82 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 330 - Forks: 4

jabir-zheng/TCD
Official Repository of the paper "Trajectory Consistency Distillation"
Language: Python - Size: 100 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 327 - Forks: 13

AILab-CVC/FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
Language: Python - Size: 81.6 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 327 - Forks: 24

woctezuma/stable-diffusion-colab
Colab notebook for Stable Diffusion Hyper-SDXL.
Language: Jupyter Notebook - Size: 53.7 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 324 - Forks: 83

p1atdev/LECO
Low-rank adaptation for Erasing COncepts from diffusion models.
Language: Jupyter Notebook - Size: 3.82 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 322 - Forks: 27

lunarring/latentblending
Create butter-smooth transitions between prompts, powered by stable diffusion
Language: Python - Size: 8.71 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 312 - Forks: 24

diffusion-classifier/diffusion-classifier
Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training
Language: Python - Size: 736 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 308 - Forks: 16

kwonminki/One-sentence_Diffusion_summary
The repo for studying and sharing diffusion models.
Size: 374 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 304 - Forks: 25

nianticlabs/diffusionerf
[CVPR 2023] DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models
Language: Python - Size: 2.43 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 303 - Forks: 17

ximinng/SVGDreamer
[CVPR 2024] Official implementation for "SVGDreamer: Text Guided SVG Generation with Diffusion Model" https://arxiv.org/abs/2312.16476
Language: Python - Size: 34.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 301 - Forks: 29

qitianwu/DIFFormer
The official implementation for ICLR23 spotlight paper "DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion"
Language: Python - Size: 125 KB - Last synced at: 25 days ago - Pushed at: about 1 month ago - Stars: 295 - Forks: 32

RehgLab/RAVE
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]
Language: Python - Size: 157 MB - Last synced at: 24 days ago - Pushed at: 2 months ago - Stars: 291 - Forks: 20

zibojia/COCOCO
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.
Language: Python - Size: 45.7 MB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 291 - Forks: 8

DWCTOD/ECCV2022-Papers-with-Code-Demo
收集 ECCV 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!
Size: 170 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 286 - Forks: 23

RQ-Wu/LAMP
[CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation
Language: Python - Size: 99.3 MB - Last synced at: 24 days ago - Pushed at: 12 months ago - Stars: 278 - Forks: 14

JoaoLages/diffusers-interpret
Diffusers-Interpret 🤗🧨🕵️♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.
Language: Jupyter Notebook - Size: 77.5 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 275 - Forks: 14

baaivision/DIVA
[ICLR 2025] Diffusion Feedback Helps CLIP See Better
Language: Python - Size: 2.47 MB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 273 - Forks: 14

ZichengDuan/TheChosenOne
Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"
Language: Python - Size: 6.7 MB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 265 - Forks: 24

pansanity666/Awesome-Avatars
List of recent advances for human avatars, including generation, reconstruction, and editing, etc.
Size: 71.3 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 259 - Forks: 16

mihirp1998/VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
Language: Python - Size: 164 MB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 253 - Forks: 15

zhanghm1995/Forge_VFM4AD
A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.
Size: 34.8 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 243 - Forks: 10

Sirui-Xu/InterDiff
[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"
Language: Python - Size: 112 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 234 - Forks: 9

keonlee9420/DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Language: Python - Size: 133 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 233 - Forks: 30

LeCAR-Lab/model-based-diffusion
Official implementation for the paper "Model-based Diffusion for Trajectory Optimization". Model-based diffusion (MBD) is a novel diffusion-based trajectory optimization framework that employs a dynamics model to run the reverse denoising process to generate high-quality trajectories.
Language: Python - Size: 36.9 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 230 - Forks: 16

byliutao/1Prompt1Story
🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
Language: Python - Size: 29.9 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 229 - Forks: 28
