An open API service providing repository metadata for many open source software ecosystems.

Topic: "diffusion"

AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion web UI

Language: Python - Size: 34.7 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 151,240 - Forks: 28,144

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Language: Python - Size: 61.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 28,572 - Forks: 5,856

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python - Size: 15.9 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 18,115 - Forks: 1,819

datawhalechina/leedl-tutorial

《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

Language: Jupyter Notebook - Size: 294 MB - Last synced at: 13 days ago - Pushed at: 18 days ago - Stars: 14,905 - Forks: 3,005

easydiffusion/easydiffusion

An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.

Language: JavaScript - Size: 57.2 MB - Last synced at: 11 days ago - Pushed at: 19 days ago - Stars: 9,878 - Forks: 815

cloneofsimo/lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook - Size: 177 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 7,304 - Forks: 491

open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language: Jupyter Notebook - Size: 31.3 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 7,131 - Forks: 1,079

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Language: Python - Size: 248 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4,003 - Forks: 252

leejet/stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

Language: C++ - Size: 21.3 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 3,997 - Forks: 363

VectorSpaceLab/OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Language: Jupyter Notebook - Size: 417 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 3,912 - Forks: 335

jina-ai/discoart

🪩 Create Disco Diffusion artworks in one line

Language: Python - Size: 29 MB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 3,843 - Forks: 248

riffusion/riffusion-hobby

Stable diffusion for real-time music generation

Language: Python - Size: 8.06 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 3,633 - Forks: 428

williamyang1991/Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Language: Jupyter Notebook - Size: 8.89 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 2,978 - Forks: 200

openvpi/DiffSinger Fork of MoonInTheRiver/DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Language: Python - Size: 66 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,837 - Forks: 297

ai-forever/Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language: Jupyter Notebook - Size: 37.3 MB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 2,789 - Forks: 312

PlayVoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language: Python - Size: 41.3 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 2,755 - Forks: 920

datawhalechina/tiny-universe

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Language: Python - Size: 9.59 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 2,690 - Forks: 283

TMElyralab/MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language: Python - Size: 230 MB - Last synced at: 10 days ago - Pushed at: 10 months ago - Stars: 2,678 - Forks: 283

riffusion/riffusion-app-hobby

Stable diffusion for real-time music generation (web app)

Language: TypeScript - Size: 30.6 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 2,649 - Forks: 203

prs-eth/Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language: Python - Size: 9.4 MB - Last synced at: 10 days ago - Pushed at: 28 days ago - Stars: 2,639 - Forks: 159

Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language: Python - Size: 58 MB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 2,180 - Forks: 91

ChenHsing/Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

Size: 162 MB - Last synced at: 3 days ago - Pushed at: 21 days ago - Stars: 2,067 - Forks: 106

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Language: Python - Size: 13.5 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1,877 - Forks: 133

amirhossein-kz/Awesome-Diffusion-Models-in-Medical-Imaging

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

Size: 628 KB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 1,763 - Forks: 156

FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python - Size: 5.35 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 1,708 - Forks: 75

varunshenoy/opendream

An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨

Language: JavaScript - Size: 32.7 MB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 1,670 - Forks: 72

wangkai930418/awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

Size: 617 KB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1,664 - Forks: 74

rupeshs/fastsdcpu

Fast stable diffusion on CPU

Language: Python - Size: 18.1 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1,656 - Forks: 140

Maks-s/sd-akashic

A compendium of informations regarding Stable Diffusion (SD)

Size: 106 MB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 1,640 - Forks: 84

NVIDIA/Cosmos-Tokenizer 📦

A suite of image and video neural tokenizers

Language: Jupyter Notebook - Size: 16.5 MB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 1,614 - Forks: 75

pollinations/pollinations

Free Open-Source Image and Text Generation

Language: JavaScript - Size: 363 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,598 - Forks: 174

awesome-stable-diffusion/awesome-stable-diffusion

Curated list of awesome resources for the Stable Diffusion AI Model.

Size: 414 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 1,552 - Forks: 78

TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Language: Python - Size: 37 MB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 1,548 - Forks: 127

IntelLabs/fastRAG

Efficient Retrieval Augmentation and Generation Framework

Language: Python - Size: 20.4 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 1,513 - Forks: 139

mini-sora/minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Language: Python - Size: 67.6 MB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 1,268 - Forks: 151

Uminosachi/sd-webui-inpaint-anything

Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

Language: Python - Size: 3.44 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 1,231 - Forks: 110

tin2tin/Pallaidium

PALLAIDIUM - a generative AI movie studio integrated in the Blender Video Editor.

Language: Python - Size: 33.5 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1,107 - Forks: 90

THU-LYJ-Lab/T3Bench

T3Bench: Benchmarking Current Progress in Text-to-3D Generation

Language: Python - Size: 11.6 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 1,098 - Forks: 10

declare-lab/tango

A family of diffusion models for text-to-audio generation.

Language: Python - Size: 19.5 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 1,086 - Forks: 88

a-r-r-o-w/finetrainers

Memory-optimized training library for diffusion models

Language: Python - Size: 54.1 MB - Last synced at: about 1 hour ago - Pushed at: about 12 hours ago - Stars: 1,060 - Forks: 116

EdVince/Stable-Diffusion-NCNN

Stable Diffusion in NCNN with c++, supported txt2img and img2img

Language: C++ - Size: 151 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 1,033 - Forks: 100

cloneofsimo/minDiffusion

Self-contained, minimalistic implementation of diffusion models with Pytorch.

Language: Python - Size: 3.51 MB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 946 - Forks: 128

sail-sg/Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Language: Python - Size: 1.3 MB - Last synced at: 16 days ago - Pushed at: 10 months ago - Stars: 784 - Forks: 67

AspirinCode/papers-for-molecular-design-using-DL

List of Molecular and Material design using Generative AI and Deep Learning

Size: 5.75 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 776 - Forks: 103

fboulnois/stable-diffusion-docker

Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.

Language: Python - Size: 666 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 745 - Forks: 132

fishaudio/fish-diffusion

An easy to understand TTS / SVS / SVC framework

Language: Python - Size: 61.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 682 - Forks: 90

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Language: Python - Size: 2.45 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 671 - Forks: 54

PKU-YuanGroup/ConsisID

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Language: Python - Size: 13.3 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 667 - Forks: 33

cloneofsimo/paint-with-words-sd

Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation map.

Language: Jupyter Notebook - Size: 41.3 MB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 642 - Forks: 50

LeCAR-Lab/dial-mpc

Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged robot full-order torque-level control with both precision and agility in a training-free manner.

Language: Python - Size: 268 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 635 - Forks: 71

SkyWorkAIGC/SkyPaint-AI-Diffusion

基于Stable Diffusion优化的AI绘画模型。支持输入中英文文本,可生成多种现代艺术风格的高质量图像。| An optimized text-to-image model based on Stable Diffusion. Both Chinese and English text inputs are available to generate images. The model can generate high-quality images in several modern art styles.

Size: 7.74 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 630 - Forks: 37

castorini/daam

Diffusion attentive attribution maps for interpreting Stable Diffusion.

Language: Jupyter Notebook - Size: 2.15 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 601 - Forks: 61

omriav/blended-latent-diffusion

Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]

Language: Jupyter Notebook - Size: 9.84 MB - Last synced at: 24 days ago - Pushed at: 11 months ago - Stars: 594 - Forks: 37

some9000/StylePile

A prompt generation helper script for AUTOMATIC1111/stable-diffusion-webui and compatible forks

Language: Python - Size: 184 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 575 - Forks: 44

omriav/blended-diffusion

Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]

Language: Jupyter Notebook - Size: 42.4 MB - Last synced at: 24 days ago - Pushed at: 11 months ago - Stars: 574 - Forks: 43

kjsman/stable-diffusion-pytorch

Yet another PyTorch implementation of Stable Diffusion (probably easy to read)

Language: Python - Size: 25.4 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 554 - Forks: 62

williamyang1991/FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Language: Jupyter Notebook - Size: 9.94 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 541 - Forks: 51

microsoft/foldingdiff

Diffusion models of protein structure; trigonometry and attention are all you need!

Language: Jupyter Notebook - Size: 228 MB - Last synced at: about 23 hours ago - Pushed at: over 1 year ago - Stars: 539 - Forks: 64

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

Size: 7.85 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 507 - Forks: 15

bahjat-kawar/ddrm

[NeurIPS 2022] Denoising Diffusion Restoration Models -- Official Code Repository

Language: Python - Size: 3.14 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 506 - Forks: 51

dromara/Omega-AI

Omega-AI:基于java打造的深度学习框架,帮助你快速搭建神经网络,实现模型推理与训练,引擎支持自动求导,多线程与GPU运算,GPU支持CUDA,CUDNN。

Language: Java - Size: 256 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 491 - Forks: 59

csslc/CCSR

Official codes of CCSRv2 and CCSRv1: Improving the Stability and Efficiency of Diffusion Models for Content Consistent Super-Resolution

Language: Python - Size: 51.6 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 472 - Forks: 38

yuanchenyang/smalldiffusion

Simple and readable code for training and sampling from diffusion models

Language: Python - Size: 1.05 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 470 - Forks: 34

prs-eth/RollingDepth

[CVPR 2025] Video Depth without Video Models

Language: Python - Size: 6.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 464 - Forks: 17

afiaka87/clip-guided-diffusion

A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.

Language: Python - Size: 51.2 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 460 - Forks: 62

Auto1111SDK/Auto1111SDK

An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models

Language: Python - Size: 10.6 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 403 - Forks: 28

HaozheLiu-ST/T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language: Python - Size: 54.8 MB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 393 - Forks: 26

naver-ai/Visual-Style-Prompting

Official Pytorch implementation of "Visual Style Prompting with Swapping Self-Attention"

Language: Python - Size: 36.8 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 384 - Forks: 30

yeungchenwa/FontDiffuser

[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Language: Python - Size: 16.4 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 360 - Forks: 34

sanweiliti/RoHM

The official PyTorch code for RoHM: Robust Human Motion Reconstruction via Diffusion.

Language: Python - Size: 395 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 351 - Forks: 19

scenediffuser/Scene-Diffuser

Official implementation of CVPR23 paper "Diffusion-based Generation, Optimization, and Planning in 3D Scenes"

Language: Python - Size: 1.77 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 351 - Forks: 24

huggingface/open-muse

Open reproduction of MUSE for fast text2image generation.

Language: Python - Size: 4.33 MB - Last synced at: 8 days ago - Pushed at: 11 months ago - Stars: 348 - Forks: 29

jychoi118/ilvr_adm

ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models (ICCV 2021 Oral)

Language: Python - Size: 313 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 344 - Forks: 46

kxhit/EscherNet

[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis

Language: Python - Size: 37.9 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 335 - Forks: 19

keonlee9420/DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Language: Python - Size: 121 MB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 331 - Forks: 45

VisualComputingInstitute/diffusion-e2e-ft

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think. Accepted to WACV 2025 and NeurIPS AFM Workshop.

Language: Python - Size: 8.82 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 330 - Forks: 4

jabir-zheng/TCD

Official Repository of the paper "Trajectory Consistency Distillation"

Language: Python - Size: 100 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 327 - Forks: 13

AILab-CVC/FreeNoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

Language: Python - Size: 81.6 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 327 - Forks: 24

woctezuma/stable-diffusion-colab

Colab notebook for Stable Diffusion Hyper-SDXL.

Language: Jupyter Notebook - Size: 53.7 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 324 - Forks: 83

p1atdev/LECO

Low-rank adaptation for Erasing COncepts from diffusion models.

Language: Jupyter Notebook - Size: 3.82 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 322 - Forks: 27

lunarring/latentblending

Create butter-smooth transitions between prompts, powered by stable diffusion

Language: Python - Size: 8.71 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 312 - Forks: 24

diffusion-classifier/diffusion-classifier

Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training

Language: Python - Size: 736 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 308 - Forks: 16

kwonminki/One-sentence_Diffusion_summary

The repo for studying and sharing diffusion models.

Size: 374 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 304 - Forks: 25

nianticlabs/diffusionerf

[CVPR 2023] DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models

Language: Python - Size: 2.43 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 303 - Forks: 17

ximinng/SVGDreamer

[CVPR 2024] Official implementation for "SVGDreamer: Text Guided SVG Generation with Diffusion Model" https://arxiv.org/abs/2312.16476

Language: Python - Size: 34.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 301 - Forks: 29

qitianwu/DIFFormer

The official implementation for ICLR23 spotlight paper "DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion"

Language: Python - Size: 125 KB - Last synced at: 25 days ago - Pushed at: about 1 month ago - Stars: 295 - Forks: 32

RehgLab/RAVE

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]

Language: Python - Size: 157 MB - Last synced at: 24 days ago - Pushed at: 2 months ago - Stars: 291 - Forks: 20

zibojia/COCOCO

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

Language: Python - Size: 45.7 MB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 291 - Forks: 8

DWCTOD/ECCV2022-Papers-with-Code-Demo

收集 ECCV 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!

Size: 170 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 286 - Forks: 23

RQ-Wu/LAMP

[CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation

Language: Python - Size: 99.3 MB - Last synced at: 24 days ago - Pushed at: 12 months ago - Stars: 278 - Forks: 14

JoaoLages/diffusers-interpret

Diffusers-Interpret 🤗🧨🕵️‍♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.

Language: Jupyter Notebook - Size: 77.5 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 275 - Forks: 14

baaivision/DIVA

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

Language: Python - Size: 2.47 MB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 273 - Forks: 14

ZichengDuan/TheChosenOne

Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"

Language: Python - Size: 6.7 MB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 265 - Forks: 24

pansanity666/Awesome-Avatars

List of recent advances for human avatars, including generation, reconstruction, and editing, etc.

Size: 71.3 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 259 - Forks: 16

mihirp1998/VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

Language: Python - Size: 164 MB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 253 - Forks: 15

zhanghm1995/Forge_VFM4AD

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

Size: 34.8 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 243 - Forks: 10

Sirui-Xu/InterDiff

[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"

Language: Python - Size: 112 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 234 - Forks: 9

keonlee9420/DiffSinger

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

Language: Python - Size: 133 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 233 - Forks: 30

LeCAR-Lab/model-based-diffusion

Official implementation for the paper "Model-based Diffusion for Trajectory Optimization". Model-based diffusion (MBD) is a novel diffusion-based trajectory optimization framework that employs a dynamics model to run the reverse denoising process to generate high-quality trajectories.

Language: Python - Size: 36.9 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 230 - Forks: 16

byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

Language: Python - Size: 29.9 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 229 - Forks: 28