GitHub topics: latent-diffusion
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Language: Python - Size: 961 KB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 646 - Forks: 87

Stability-AI/stability-sdk
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Language: Jupyter Notebook - Size: 447 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 2,439 - Forks: 343

invoke-ai/InvokeAI
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
Language: TypeScript - Size: 327 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 24,856 - Forks: 2,527

Sanster/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Language: Python - Size: 23 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 20,928 - Forks: 2,133

JoePenna/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.
Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 3,217 - Forks: 553

Uminosachi/sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Language: Python - Size: 3.44 MB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 1,231 - Forks: 110

lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language: Python - Size: 512 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 1,316 - Forks: 104

7abushahla/Arabic-One-DM
Code and resources for the paper: “One Stroke, One Shot: Diffusing a New Era in Arabic Handwriting Generation"
Language: Jupyter Notebook - Size: 86.2 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

ml-jku/LaM-SLidE
Code for the paper LaM-SLidE - Latent Space Modeling of Spatial Dynamical Systems via Linked Entities
Language: Python - Size: 27.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 6 - Forks: 0

parlance-zz/dualdiffusion
Fourier Dual Diffusion
Language: Python - Size: 5.64 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 49 - Forks: 1

leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
Language: C++ - Size: 21.3 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 3,997 - Forks: 363

gmongaras/Latent_Diffusion_Model_Imagenet2012
A latent flow-based diffusion model trained on the 2012 ImageNet dataset from scratch.
Language: Python - Size: 16.4 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 1

atfortes/Awesome-Controllable-Diffusion
Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.
Size: 37.6 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 452 - Forks: 28

yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language: Python - Size: 131 MB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 5,620 - Forks: 529

jina-ai/discoart
🪩 Create Disco Diffusion artworks in one line
Language: Python - Size: 29 MB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 3,843 - Forks: 248

jonasricker/aeroblade
[CVPR2024] AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error
Language: Python - Size: 7.24 MB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 50 - Forks: 9

symisc/tiny-dream
Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation
Language: C - Size: 134 KB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 260 - Forks: 11

carefree0910/carefree-creator
AI magics meet Infinite draw board.
Language: Jupyter Notebook - Size: 8.06 MB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 1,948 - Forks: 180

ashleykleynhans/comfyui-docker
Docker image for ComfyUI: The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
Language: Shell - Size: 81.1 KB - Last synced at: 5 days ago - Pushed at: 20 days ago - Stars: 7 - Forks: 6

Qewertyy/LexicaAPI
Python Wrapper for LexicaAPI https://api.qewertyy.dev/docs , for Queries/Support reach out at https://telegram.me/LexicaAPI
Language: Python - Size: 2.94 MB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 16 - Forks: 5

JiauZhang/binary-latent-diffusion
Implementation of Binary Latent Diffusion
Language: Python - Size: 799 KB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 51 - Forks: 3

IceClear/LDM-SRtuning
Train latent diffusion for real-world super-resolution.
Language: Python - Size: 2.69 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 70 - Forks: 4

segments-ai/latent-diffusion-segmentation
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]
Language: Python - Size: 4.13 MB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 76 - Forks: 5

nihaomiao/CVPR23_LFDM
The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"
Language: Python - Size: 26.4 MB - Last synced at: 23 days ago - Pushed at: 10 months ago - Stars: 460 - Forks: 42

ashleykleynhans/invokeai-docker
Docker image for InvokeAI: Professional Creative AI Tools for Visual Media
Language: Shell - Size: 64.5 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

muhammad-fiaz/MaaagicUI
Maaagic UI is an open-source UI framework designed to empower developers with seamless integration and advanced features of AI applications.
Language: TypeScript - Size: 13.7 MB - Last synced at: 19 days ago - Pushed at: 10 months ago - Stars: 11 - Forks: 1

parlance-zz/g-diffuser-bot
Discord bot and Interface for Stable Diffusion
Language: Python - Size: 8.61 MB - Last synced at: 12 days ago - Pushed at: about 2 years ago - Stars: 279 - Forks: 21

olaviinha/NeuralImageSuperResolution
Colabs for Neural Image Enhancement.
Language: Jupyter Notebook - Size: 65.4 KB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 97 - Forks: 17

magnusviri/InvokeAI Fork of invoke-ai/InvokeAI
About Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
Language: TypeScript - Size: 259 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 346 - Forks: 35

mlpc-ucsd/TokenCompose
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
Language: Jupyter Notebook - Size: 209 MB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 120 - Forks: 4

afondiel/how-diffusion-models-work-crash-course-DLAI
Diffusion Models crash course with Pytorch from DeepLearningAI
Language: Jupyter Notebook - Size: 6.93 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

navervision/CompoDiff
Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)
Language: Python - Size: 5.06 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 83 - Forks: 3

RichardObi/ccnet
Official repository of "Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models"
Language: Python - Size: 30.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

rmgogogo/nano-aigc
Generative models nano version for fun. No STOA here, nano first.
Language: Jupyter Notebook - Size: 2.1 MB - Last synced at: 1 day ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

jaygshah/LDM-RR
Enhancing Amyloid PET Quantification: MRI-Guided Super-Resolution Using Latent Diffusion Models
Language: Python - Size: 212 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

cobanov/awesome-diffusion
A curated list of awesome Diffusion notebooks, tools, software, tutorials and resources.
Size: 17.6 KB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 210 - Forks: 14

WASasquatch/easydiffusion
Easy Diffusion is an advanced Stable Diffusion Notebook with a feature rich image processing suite.
Language: Jupyter Notebook - Size: 2.53 MB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 102 - Forks: 17

haiciyang/LaDiffCodec
Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.
Language: Python - Size: 22.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 48 - Forks: 3

AMfeta99/Generative_AI
This repository is a comprehensive resource for mastering generative AI, featuring in-depth notes and exciting projects. The goal is to stay updated with the latest advancements in generative AI, and explore applications in image & video generation, creative content creation. Explore the limitless possibilities of generative AI today!
Language: Jupyter Notebook - Size: 62 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

dailenson/One-DM
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
Language: Python - Size: 4.85 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 322 - Forks: 31

brian14708/artspace
🔮 AI Art Generator App
Language: Rust - Size: 2.79 MB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 0

AndranikSargsyan/Edufusion
DIY Stable Diffusion 1.5 implementation with minimal dependencies.
Language: Python - Size: 606 KB - Last synced at: 22 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

haideraqeeb/latent-diffusion
Text to Image Generator using Latent Diffusion Model
Language: Jupyter Notebook - Size: 839 KB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

explainingai-code/StableDiffusion-PyTorch
This repo implements a Stable Diffusion model in PyTorch with all the essential components.
Language: Python - Size: 71.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 143 - Forks: 31

serp-ai/ai-text-to-audio-latent-diffusion Fork of Harmonai-org/sample-generator
text-to-audio-latent-diffusion
Language: Python - Size: 58.2 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 8

kpthedev/stable-karlo
Upscaling Karlo text-to-image generation using Stable Diffusion v2.
Language: Python - Size: 76.2 KB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 60 - Forks: 6

mikonvergence/DiffusionFastForward
DiffusionFastForward: a free course and experimental framework for diffusion-based generative models
Language: Jupyter Notebook - Size: 4.09 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 601 - Forks: 54

JieChungChen/Diffusion-Model-Learning-Record
中文記錄Diffusion相關模型的知識,並用自己的習慣重新整理各種奇怪版本、可讀性糟糕的code
Language: Python - Size: 22.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Qewertyy/SDWaifuRobot
An AI Related Telegram Utility Bot, can be deployed on vercel
Language: Python - Size: 85.9 KB - Last synced at: 18 days ago - Pushed at: 5 months ago - Stars: 14 - Forks: 21

BarqueroGerman/BeLFusion
[ICCV2023] Official PyTorch Implementation of "BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction". ICCV 2023
Language: Python - Size: 7.97 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 113 - Forks: 8

AbhinavSharma07/Streamlit
Stable Diffusion
Language: Python - Size: 264 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Uminosachi/inpaint-anything
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Language: Python - Size: 4.29 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 224 - Forks: 28

kabachuha/InfiNet
Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2video model for extremely long video generation.
Language: Python - Size: 2.12 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 86 - Forks: 7

apapiu/transformer_latent_diffusion
Text to Image Latent Diffusion using a Transformer core
Language: Python - Size: 3.84 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 121 - Forks: 12

kiranchhatre/amuse
[CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion
Language: Python - Size: 78.8 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 73 - Forks: 3

steve-zeyu-zhang/MotionMamba
🔥 [ECCV 2024] Motion Mamba: Efficient and Long Sequence Motion Generation
Language: JavaScript - Size: 43 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 91 - Forks: 2

lopho/sd-video
Text to Video
Language: Python - Size: 2.31 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 26 - Forks: 3

tasinislam21/FashionFlow
This model synthesises high-fidelity fashion videos from single images featuring spontaneous and believable movements.
Language: Python - Size: 6.7 MB - Last synced at: 9 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

rensortino/material-crafter
Generate SVBRDF materials using Latent Diffusion Models, without leaving Blender.
Language: Python - Size: 21.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

NITHISHM2410/latent-diffusion-tf
LDM using tensorflow.
Language: Python - Size: 41.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

WiNE-iNEFF/Simple_Prompt_Generator
Simple prompt generator for Midjourney, DALLe, Stable and Disco Diffusion, and etc.
Language: Python - Size: 6.2 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 135 - Forks: 18

360CVGroup/Bridge_Diffusion_Model
Latent diffusion method for non-English language native Text-to-Image generation
Language: Python - Size: 6.63 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

koninik/WordStylist
Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023
Language: Python - Size: 1.26 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 51 - Forks: 7

Shen-Lab/LDM-3DG
[ICLR 2024] "Latent 3D Graph Diffusion" by Yuning You, Ruida Zhou, Jiwoong Park, Haotian Xu, Chao Tian, Zhangyang Wang, Yang Shen
Language: Python - Size: 21.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 23 - Forks: 6

ai-forever/KandinskyVideo
KandinskyVideo — multilingual end-to-end text2video latent diffusion model
Language: Python - Size: 184 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 142 - Forks: 18

mattroz/SDFT
Stable Diffusion Fine-Tuning techniques overview.
Language: Python - Size: 198 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

GrandpaXun242/AdaBLDM
The implement for paper : "A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation"
Language: Python - Size: 551 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 26 - Forks: 0

edvirs/artifex
Latent Diffusion Generation Model
Language: Python - Size: 6.84 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
Language: Jupyter Notebook - Size: 35.8 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 646 - Forks: 66

Corruptex/booru-dataset-gatherer
A .NET Core 3.1 Console application to gather tags and relevant information from Booru websites for Machine Learning.
Language: C# - Size: 59.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

Sid-047/VirtualME
Tryna Create a Digital Version of Me via StableDiffusion - LoRA
Language: Python - Size: 31.8 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

joanrod/figure-diffusion
Generating figures from research papers, using textual captions from the paper.
Language: Python - Size: 35.1 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

SkyWorkAIGC/SkyPaint-AI-Diffusion
基于Stable Diffusion优化的AI绘画模型。支持输入中英文文本,可生成多种现代艺术风格的高质量图像。| An optimized text-to-image model based on Stable Diffusion. Both Chinese and English text inputs are available to generate images. The model can generate high-quality images in several modern art styles.
Size: 7.74 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 630 - Forks: 37

shivamMg/stable-diffusion-on-azureml
REST APIs for StableDiffusion. Inferencing support on AzureML
Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: 21 days ago - Pushed at: almost 2 years ago - Stars: 11 - Forks: 4

guangjieguo/latent-diffusion 📦
This is for an assignment for my Deep Learning class. What I did is doing latent diffusion from scratch. smallNORB Dataset is used to train the neural networks.
Language: Python - Size: 115 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

pdoane/seed-alchemy
Frontend UI and Backend Server for Stable Diffusion models
Language: TypeScript - Size: 2.97 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 3

quickgrid/vq-compress
Image compression with pretrained latent diffusion autoencoding models.
Language: Python - Size: 97.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Hleephilip/MLVU-project
Modality Translation through Conditional Encoder-Decoder (2023-1 Machine Learning for Visual Understanding Team project)
Language: Python - Size: 1.64 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

artem-gorodetskii/WikiArt-Latent-Diffusion
Conditional denoising diffusion probabilistic model trained in latent space.
Language: Jupyter Notebook - Size: 153 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

Logeswaran123/Stable-Diffusion-Playground
An application that generates images or videos using Stable Diffusion models.
Language: Python - Size: 3.18 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 1

kpthedev/stable-karlo-colab
Port of stable-karlo for Google Colab. Upscaling Karlo text-to-image generation using Stable Diffusion v2.
Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

bharathraj-v/art_project
Mozart - A Generative Art Platform
Language: Jupyter Notebook - Size: 15.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 2

AtharvaTaras/Stable-Diffusion-Library
A library of images generated using Stable Diffusion
Size: 165 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
