An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: latent-diffusion

Text-to-Audio/Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Language: Python - Size: 961 KB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 646 - Forks: 87

Stability-AI/stability-sdk

SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)

Language: Jupyter Notebook - Size: 447 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 2,439 - Forks: 343

invoke-ai/InvokeAI

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.

Language: TypeScript - Size: 327 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 24,856 - Forks: 2,527

Sanster/IOPaint

Image inpainting tool powered by SOTA AI models. Remove any unwanted objects, defects, or people from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

Language: Python - Size: 23 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 20,928 - Forks: 2,133

JoePenna/Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 3,217 - Forks: 553

Uminosachi/sd-webui-inpaint-anything

Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

Language: Python - Size: 3.44 MB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 1,231 - Forks: 110

lucidrains/naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language: Python - Size: 512 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 1,316 - Forks: 104

7abushahla/Arabic-One-DM

Code and resources for the paper: "One Stroke, One Shot: Diffusing a New Era in Arabic Handwriting Generation"

Language: Jupyter Notebook - Size: 86.2 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

ml-jku/LaM-SLidE

Code for the paper LaM-SLidE - Latent Space Modeling of Spatial Dynamical Systems via Linked Entities

Language: Python - Size: 27.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 6 - Forks: 0

parlance-zz/dualdiffusion

Fourier Dual Diffusion

Language: Python - Size: 5.64 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 49 - Forks: 1

leejet/stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

Language: C++ - Size: 21.3 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 3,997 - Forks: 363

gmongaras/Latent_Diffusion_Model_Imagenet2012

A latent flow-based diffusion model trained on the 2012 ImageNet dataset from scratch.

Language: Python - Size: 16.4 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 1

atfortes/Awesome-Controllable-Diffusion

Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.

Size: 37.6 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 452 - Forks: 28

yl4579/StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python - Size: 131 MB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 5,620 - Forks: 529

jina-ai/discoart

🪩 Create Disco Diffusion artworks in one line

Language: Python - Size: 29 MB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 3,843 - Forks: 248

jonasricker/aeroblade

[CVPR2024] AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error

Language: Python - Size: 7.24 MB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 50 - Forks: 9

symisc/tiny-dream

Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation

Language: C - Size: 134 KB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 260 - Forks: 11

carefree0910/carefree-creator

AI magics meet Infinite draw board.

Language: Jupyter Notebook - Size: 8.06 MB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 1,948 - Forks: 180

ashleykleynhans/comfyui-docker

Docker image for ComfyUI: The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Language: Shell - Size: 81.1 KB - Last synced at: 5 days ago - Pushed at: 20 days ago - Stars: 7 - Forks: 6

Qewertyy/LexicaAPI

Python wrapper for LexicaAPI (https://api.qewertyy.dev/docs). For queries or support, reach out at https://telegram.me/LexicaAPI

Language: Python - Size: 2.94 MB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 16 - Forks: 5

JiauZhang/binary-latent-diffusion

Implementation of Binary Latent Diffusion

Language: Python - Size: 799 KB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 51 - Forks: 3

IceClear/LDM-SRtuning

Train latent diffusion for real-world super-resolution.

Language: Python - Size: 2.69 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 70 - Forks: 4

segments-ai/latent-diffusion-segmentation

A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]

Language: Python - Size: 4.13 MB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 76 - Forks: 5

nihaomiao/CVPR23_LFDM

The PyTorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"

Language: Python - Size: 26.4 MB - Last synced at: 23 days ago - Pushed at: 10 months ago - Stars: 460 - Forks: 42

ashleykleynhans/invokeai-docker

Docker image for InvokeAI: Professional Creative AI Tools for Visual Media

Language: Shell - Size: 64.5 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

muhammad-fiaz/MaaagicUI

Maaagic UI is an open-source UI framework designed to empower developers with seamless integration and advanced features of AI applications.

Language: TypeScript - Size: 13.7 MB - Last synced at: 19 days ago - Pushed at: 10 months ago - Stars: 11 - Forks: 1

parlance-zz/g-diffuser-bot

Discord bot and Interface for Stable Diffusion

Language: Python - Size: 8.61 MB - Last synced at: 12 days ago - Pushed at: about 2 years ago - Stars: 279 - Forks: 21

olaviinha/NeuralImageSuperResolution

Colabs for Neural Image Enhancement.

Language: Jupyter Notebook - Size: 65.4 KB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 97 - Forks: 17

magnusviri/InvokeAI Fork of invoke-ai/InvokeAI

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.

Language: TypeScript - Size: 259 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 346 - Forks: 35

mlpc-ucsd/TokenCompose

(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision

Language: Jupyter Notebook - Size: 209 MB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 120 - Forks: 4

afondiel/how-diffusion-models-work-crash-course-DLAI

Diffusion Models crash course with Pytorch from DeepLearningAI

Language: Jupyter Notebook - Size: 6.93 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

navervision/CompoDiff

Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)

Language: Python - Size: 5.06 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 83 - Forks: 3

RichardObi/ccnet

Official repository of "Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models"

Language: Python - Size: 30.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

rmgogogo/nano-aigc

Generative models nano version for fun. No SOTA here, nano first.

Language: Jupyter Notebook - Size: 2.1 MB - Last synced at: 1 day ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

jaygshah/LDM-RR

Enhancing Amyloid PET Quantification: MRI-Guided Super-Resolution Using Latent Diffusion Models

Language: Python - Size: 212 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

cobanov/awesome-diffusion

A curated list of awesome Diffusion notebooks, tools, software, tutorials and resources.

Size: 17.6 KB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 210 - Forks: 14

WASasquatch/easydiffusion

Easy Diffusion is an advanced Stable Diffusion Notebook with a feature rich image processing suite.

Language: Jupyter Notebook - Size: 2.53 MB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 102 - Forks: 17

haiciyang/LaDiffCodec

Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.

Language: Python - Size: 22.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 48 - Forks: 3

AMfeta99/Generative_AI

This repository is a comprehensive resource for mastering generative AI, featuring in-depth notes and exciting projects. The goal is to stay updated with the latest advancements in generative AI, and explore applications in image & video generation, creative content creation. Explore the limitless possibilities of generative AI today!

Language: Jupyter Notebook - Size: 62 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

dailenson/One-DM

Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation

Language: Python - Size: 4.85 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 322 - Forks: 31

brian14708/artspace

🔮 AI Art Generator App

Language: Rust - Size: 2.79 MB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 0

AndranikSargsyan/Edufusion

DIY Stable Diffusion 1.5 implementation with minimal dependencies.

Language: Python - Size: 606 KB - Last synced at: 22 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

haideraqeeb/latent-diffusion

Text to Image Generator using Latent Diffusion Model

Language: Jupyter Notebook - Size: 839 KB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

explainingai-code/StableDiffusion-PyTorch

This repo implements a Stable Diffusion model in PyTorch with all the essential components.

Language: Python - Size: 71.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 143 - Forks: 31

serp-ai/ai-text-to-audio-latent-diffusion Fork of Harmonai-org/sample-generator

text-to-audio-latent-diffusion

Language: Python - Size: 58.2 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 8

kpthedev/stable-karlo

Upscaling Karlo text-to-image generation using Stable Diffusion v2.

Language: Python - Size: 76.2 KB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 60 - Forks: 6

mikonvergence/DiffusionFastForward

DiffusionFastForward: a free course and experimental framework for diffusion-based generative models

Language: Jupyter Notebook - Size: 4.09 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 601 - Forks: 54

JieChungChen/Diffusion-Model-Learning-Record

Notes in Chinese on diffusion-related models, with the various oddly versioned, poorly readable code reorganized to suit my own habits.

Language: Python - Size: 22.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Qewertyy/SDWaifuRobot

An AI Related Telegram Utility Bot, can be deployed on vercel

Language: Python - Size: 85.9 KB - Last synced at: 18 days ago - Pushed at: 5 months ago - Stars: 14 - Forks: 21

BarqueroGerman/BeLFusion

[ICCV 2023] Official PyTorch Implementation of "BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction".

Language: Python - Size: 7.97 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 113 - Forks: 8

AbhinavSharma07/Streamlit

Stable Diffusion

Language: Python - Size: 264 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Uminosachi/inpaint-anything

Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

Language: Python - Size: 4.29 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 224 - Forks: 28

kabachuha/InfiNet

Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2video model for extremely long video generation.

Language: Python - Size: 2.12 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 86 - Forks: 7

apapiu/transformer_latent_diffusion

Text to Image Latent Diffusion using a Transformer core

Language: Python - Size: 3.84 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 121 - Forks: 12

kiranchhatre/amuse

[CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion

Language: Python - Size: 78.8 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 73 - Forks: 3

steve-zeyu-zhang/MotionMamba

🔥 [ECCV 2024] Motion Mamba: Efficient and Long Sequence Motion Generation

Language: JavaScript - Size: 43 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 91 - Forks: 2

lopho/sd-video

Text to Video

Language: Python - Size: 2.31 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 26 - Forks: 3

tasinislam21/FashionFlow

This model synthesises high-fidelity fashion videos from single images featuring spontaneous and believable movements.

Language: Python - Size: 6.7 MB - Last synced at: 9 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

rensortino/material-crafter

Generate SVBRDF materials using Latent Diffusion Models, without leaving Blender.

Language: Python - Size: 21.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

NITHISHM2410/latent-diffusion-tf

LDM using tensorflow.

Language: Python - Size: 41.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

WiNE-iNEFF/Simple_Prompt_Generator

Simple prompt generator for Midjourney, DALL-E, Stable and Disco Diffusion, etc.

Language: Python - Size: 6.2 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 135 - Forks: 18

360CVGroup/Bridge_Diffusion_Model

Latent diffusion method for non-English language native Text-to-Image generation

Language: Python - Size: 6.63 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

koninik/WordStylist

Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023

Language: Python - Size: 1.26 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 51 - Forks: 7

Shen-Lab/LDM-3DG

[ICLR 2024] "Latent 3D Graph Diffusion" by Yuning You, Ruida Zhou, Jiwoong Park, Haotian Xu, Chao Tian, Zhangyang Wang, Yang Shen

Language: Python - Size: 21.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 23 - Forks: 6

ai-forever/KandinskyVideo

KandinskyVideo — multilingual end-to-end text2video latent diffusion model

Language: Python - Size: 184 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 142 - Forks: 18

mattroz/SDFT

Stable Diffusion Fine-Tuning techniques overview.

Language: Python - Size: 198 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

GrandpaXun242/AdaBLDM

The implementation for the paper: "A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation"

Language: Python - Size: 551 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 26 - Forks: 0

edvirs/artifex

Latent Diffusion Generation Model

Language: Python - Size: 6.84 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

teticio/audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Language: Jupyter Notebook - Size: 35.8 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 646 - Forks: 66

Corruptex/booru-dataset-gatherer

A .NET Core 3.1 Console application to gather tags and relevant information from Booru websites for Machine Learning.

Language: C# - Size: 59.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

Sid-047/VirtualME

Tryna Create a Digital Version of Me via StableDiffusion - LoRA

Language: Python - Size: 31.8 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

joanrod/figure-diffusion

Generating figures from research papers, using textual captions from the paper.

Language: Python - Size: 35.1 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

SkyWorkAIGC/SkyPaint-AI-Diffusion

An optimized text-to-image model based on Stable Diffusion. It accepts both Chinese and English text inputs and can generate high-quality images in several modern art styles.

Size: 7.74 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 630 - Forks: 37

shivamMg/stable-diffusion-on-azureml

REST APIs for StableDiffusion. Inferencing support on AzureML

Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: 21 days ago - Pushed at: almost 2 years ago - Stars: 11 - Forks: 4

guangjieguo/latent-diffusion 📦

This is an assignment for my Deep Learning class: latent diffusion implemented from scratch. The smallNORB dataset is used to train the neural networks.

Language: Python - Size: 115 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

pdoane/seed-alchemy

Frontend UI and Backend Server for Stable Diffusion models

Language: TypeScript - Size: 2.97 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 3

quickgrid/vq-compress

Image compression with pretrained latent diffusion autoencoding models.

Language: Python - Size: 97.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Hleephilip/MLVU-project

Modality Translation through Conditional Encoder-Decoder (2023-1 Machine Learning for Visual Understanding Team project)

Language: Python - Size: 1.64 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

artem-gorodetskii/WikiArt-Latent-Diffusion

Conditional denoising diffusion probabilistic model trained in latent space.

Language: Jupyter Notebook - Size: 153 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

Logeswaran123/Stable-Diffusion-Playground

An application that generates images or videos using Stable Diffusion models.

Language: Python - Size: 3.18 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 1

kpthedev/stable-karlo-colab

Port of stable-karlo for Google Colab. Upscaling Karlo text-to-image generation using Stable Diffusion v2.

Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

bharathraj-v/art_project

Mozart - A Generative Art Platform

Language: Jupyter Notebook - Size: 15.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 2

AtharvaTaras/Stable-Diffusion-Library

A library of images generated using Stable Diffusion

Size: 165 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0