An open API service providing repository metadata for many open source software ecosystems.

Topic: "text-to-image"

lucidrains/DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Language: Python - Size: 3.76 MB - Last synced at: 6 months ago - Pushed at: 12 months ago - Stars: 11,112 - Forks: 1,088

lucidrains/imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Language: Python - Size: 1.07 MB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 8,228 - Forks: 782

XavierXiao/Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language: Jupyter Notebook - Size: 5.71 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 7,693 - Forks: 801

lucidrains/DALLE-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

Language: Python - Size: 13.5 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 5,567 - Forks: 643

lucidrains/deep-daze

Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun

Language: Python - Size: 6.68 MB - Last synced at: 12 days ago - Pushed at: about 3 years ago - Stars: 4,365 - Forks: 319

promptslab/Awesome-Prompt-Engineering

This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc

Language: Python - Size: 187 KB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 4,356 - Forks: 409

kuprel/min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Language: Python - Size: 46.5 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 3,489 - Forks: 253

YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy

Diffusion model papers, survey, and taxonomy

Size: 272 KB - Last synced at: 13 days ago - Pushed at: about 2 months ago - Stars: 3,157 - Forks: 263

jamez-bondos/awesome-gpt4o-images

Awesome curated collection of GPT-4o images & prompts. Explore diverse AI-generated art styles (Ghibli, 3D, etc.) from OpenAI's latest model.

Size: 19 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3,107 - Forks: 262

ai-forever/Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language: Jupyter Notebook - Size: 37.3 MB - Last synced at: 9 days ago - Pushed at: 12 months ago - Stars: 2,789 - Forks: 312

filipecalegario/awesome-generative-ai

A curated list of Generative AI tools, works, models, and references

Size: 1.56 MB - Last synced at: 6 days ago - Pushed at: 14 days ago - Stars: 2,778 - Forks: 460

saharmor/dalle-playground

A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini)

Language: JavaScript - Size: 3.01 MB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 2,767 - Forks: 598

nerdyrodent/VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

Language: Python - Size: 31.7 MB - Last synced at: 9 days ago - Pushed at: over 2 years ago - Stars: 2,649 - Forks: 432

lucidrains/big-sleep

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun

Language: Python - Size: 6.89 MB - Last synced at: 12 days ago - Pushed at: about 3 years ago - Stars: 2,572 - Forks: 303

FurkanGozukara/Stable-Diffusion

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, Kaggle, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya, Midjourney, RunPod

Language: Jupyter Notebook - Size: 3.36 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 2,396 - Forks: 324

Yutong-Zhou-cv/Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

Size: 69.2 MB - Last synced at: about 2 hours ago - Pushed at: about 3 hours ago - Stars: 2,325 - Forks: 200

SamurAIGPT/AI-Youtube-Shorts-Generator

A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

Language: Python - Size: 99 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 2,081 - Forks: 284

carefree0910/carefree-creator

AI magics meet Infinite draw board.

Language: Jupyter Notebook - Size: 8.06 MB - Last synced at: 10 days ago - Pushed at: 12 months ago - Stars: 1,948 - Forks: 180

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Language: Python - Size: 13.5 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1,877 - Forks: 133

YangLing0818/RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Language: Jupyter Notebook - Size: 64.2 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 1,793 - Forks: 102

THUDM/CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

Language: Python - Size: 12.4 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 1,778 - Forks: 178

omerbt/TokenFlow

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)

Language: Python - Size: 27.4 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 1,649 - Forks: 139

ai-forever/ru-dalle

Generate images from texts. In Russian

Language: Jupyter Notebook - Size: 26.9 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 1,647 - Forks: 245

TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Language: Python - Size: 37 MB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 1,548 - Forks: 127

fofr/cog-face-to-many

Turn any face into a video game character, pixel art, claymation, 3D or toy

Language: Python - Size: 32.2 KB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 1,327 - Forks: 205

lukasHoel/text2room

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

Language: Python - Size: 8.85 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,051 - Forks: 73

Lightricks/ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

Language: Python - Size: 4.45 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 1,027 - Forks: 82

PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models

A collection of resources on controllable generation with text-to-image diffusion models.

Size: 3.04 MB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 1,027 - Forks: 27

omerbt/MultiDiffusion

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Language: Jupyter Notebook - Size: 6.93 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 1,027 - Forks: 59

ddPn08/Radiata

Stable diffusion webui based on diffusers.

Language: Python - Size: 15.6 MB - Last synced at: about 13 hours ago - Pushed at: over 1 year ago - Stars: 981 - Forks: 69

THUDM/CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Language: Python - Size: 24.1 MB - Last synced at: 13 days ago - Pushed at: 25 days ago - Stars: 975 - Forks: 71

FoundationVision/Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Language: Python - Size: 9.83 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 957 - Forks: 40

THUDM/CogView2

official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"

Language: Python - Size: 148 KB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 951 - Forks: 77

lucidrains/muse-maskgit-pytorch

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Language: Python - Size: 285 KB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 893 - Forks: 84

Shilin-LU/TF-ICON

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)

Language: Python - Size: 75.1 MB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 813 - Forks: 104

haofanwang/Lora-for-Diffusers

The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥

Language: Python - Size: 97.7 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 801 - Forks: 53

eps696/aphantasia

CLIP + FFT/DWT/RGB = text to image/video

Language: Python - Size: 35.2 MB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 786 - Forks: 103

mfrashad/text2art

AI-powered Text-to-Art Generator - Text2Art.com

Language: Jupyter Notebook - Size: 31.5 MB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 785 - Forks: 206

fboulnois/stable-diffusion-docker

Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.

Language: Python - Size: 666 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 745 - Forks: 132

vicgalle/stable-diffusion-aesthetic-gradients

Personalization for Stable Diffusion via Aesthetic Gradients 🎨

Language: Jupyter Notebook - Size: 92.5 MB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 730 - Forks: 62

yuval-alaluf/Attend-and-Excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

Language: Jupyter Notebook - Size: 103 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 729 - Forks: 62

jianzhnie/awesome-text-to-video

A Survey on Text-to-Video Generation/Synthesis.

Size: 41 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 705 - Forks: 89

SkyWorkAIGC/SkyPaint-AI-Diffusion

基于Stable Diffusion优化的AI绘画模型。支持输入中英文文本,可生成多种现代艺术风格的高质量图像。| An optimized text-to-image model based on Stable Diffusion. Both Chinese and English text inputs are available to generate images. The model can generate high-quality images in several modern art styles.

Size: 7.74 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 630 - Forks: 37

fofr/cog-face-to-sticker

face-to-sticker

Language: Python - Size: 72.3 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 624 - Forks: 63

PaddlePaddle/PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

Language: Python - Size: 163 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 621 - Forks: 204

ChenWu98/cycle-diffusion

[ICCV 2023] A latent space for stochastic diffusion models

Language: Python - Size: 52.5 MB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 617 - Forks: 36

zsdonghao/text-to-image

Generative Adversarial Text to Image Synthesis / Please Star -->

Language: Python - Size: 755 KB - Last synced at: 19 days ago - Pushed at: about 4 years ago - Stars: 602 - Forks: 161

omriav/blended-latent-diffusion

Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]

Language: Jupyter Notebook - Size: 9.84 MB - Last synced at: 26 days ago - Pushed at: 11 months ago - Stars: 594 - Forks: 37

limuloo/MIGC

[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)

Language: Python - Size: 33.4 MB - Last synced at: 26 days ago - Pushed at: 2 months ago - Stars: 588 - Forks: 28

omriav/blended-diffusion

Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]

Language: Jupyter Notebook - Size: 42.4 MB - Last synced at: 26 days ago - Pushed at: 11 months ago - Stars: 574 - Forks: 43

gojasper/flash-diffusion

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)

Language: Python - Size: 50.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 566 - Forks: 40

ironjr/semantic-draw

Official code for the CVPR 2025 paper "SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models."

Language: Jupyter Notebook - Size: 303 MB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 552 - Forks: 48

akanimax/T2F

T2F: text to face generation using Deep Learning

Language: Python - Size: 498 MB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 548 - Forks: 100

AlonzoLeeeooo/awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

Language: TeX - Size: 2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 539 - Forks: 29

lucidrains/parti-pytorch

Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch

Language: Python - Size: 339 KB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 530 - Forks: 24

google/break-a-scene

Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]

Language: Python - Size: 13.1 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 520 - Forks: 25

jaketae/storyteller

Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech

Language: Python - Size: 4.59 MB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 519 - Forks: 63

AlaaLab/InstructCV

[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"

Language: Python - Size: 76.1 MB - Last synced at: 8 months ago - Pushed at: 12 months ago - Stars: 516 - Forks: 46

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

Size: 7.85 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 507 - Forks: 15

SamurAIGPT/Text-To-Video-AI

Generate video from text using AI

Language: Jupyter Notebook - Size: 15.8 MB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 478 - Forks: 170

TonyLianLong/LLM-groundedDiffusion

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD, TMLR 2024)

Language: Python - Size: 268 KB - Last synced at: 26 days ago - Pushed at: 8 months ago - Stars: 465 - Forks: 33

afiaka87/clip-guided-diffusion

A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.

Language: Python - Size: 51.2 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 460 - Forks: 62

atfortes/Awesome-Controllable-Diffusion

Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.

Size: 37.6 MB - Last synced at: 1 day ago - Pushed at: 14 days ago - Stars: 458 - Forks: 28

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

Language: HTML - Size: 12.7 MB - Last synced at: 9 days ago - Pushed at: 19 days ago - Stars: 455 - Forks: 26

EleutherAI/DALLE-mtf

Open-AI's DALL-E for large scale training in mesh-tensorflow.

Language: Python - Size: 272 KB - Last synced at: 18 days ago - Pushed at: about 3 years ago - Stars: 433 - Forks: 46

aelnouby/Text-to-Image-Synthesis

Pytorch implementation of Generative Adversarial Text-to-Image Synthesis paper

Language: Python - Size: 454 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 410 - Forks: 90

Auto1111SDK/Auto1111SDK

An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models

Language: Python - Size: 10.6 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 403 - Forks: 28

Shilin-LU/MACE

[CVPR 2024] "MACE: Mass Concept Erasure in Diffusion Models" (Official Implementation)

Language: Jupyter Notebook - Size: 28.1 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 392 - Forks: 32

nerdyrodent/CLIP-Guided-Diffusion

Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.

Language: Python - Size: 2.09 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 386 - Forks: 49

Capsize-Games/airunner

A privacy focused, local-first, multi-modal inference engine and agent platform for running LLMs, image generation, speech processing, and tool-based automation

Language: Python - Size: 21.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 378 - Forks: 35

open-mmlab/StyleShot

StyleShot: A SnapShot on Any Style. 一款可以迁移任意风格到任意内容的模型,无需针对图片微调,即能生成高质量的个性风格化图片!

Language: Python - Size: 97.1 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 372 - Forks: 23

garibida/cross-image-attention

Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"

Language: Python - Size: 64.6 MB - Last synced at: 26 days ago - Pushed at: 12 months ago - Stars: 359 - Forks: 27

jonathandinu/ai4artists

A list of AI Art courses, tools, libraries, people, and places.

Size: 661 KB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 355 - Forks: 25

FoundationVision/Liquid

Liquid: Language Models are Scalable and Unified Multi-modal Generators

Language: Python - Size: 31.6 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 353 - Forks: 24

OSU-NLP-Group/MagicBrush

[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

Language: Python - Size: 112 MB - Last synced at: 14 days ago - Pushed at: 2 months ago - Stars: 345 - Forks: 15

sayakpaul/diffusers-torchao

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

Language: Python - Size: 183 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 340 - Forks: 11

awekrx/ChatGPT-MidJourney-prompt

This is a ChatGPT based prompt generation model for MidJorney. The purpose of this model is to simplify the creation of images and increase their creativity. By introducing a partial hint, ChatGPT creates a follow-up that can be used to stimulate creativity and provide new ideas.

Language: Python - Size: 11.7 MB - Last synced at: 11 days ago - Pushed at: about 2 years ago - Stars: 336 - Forks: 51

jabir-zheng/TCD

Official Repository of the paper "Trajectory Consistency Distillation"

Language: Python - Size: 100 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 327 - Forks: 13

woctezuma/stable-diffusion-colab

Colab notebook for Stable Diffusion Hyper-SDXL.

Language: Jupyter Notebook - Size: 53.7 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 324 - Forks: 83

mkshing/e4t-diffusion

Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models

Language: Python - Size: 4.04 MB - Last synced at: 27 days ago - Pushed at: almost 2 years ago - Stars: 324 - Forks: 24

tobran/DF-GAN

[CVPR2022 oral] A Simple and Effective Baseline for Text-to-Image Synthesis

Language: Python - Size: 3.26 MB - Last synced at: 20 days ago - Pushed at: about 2 years ago - Stars: 314 - Forks: 69

MohamadZeina/Disco_Diffusion_Local

Getting the latest versions of Disco Diffusion to work locally, instead of colab. Including how I run this on Windows, despite some Linux only dependencies ;)

Language: Jupyter Notebook - Size: 2.16 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 312 - Forks: 36

AssemblyAI-Community/MinImagen

MinImagen: A minimal implementation of the Imagen text-to-image model

Language: Python - Size: 6.53 MB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 302 - Forks: 57

viiika/Meissonic

[ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Language: Python - Size: 148 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 289 - Forks: 10

rinnakk/japanese-stable-diffusion

Japanese Stable Diffusion is a Japanese specific latent text-to-image diffusion model capable of generating photo-realistic images given any text input.

Language: Jupyter Notebook - Size: 3.24 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 274 - Forks: 15

SamurAIGPT/AI-Faceless-Video-Generator

Generate a video script, voice and a talking face completely with AI

Language: Jupyter Notebook - Size: 16.6 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 270 - Forks: 42

kfirgoldberg/ConceptLab

Official Implementation for "ConceptLab: Creative Generation using Diffusion Prior Constraints"

Language: Python - Size: 137 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 250 - Forks: 18

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

Language: Python - Size: 29.9 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 247 - Forks: 5

Karine-Huang/T2I-CompBench

[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation

Language: Python - Size: 77.4 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 247 - Forks: 11

wooyeolbaek/attention-map-diffusers

🚀 Cross attention map tools for huggingface/diffusers

Language: Python - Size: 7.89 MB - Last synced at: 15 days ago - Pushed at: 3 months ago - Stars: 247 - Forks: 19

tobran/GALIP

[CVPR2023] A faster, smaller, and better text-to-image model for large-scale training

Language: Python - Size: 1.17 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 238 - Forks: 30

lucidrains/perfusion-pytorch

Implementation of Key-Locked Rank One Editing, from Nvidia AI

Language: Python - Size: 3.14 MB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 233 - Forks: 7

finegrain-ai/refiners

A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation

Language: Python - Size: 88 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 230 - Forks: 27

byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

Language: Python - Size: 29.9 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 229 - Forks: 28

KwokKwok/Silo

多模型同时对话、文生图,纯前端。Multi-model simultaneous chat、text-to-image generation, all done through pure front-end (API mode, no server-side needed).

Language: JavaScript - Size: 2.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 226 - Forks: 25

garibida/ReNoise-Inversion

Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"

Language: Python - Size: 8.71 MB - Last synced at: 27 days ago - Pushed at: 10 months ago - Stars: 222 - Forks: 8

VinAIResearch/Anti-DreamBooth

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)

Language: Python - Size: 106 MB - Last synced at: 27 days ago - Pushed at: 4 months ago - Stars: 220 - Forks: 19

yeungchenwa/Recommendations-Diffusion-Text-Image

A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten generation, scene text recognition and scene text detection.

Size: 2.32 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 216 - Forks: 6

xuyang-liu16/Awesome-Generation-Acceleration

📚 Collection of awesome generation acceleration resources.

Size: 637 KB - Last synced at: about 8 hours ago - Pushed at: about 9 hours ago - Stars: 215 - Forks: 6

lzhbrian/arbitrary-text-to-image-papers

A collection of arbitrary text to image papers with code (constantly updating)

Size: 19.5 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 214 - Forks: 25