GitHub topics: text-to-image-generation
etranHOLI/NanoBananaEditor
🍌 Generate stunning images and edit them conversationally with the powerful NanoBananaEditor, built on React and TypeScript using AI technology.
Language: TypeScript - Size: 109 KB - Last synced at: about 5 hours ago - Pushed at: about 7 hours ago - Stars: 0 - Forks: 0

AndreH219/freeflux
Smart and Simple Flux for GPU-poor
Language: Python - Size: 393 KB - Last synced at: about 9 hours ago - Pushed at: about 11 hours ago - Stars: 1 - Forks: 0

youweiliang/RichHF
Code for CVPR'24 best paper: Rich Human Feedback for Text-to-Image Generation (https://arxiv.org/pdf/2312.10240)
Language: Python - Size: 33.2 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 21 - Forks: 1

NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Language: Python - Size: 248 MB - Last synced at: 1 day ago - Pushed at: 17 days ago - Stars: 4,464 - Forks: 293

tuxxxu/vista
VISTA provides an open standard for JSON prompts for AI image generation, ensuring consistency, organization, and interoperability for creating, sharing, and reusing visual scene prompts.
Size: 3.91 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

markfulton/NanoBananaEditor
The most advanced Nano Banana image generator and editor application. Your central hub for AI image generation and revisions. Intuitive UI features reference images, editing with image masks, version history, and more. Powered by Gemini 2.5 Flash images API.
Language: HTML - Size: 86.9 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

ai-action/diffused
🤗 Generate images with diffusion models.
Language: Python - Size: 479 KB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 2 - Forks: 0

AIDC-AI/Awesome-Unified-Multimodal-Models
Awesome Unified Multimodal Models
Size: 14.2 MB - Last synced at: 9 days ago - Pushed at: 21 days ago - Stars: 623 - Forks: 16

Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python - Size: 4.71 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 2,332 - Forks: 215

Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.
Size: 369 KB - Last synced at: 8 days ago - Pushed at: 9 months ago - Stars: 428 - Forks: 49

201Harsh/EndVerse-AI
A PowerFul AI-Chat Bot Powerd By EndGaming
Language: JavaScript - Size: 1.43 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 0

muzishen/IMAGDressing
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.
Language: Python - Size: 47.7 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1,277 - Forks: 112

CSU-JPG/TextAtlas
A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation
Language: Python - Size: 3.8 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 74 - Forks: 0

CihangPeng/ROVI
ICCV 2025 | ROVI: A 1M-scale dataset with comprehensive image descriptions and open-vocabulary bounding box annotations for instance-grounded text-to-image generation
Language: Python - Size: 6.03 MB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

moatifbutt/awesome-diffusion-iclr-2025
List of diffusion related active submissions on OpenReview for ICLR 2025.
Size: 521 KB - Last synced at: 13 days ago - Pushed at: 10 months ago - Stars: 37 - Forks: 1

huggingface/diffusion-fast
Faster generation with text-to-image diffusion models.
Language: Python - Size: 113 KB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 225 - Forks: 15

ByteVisionLab/TokenFlow
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
Language: Python - Size: 28 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 366 - Forks: 3

FoundationVision/Infinity
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Language: Python - Size: 10.1 MB - Last synced at: 27 days ago - Pushed at: 2 months ago - Stars: 1,396 - Forks: 74

Anthonyk9999/AI-Picture-Generator
Generate stunning images from text descriptions using our AI Picture Generator. Choose from multiple models for versatile results. 🌟 #GitHub
Language: Python - Size: 18.6 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

oussema277/product-classification
Automate product classification for e-commerce with our machine learning model. Enhance user experience for sellers and buyers. 🌟📦
Language: Python - Size: 15.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

PKU-YuanGroup/UniWorld-V1
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
Language: Python - Size: 14.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 664 - Forks: 19

AKAME007/n8n-nodes-fusionbrain
Use fusionbrain.ai in your n8n workflows.
Language: TypeScript - Size: 67.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 0

Jityan/lapgan
Code Repository for "LAP-GAN: Label augmentation with perceptual loss for self-supervised text-to-image synthesis"
Language: Python - Size: 2.62 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

glami/glami-1m
The largest multilingual image-text classification dataset. It contains fashion products.
Language: Jupyter Notebook - Size: 5.43 MB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 73 - Forks: 7

yunqing-me/AttackVLM
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
Language: Python - Size: 25.1 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 207 - Forks: 16

emansarahafi/cGAN-Text-To-Image-Bird-Generation
Text-to-Image Generation with Birds using cGAN.
Language: Jupyter Notebook - Size: 672 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Robin-WZQ/TwT
Trigger without Trace: Towards Stealthy Backdoor Attack on Text-to-Image Diffusion Models
Language: Python - Size: 4.12 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

twardoch/magespace-importer
Helps importing models from a list of URLs into https://mage.space/
Language: Python - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 1

gmum/beta-CFG
This paper presents β-CFG, a dynamic guidance method for text-to-image diffusion models. Unlike standard CFG, which uses a fixed guidance scale, β-CFG adapts guidance strength over time using a β-distribution. This improves image quality, keeps sampling closer to the data manifold, and achieves better FID while maintaining prompt alignment.
Language: Python - Size: 3.68 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

basiclab/Unraveling-Information-Mix-ups
🔥 [NeurIPS 2024] A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedding Optimization
Language: Python - Size: 12.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 1

yandex-research/swd
Scale-wise Distillation of Diffusion Models
Size: 8.19 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 101 - Forks: 3

Circuit-Overtime/jackeyBot
A free text2image discord bot service on discord, implementing discord.js
Language: CSS - Size: 68.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

keshik6/grafting
Exploring Diffusion Transformer Designs via Grafting
Language: Jupyter Notebook - Size: 2.78 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 18 - Forks: 1

RianNegreiros/AiShortsVideosGenerator
Full-Stack ASP.NET Core and Next.js to create AI-Generated Short Videos with Captions using only free resources
Language: TypeScript - Size: 16.4 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 29 - Forks: 10

Pavansomisetty21/Text-to-Images-Leveraging-Flux-AI-for-Text-to-Image-Generation
we explores the fascinating domain of text-to-image generation using the powerful capabilities of the Flux API. The objective is to transform textual descriptions into vivid and accurate visual representations, leveraging state-of-the-art artificial intelligence
Language: Jupyter Notebook - Size: 3.54 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 2

gudaochangsheng/MaskUnet
[CVPR 2025] Official PyTorch implementation of Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
Language: Python - Size: 2.14 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 13 - Forks: 0

IthicalHolder/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python - Size: 4.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Parii05/text-to-image-generator
Developed an AI-based text-to-image generator using the DALL·E API from OpenAI. The project enables users to input textual descriptions and generate corresponding images. It demonstrates the power of AI in creative fields
Language: HTML - Size: 87.9 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

woctezuma/stable-diffusion-colab
Colab notebook for Stable Diffusion Hyper-SDXL.
Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 325 - Forks: 81

OSU-NLP-Group/MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Language: Python - Size: 112 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 356 - Forks: 14

Gen-Verse/Diffusion-Sharpening
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
Language: Python - Size: 2.34 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 58 - Forks: 5

Mattrg1989/LTX-Video
Official repository for LTX-Video
Language: Python - Size: 164 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
Language: Python - Size: 60.5 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1,938 - Forks: 139

py-img-gen/python-image-generation
🎨 書籍「Pythonで学ぶ画像生成」のコードを配置したリポジトリです
Language: Jupyter Notebook - Size: 74.2 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 12 - Forks: 2

ashutoshbhole1/AI-Image-Generator
😎AI Image Generator is a cutting-edge system that transforms text prompts into high-quality images using advanced deep learning models like Stable Diffusion and Flux.
Language: JavaScript - Size: 7.18 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

xmed-lab/UniEval
UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation
Language: Python - Size: 26 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 1

Correr-Zhou/MagicTailor
[IJCAI 2025] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".
Language: Python - Size: 122 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 85 - Forks: 3

hapheus/n8n-nodes-fusionbrain 📦
Use fusionbrain.ai in your n8n workflows.
Language: TypeScript - Size: 67.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 0

j-min/DSG
Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)
Language: Jupyter Notebook - Size: 4.42 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 87 - Forks: 5

pyladiesams/personalization-with-text-to-image-diffusion-models-feb2024
Get familiar with different fine-tuning techniques for text-to-image models, and learn how to teach a diffusion model a concept of your choosing
Language: Jupyter Notebook - Size: 91.4 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 2

zituitui/BELM
[NeurIPS 2024] Official implementation of "BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models".
Language: Python - Size: 48.7 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 128 - Forks: 7

abidakram01/text-to-image-ai-generator-angular
An AI-powered text-to-image generator built with Angular. Enter text prompts and generate stunning images using cutting-edge AI models.
Language: SCSS - Size: 124 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

donahowe/AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Language: Jupyter Notebook - Size: 20.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 438 - Forks: 33

FoundationVision/Liquid
Liquid: Language Models are Scalable and Unified Multi-modal Generators
Language: Python - Size: 31.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 353 - Forks: 24

RockeyCoss/SPO
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
Language: Python - Size: 30.3 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 197 - Forks: 6

darkhiem/Gen-AI-Chatbot
Voice Assistant with AI text & image generation. Features speech recognition, Gemini AI integration, Stable Diffusion image creation, voice output, and MongoDB conversation storage. Built with Streamlit for an intuitive interface. The perfect personal AI assistant for both voice and text interactions.
Language: Python - Size: 13.7 KB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

alessioborgi/Z-SAMB_StyleAligned_MultiReference-MultiModal
Novel framework for Zero-Shot Style Alignment in Text-to-Image generation, incorporating Multi-Modal Context-Awareness and Multi-Reference Style Alignment, using minimal attention sharing, ensuring consistent style transfer without fine-tuning.
Language: Jupyter Notebook - Size: 1.49 GB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

LayoutLLM-T2I/LayoutLLM-T2I
Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
Language: Python - Size: 17.3 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 45 - Forks: 0

songweige/rich-text-to-image
Rich-Text-to-Image Generation
Language: Python - Size: 41.4 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 786 - Forks: 67

haoosz/ConceptExpress
[ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Language: Python - Size: 58.6 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 63 - Forks: 8

louisYen/Gen4Gen
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
Language: Python - Size: 181 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 105 - Forks: 5

CFGpp-diffusion/CFGpp
Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR2025)
Language: Python - Size: 8.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 191 - Forks: 6

humansensinglab/ITI-GEN
[ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation
Language: Python - Size: 13.6 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 67 - Forks: 11

YangLing0818/ContextDiff
[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation
Language: Python - Size: 97 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 66 - Forks: 4

anggra474/freeflux
Smart and Simple Flux for GPU-poor
Language: Python - Size: 393 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Shaik-Nisar-Ahmed/freeflux
Language: Python - Size: 393 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Shentao-YANG/Dense_Reward_T2I
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
Language: Python - Size: 11.3 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 38 - Forks: 0

QY-H00/attention-interpolation-diffusion
[NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion
Language: Jupyter Notebook - Size: 296 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 94 - Forks: 4

mapo-t2i/mapo
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
Language: Python - Size: 5.38 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 71 - Forks: 8

PangzeCheung/SingDiffusion
[CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
Language: Python - Size: 23.4 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 66 - Forks: 4

litaotju/freeflux
Smart and Simple Flux for GPU-poor
Language: Python - Size: 392 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

j-min/VPGen
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
Language: Jupyter Notebook - Size: 5.87 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 56 - Forks: 3

xuyang-liu16/VGDiffZero
[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
Language: Python - Size: 1.07 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 1

kanugurajesh/Student-LMS
An application to make learning as fun as gaming
Language: TypeScript - Size: 334 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

TrustAIRLab/proactive_unsafe_generation
[Usenix Security 2025] On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Language: Python - Size: 171 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

ali-vilab/IDEA-Bench
Official repository of IDEA-Bench
Language: Python - Size: 14.8 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 21 - Forks: 1

Mamadou-Keita/VLM-DETECT
[ICASSP 2024] The official repo for Harnessing the Power of Large Vision Language Models for Synthetic Image Detection
Language: Python - Size: 134 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 21 - Forks: 2

Darshan-Rajanna/text-to-image-generator
Generating AI images from user Input, using Python, Diffusers, OpenCV, MediaPipe, Flask, Google Colab.
Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

AlonzoLeeeooo/LCDG
The official code implementation of "LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis".
Language: Python - Size: 114 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 32 - Forks: 4

ExplainableML/ReNO
[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
Language: Python - Size: 7 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 113 - Forks: 9

vishnux/polaris
AI-Powered Fashion Product Photo Generator
Language: Python - Size: 27 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

vit718/GenZAI-ImageGeneration_MERN
Turn text into captivating visuals with GenZAI, a full-stack web app powered by OpenAI DALL-E. Experience seamless text-to-image generation with the powerful MERN stack (Node.js, Express.js, MongoDB, React.js) and a sleek Tailwind CSS interface. Securely store your creations with Cloudinary
Language: JavaScript - Size: 1.72 MB - Last synced at: 6 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

GuoLanqing/Awesome-High-Resolution-Diffusion
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
Size: 161 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 114 - Forks: 4

MohammadShabazuddin/Image-Generation-with-DALL-E
Created a system using DALL-E to generate unique, high-quality images from text descriptions for creative applications.
Language: HTML - Size: 264 KB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

cedonulfi/pixa
Pixa is a simple AI-powered image generator using Pollinations AI, allowing users to create stunning visuals based on text prompts. Customize your images by adjusting width, height, and seed for unique results. Built with PHP and responsive design, it’s easy to use and perfect for creative projects.
Language: PHP - Size: 111 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

ChocoWu/T2I-Salad
Codes for NeurIPS 2023 paper:Imagine That! Abstract-to-Intricate Text-to-Image Synthesis with Scene Graph Hallucination Diffusion.
Language: Python - Size: 1000 Bytes - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

shahariar-shibli/Adversarial-Attack-on-POS-Tags
Adversarial Attacks on Parts of Speech: An Empirical Study in Text-to-Image Generation
Language: Jupyter Notebook - Size: 101 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

Bayunova28/GenAI_Playground_Explorations
This repository contains about my personal project about generative AI for generate text, audio & image
Language: Jupyter Notebook - Size: 21.2 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ArdeniusAI/ai-toolkit Fork of ostris/ai-toolkit
text to image AI training
Language: Python - Size: 28.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

nazmul-karim170/SAVE
Implementation of "SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing" Paper
Language: Python - Size: 6.54 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 8 - Forks: 1

textboost/textboost.github.io
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder
Language: JavaScript - Size: 187 MB - Last synced at: 2 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

RomailButt/Text-to-image-using-FLUX.1-dev
Text To Image using hugging face API FLUX.1
Language: Python - Size: 30.3 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

SubramanyaKS/PhrasePic
A Text to Image generator website with the support of hugging face and stability ai,with authentication using next-auth. The project is under development
Language: TypeScript - Size: 893 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Marco4413/Text2Image
A Text to Image converter written in Python.
Language: Python - Size: 455 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

PSYGNEX/dalle3mwp
"DALLE-3 MASTERWARE +" is a wrapper application of OpenAI's Dall-e-3 built in Gradio to provide a GUI interface for user friendly interactions with the official OpenAI API
Language: Python - Size: 207 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Islam-hady9/Generative-AI-Models
Generative AI Models is a comprehensive repository dedicated to the implementation of cutting-edge generative AI models using Python. It features various models, including those for image captioning and text-to-image generation, leveraging advanced architectures like Vision Transformers (ViT), GPT-2, and Stable Diffusion.
Language: Jupyter Notebook - Size: 11.2 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

zeyofu/Commonsense-T2I
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]
Language: Python - Size: 81.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 0

subratamondal1/ai_photo_generator
Using Stable Diffusion API to generate and enhance photo of the users.
Language: Python - Size: 1.88 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

FrostBird347/ParrotNightmares
An extremely cursed text to image AI which generates terrifying parrot abominations.
Language: Jupyter Notebook - Size: 26.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

1jsingh/Divide-Evaluate-and-Refine
Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Language: Jupyter Notebook - Size: 48.6 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 23 - Forks: 1
