GitHub topics: text-to-image
jeree02847/Dream
Dream is a collaborative platform that helps users explore and share their creative visions. It offers tools for brainstorming, visual storytelling, and community feedback, making idea development easier and more engaging.
Size: 3.91 KB - Last synced at: about 11 hours ago - Pushed at: about 12 hours ago - Stars: 0 - Forks: 0

geoffsmith82/Symposium2023
Demonstrates Voice Recognition, Text to Speech, Language Translation, OAuth2, Image Generation, Face Detection and Voice Chatbot.
Language: Pascal - Size: 13.7 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 57 - Forks: 13

KanekilTheAogiri/awesome-gpt4o-images
Awesome curated collection of GPT-4o images & prompts. Explore diverse AI-generated art styles (Ghibli, 3D, etc.) from OpenAI's latest model.
Size: 20.5 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 1

Capsize-Games/airunner
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
Language: Python - Size: 29.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1,197 - Forks: 96

promptslab/Awesome-Prompt-Engineering
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
Language: Python - Size: 187 KB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 4,604 - Forks: 434

nikhom14/Real-Time-Chatbot
# Real-Time Chatbot with Emotion-Based ResponsesThis project features a real-time chatbot that uses rule-based logic and emotion triggers for meaningful conversations. Users can interact through quick buttons or type messages, enhancing their experience. 🐙✨
Language: CSS - Size: 35.2 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

ergonomech/Art-Style-Fusion-Prompt-Enginner
Art Style Fusion Prompt Engineering is a Gradio app that blends art styles, descriptions, and artist recommendations into prompts for T5 text encoders (Tested Heavily with Flux)
Language: Python - Size: 3.65 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

moatifbutt/awesome-diffusion-iclr-2025
List of diffusion related active submissions on OpenReview for ICLR 2025.
Size: 521 KB - Last synced at: about 22 hours ago - Pushed at: 8 months ago - Stars: 31 - Forks: 0

PaddlePaddle/PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Language: Python - Size: 179 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 656 - Forks: 218

Isi-dev/Google-Colab_Notebooks
A Collection of Google Colab Notebooks for various projects
Language: Jupyter Notebook - Size: 18 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 117 - Forks: 58

nglcobdai/text2image
This is a generate image from text.
Language: Python - Size: 5.51 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

omeregev/click2mask
[AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper.
Language: Python - Size: 62.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 17 - Forks: 2

DarshParikh25/imaginAI
imaginAI is a full-stack MERN-based SaaS app that transforms text prompts into AI-generated images using ClipDrop API. It includes OTP email verification via Nodemailer while registration, a credit system, and Razorpay integration for buying credits. With a responsive TailwindCSS UI, imaginAI makes image creation faster, easier, and more secure.
Language: JavaScript - Size: 17.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

yadavnikhil03/text-to-handwriting
Hate writing by hand? Let this tool do it for you! Just type, and it converts your text into realistic handwriting—quick, effortless, and stress-free. Perfect for assignments, notes, or anything else!
Language: JavaScript - Size: 27.3 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

BaronAlviar/stable-diffusion-3.5-lora-finetuning
Language: Python - Size: 41 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

joshirugved11/imagine
Imagine is an state of the art image generation model
Language: Python - Size: 1000 Bytes - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

aitoollist/awesome-ai-tool-list
An awesome directory of AI tools. The list here is the data source for the searchable web directory @ https://www.aitoollist.org . Discover tools to supercharge your AI journey. Star this repo and join the AI revolution!
Size: 72.3 KB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 11 - Forks: 30

kastalimohammed1965/CLIP-fine-tune-registers-gated
Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!
Language: Python - Size: 16.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3 - Forks: 1

zanvari/stable-diffusion-lab
Hands-on tutorials for generating and editing images using Stable Diffusion with Hugging Face Diffusers — includes text-to-image, inpainting, and image-to-image pipelines.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

AndreH219/freeflux
Smart and Simple Flux for GPU-poor
Language: Python - Size: 393 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

lunaniro/Neural.Image_Genv3.0
Neural.Image_Genv3.0 is web app. hacker-inspired interface. Powered by reverse engineered API!!
Size: 1000 Bytes - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 6 - Forks: 0

atfortes/Awesome-Controllable-Diffusion
Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.
Size: 37.6 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 470 - Forks: 30

sayakpaul/diffusers-torchao
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
Language: Python - Size: 187 KB - Last synced at: 1 day ago - Pushed at: 23 days ago - Stars: 360 - Forks: 13

MiniMax-AI/MiniMax-MCP
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
Language: Python - Size: 113 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 570 - Forks: 62

KwokKwok/Silo
多模型同时对话、文生图,纯前端。Multi-model simultaneous chat、text-to-image generation, all done through pure front-end (API mode, no server-side needed).
Language: JavaScript - Size: 2.32 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 235 - Forks: 25

ai-action/diffused
🤗 Generate images with diffusion models.
Language: Python - Size: 452 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

ZiyiZhang27/MVC-ZigAL
Code for the paper "Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning"
Language: Python - Size: 2.96 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 8 - Forks: 0

SamurAIGPT/AI-Influencer-Generator
Create and customize your AI influencer open-source
Language: Jupyter Notebook - Size: 50.8 KB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 123 - Forks: 37

RhythrosaLabs/loom
Loom is a Streamlit-based application designed to generate and process videos and images using multiple AI services such as Luma AI, Replicate, Stable Diffusion, DALL·E, and RunwayML. This app allows users to create custom AI-generated media, process it, and download the results seamlessly.
Language: Python - Size: 78.1 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 2

aadhi001/AutoText-text-to-image
A Flask-based REST API that converts marketing-friendly text prompts into unique, AI-generated images using OpenAI's DALL·E and DaVinci models. Includes content moderation to filter inappropriate inputs and prompt enhancement to improve image quality and style.
Language: Python - Size: 69 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

FurkanGozukara/Stable-Diffusion
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, Kaggle, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya, Midjourney, RunPod
Language: Jupyter Notebook - Size: 3.45 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 2,446 - Forks: 336

ashbuilds/payload-ai
AI Plugin is a powerful extension for the Payload CMS, integrating advanced AI capabilities to enhance content creation and management.
Language: TypeScript - Size: 82.3 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 241 - Forks: 38

Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python - Size: 4.56 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 2,058 - Forks: 179

ChaofanTao/Autoregressive-Models-in-Vision-Survey
[TMLR 2025🔥] A survey for the autoregressive models in vision.
Size: 7.74 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 630 - Forks: 19

filipecalegario/awesome-generative-ai
A curated list of Generative AI tools, works, models, and references
Size: 1.16 MB - Last synced at: 9 days ago - Pushed at: 18 days ago - Stars: 2,864 - Forks: 499

manh9011/Perchance-T2I-Desktop
.NET 8 Client for Perchance Text to Image
Language: C# - Size: 2.35 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 9 - Forks: 2

saharmor/dalle-playground
A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini)
Language: JavaScript - Size: 3.01 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 2,761 - Forks: 596

PRITHIVSAKTHIUR/Agent-Dino
Dino: The Minimalist Multipurpose Chat System
Language: Python - Size: 431 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0

jamez-bondos/awesome-gpt4o-images
Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capabilities.
Language: JavaScript - Size: 141 MB - Last synced at: 10 days ago - Pushed at: 27 days ago - Stars: 6,328 - Forks: 569

XiaomingX/awesome-qwen-prompt-insight
🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute your own prompts!
Size: 25.3 MB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 225 - Forks: 20

ddPn08/Radiata
Stable diffusion webui based on diffusers.
Language: Python - Size: 15.6 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 978 - Forks: 68

bean980310/ai-companion
AI companions including generative AI such as chatbots, image generation, text generation, and audio generation.
Language: Python - Size: 32.9 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 0

Chaoses-Ib/VisualComputing
Language: Markdown - Size: 27.1 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 8 - Forks: 1

snap-research/stable-flow
Official implementation for "Stable Flow: Vital Layers for Training-Free Image Editing" [CVPR 2025]
Language: Python - Size: 2.9 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 360 - Forks: 22

gokhaneraslan/stable-diffusion-3.5-lora-finetuning
A comprehensive, modular framework for fine-tuning Stable Diffusion 3.5 models using LoRA (Low-Rank Adaptation). Create custom AI image generators tailored to your artistic style, objects, or concepts with memory-efficient training on consumer GPUs.
Language: Python - Size: 71.3 KB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

Raxephion/AuraGen-AuraFlow-WebUI
Lightweight 6GB VRAM Gradio web app with auto-installer for running AuraFlow locally — no cloud, no clutter.
Language: Python - Size: 6.12 MB - Last synced at: 5 days ago - Pushed at: 15 days ago - Stars: 3 - Forks: 0

yczhou001/LongBench-T2I
Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation
Language: Python - Size: 15.7 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 14 - Forks: 0

cilabuniba/i-dream-my-painting
[WACV 2025] I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting
Language: Jupyter Notebook - Size: 58.7 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 12 - Forks: 0

gokhaneraslan/text_to_image_dataset_toolkit
Preparing high-quality datasets for text-to-image
Language: Python - Size: 25.4 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

JiauZhang/tracking-arxiv
微信公众号:机器感知 | Tracking the Latest Arxiv Papers
Size: 1000 Bytes - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 37 - Forks: 4

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language: HTML - Size: 12.7 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 479 - Forks: 27

doanbactam/awesome-stable-diffusion
A curated list of awesome stable diffusion resources 🌟
Size: 105 KB - Last synced at: 3 days ago - Pushed at: 26 days ago - Stars: 58 - Forks: 2

lzyhha/VisualCloze
VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into offical pipelines of diffusers.)
Language: Python - Size: 129 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 230 - Forks: 11

HJH-AILab/hjh-ai-demo
好机绘AIGC云服务示例,选择玩法,上传人脸图像完成AI绘图,AI绘图主要玩法包括创意大变身、主角梦想秀,本项目调用了好机绘AIGC云服务接口
Language: HTML - Size: 39.2 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

ArchieMeng/stable-diffusion-notebookui
Colab notebook for quick Stable Diffusion model evaluations.
Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

IBM/DiffuseKronA
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models
Language: Python - Size: 11 MB - Last synced at: 13 days ago - Pushed at: about 2 months ago - Stars: 130 - Forks: 2

awekrx/ChatGPT-MidJourney-prompt
This is a ChatGPT based prompt generation model for MidJorney. The purpose of this model is to simplify the creation of images and increase their creativity. By introducing a partial hint, ChatGPT creates a follow-up that can be used to stimulate creativity and provide new ideas.
Language: Python - Size: 11.7 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 337 - Forks: 51

jonathandinu/ai4artists
A list of AI Art courses, tools, libraries, people, and places.
Size: 661 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 355 - Forks: 25

davutbayik/ai-product-image-generator
Automatically generate commercial-quality product mockup images using OpenAI and Google Workspace integrations (Sheets + Drive).
Language: Python - Size: 30.9 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

zer0int/CLIP-fine-tune-registers-gated
Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!
Language: Python - Size: 28.4 MB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 42 - Forks: 1

Shilin-LU/MACE
[CVPR 2024] "MACE: Mass Concept Erasure in Diffusion Models" (Official Implementation)
Language: Jupyter Notebook - Size: 28.1 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 395 - Forks: 32

fmp453/erase-eval
Erasing with Precision: Evaluating Specific Concept Erasure from Text-to-Image Generative Models
Language: Python - Size: 1.18 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 0

Emotion-Director/Emotion-Director.github.io
[Anonymous NeurIPS 2025 Submission] Website of Emotion-Director: Multi-Modal Prompt for Emotion-Oriented Text-to-Image Generation
Language: Vue - Size: 25.7 MB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

ironjr/semantic-draw
Official code for the CVPR 2025 paper "SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models."
Language: Jupyter Notebook - Size: 334 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 562 - Forks: 49

TanishqaRavirala/Text-to-Image-App-Using-Stable-Diffusion
This Text to Image app is a GUI-based tool built with Tkinter and customtkinter. It uses the Stable Diffusion model from Hugging Face to generate images from text prompts. The app accepts user input, processes it, and displays the generated image. It's ideal for visualizing creative text ideas in real-time.
Language: Python - Size: 15.6 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

sail-sg/finetune-fair-diffusion
Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness
Language: Python - Size: 47.1 MB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 43 - Forks: 3

unburn/prodia.js
A simple and up to date wrapper for prodia api with all features included.
Language: TypeScript - Size: 19 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 12

byliutao/1Prompt1Story
🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
Language: Python - Size: 29.9 MB - Last synced at: 21 days ago - Pushed at: 22 days ago - Stars: 265 - Forks: 32

dngda/self-bot
Typescript - Lightweights WhatsApp bot 🤖 made to response only self message with Baileys ✨
Language: TypeScript - Size: 553 KB - Last synced at: 22 days ago - Pushed at: 23 days ago - Stars: 25 - Forks: 9

HJH-AILab/portal-web
portal-web 北京好机绘科技有限公司,作为AIGC领域的先行者,一直积极探索 如何通过科技手段,将中华传统文化和普通百姓的日常生活紧密结 合起来。 正如白居易诗云:“别有天地非人间”,北京好机绘科技有限公司通过人工智能和各种前沿技术,为用户打造了一个全新的、超越个人能力限制的文化
Language: HTML - Size: 26.1 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

IDouble/ChatGPT-Simple-Tutorial-Image-Text-Code-Generation
🖼️ A simple ChatGPT AI tutorial on how to generate images/text/code and its limitations 🤖
Language: Python - Size: 4.56 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 117 - Forks: 23

WhereIsAI/WhereIsAI
AI company, product, and tool collection.
Size: 68.4 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 30 - Forks: 4

yunqing-me/WatermarkDM
Code of the paper: A Recipe for Watermarking Diffusion Models
Language: Jupyter Notebook - Size: 24.2 MB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 147 - Forks: 9

TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Language: Python - Size: 37 MB - Last synced at: 25 days ago - Pushed at: 6 months ago - Stars: 1,603 - Forks: 134

XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Language: Jupyter Notebook - Size: 5.71 MB - Last synced at: 25 days ago - Pushed at: over 2 years ago - Stars: 7,725 - Forks: 801

nerdyrodent/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
Language: Python - Size: 31.7 MB - Last synced at: 24 days ago - Pushed at: over 2 years ago - Stars: 2,650 - Forks: 432

basiclab/MAD
MAD: Makeup All-in-One with Cross-Domain Diffusion Model
Language: Python - Size: 2.96 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 14 - Forks: 1

THUDM/CogView4
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Language: Python - Size: 24.1 MB - Last synced at: 24 days ago - Pushed at: 3 months ago - Stars: 1,043 - Forks: 76

NotCookey/Quote2Image
Quote2Image is a python library for turning text quotes into graphical images
Language: Python - Size: 603 KB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 65 - Forks: 5

bowen-upenn/ControlText
ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations
Language: Python - Size: 72.8 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 26 - Forks: 0

kuprel/min-dalle
min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch
Language: Python - Size: 46.5 MB - Last synced at: 25 days ago - Pushed at: about 2 months ago - Stars: 3,487 - Forks: 252

ooLuan/awesome-generative-ai
Multimodal generative AI resources : talking heads, STT, TTS, image & video generation, and more.
Size: 1.81 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 3 - Forks: 0

PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
Size: 3.04 MB - Last synced at: 27 days ago - Pushed at: 6 months ago - Stars: 1,042 - Forks: 28

a-r-r-o-w/stablefused 📦
StableFused is a toy library for experimenting with Diffusion Models, inspired by various sources.
Language: Python - Size: 1020 KB - Last synced at: about 23 hours ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 2

IthicalHolder/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python - Size: 4.5 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

bytedance/ComfyUI_InfiniteYou
🔥 Official ComfyUI native node for InfiniteYou with FLUX
Language: Python - Size: 1.57 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 138 - Forks: 26

SamurAIGPT/AI-Faceless-Video-Generator
Generate a video script, voice and a talking face completely with AI
Language: Jupyter Notebook - Size: 16.6 MB - Last synced at: 30 days ago - Pushed at: 4 months ago - Stars: 305 - Forks: 49

omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
Language: Python - Size: 27.4 MB - Last synced at: 25 days ago - Pushed at: 5 months ago - Stars: 1,657 - Forks: 139

woctezuma/stable-diffusion-colab
Colab notebook for Stable Diffusion Hyper-SDXL.
Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 325 - Forks: 81

DenOfEquity/forge_space_NitroFusion
NitroFusion 1step text to image as a Space for Forge2
Language: Python - Size: 1.19 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

amirhossein-jahangiri/Ai-Image-Generator
A Flutter app for generating AI images from text prompts. Includes prompt suggestions and the ability to save generated images.
Language: Dart - Size: 2.15 MB - Last synced at: 16 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

fofr/cog-face-to-many
Turn any face into a video game character, pixel art, claymation, 3D or toy
Language: Python - Size: 32.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 1,338 - Forks: 204

Yushi-Hu/tifa
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
Language: Python - Size: 6.08 MB - Last synced at: 28 days ago - Pushed at: about 1 year ago - Stars: 162 - Forks: 11

ai-forever/ru-dalle
Generate images from texts. In Russian
Language: Jupyter Notebook - Size: 26.9 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1,646 - Forks: 245

ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
Language: Jupyter Notebook - Size: 37.3 MB - Last synced at: 30 days ago - Pushed at: about 1 year ago - Stars: 2,798 - Forks: 310

SamurAIGPT/Text-To-Video-API
Text to Video API generation documentation
Size: 7.81 KB - Last synced at: 17 days ago - Pushed at: 10 months ago - Stars: 19 - Forks: 4

lucidrains/big-sleep
A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun
Language: Python - Size: 6.89 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 2,571 - Forks: 306

lucidrains/muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Language: Python - Size: 285 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 894 - Forks: 83

OSU-NLP-Group/MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Language: Python - Size: 112 MB - Last synced at: 30 days ago - Pushed at: 4 months ago - Stars: 356 - Forks: 14

SamurAIGPT/AI-Youtube-Shorts-Generator
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
Language: Python - Size: 99 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 2,219 - Forks: 314
