An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: image-generation

omeregev/click2mask

[AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper.

Language: Python - Size: 62.8 MB - Last synced at: about 2 hours ago - Pushed at: about 2 hours ago - Stars: 17 - Forks: 2

saadamir1/vae-gan-comparison

Comprehensive comparison of VAE vs GAN architectures for image generation on CIFAR-10 dataset with quantitative evaluation metrics

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 0 - Forks: 0

TechyCSR/AdvAITelegramBot

Telegram Advance AI ChatBot: GPT-4.1, Qwen-3, DeepSeek-R1, Dall-E-3, Flux, Flux-Pro, Dall-E Model, OCR and Google Voice2Text.

Language: Python - Size: 8.1 MB - Last synced at: about 8 hours ago - Pushed at: about 8 hours ago - Stars: 10 - Forks: 2

CUHK-AIM-Group/Polyp-Gen

[ICRA 2025] Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion

Language: Python - Size: 1.94 MB - Last synced at: about 8 hours ago - Pushed at: about 9 hours ago - Stars: 20 - Forks: 2

mzattera/predictive-powers

A Java library to (easily) create GenAI-powered autonomous agents

Language: Java - Size: 32.4 MB - Last synced at: about 8 hours ago - Pushed at: about 9 hours ago - Stars: 10 - Forks: 0

jiuntian/interactdiffusion

[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".

Language: Python - Size: 14.8 MB - Last synced at: about 8 hours ago - Pushed at: about 10 hours ago - Stars: 119 - Forks: 9

Huanst/img_generator-vue3

文字生成图片,硅基流动 API,用的是快手的 kolors模型

Language: Vue - Size: 134 KB - Last synced at: about 10 hours ago - Pushed at: about 11 hours ago - Stars: 0 - Forks: 1

BaronAlviar/stable-diffusion-3.5-lora-finetuning

Language: Python - Size: 41 KB - Last synced at: about 14 hours ago - Pushed at: about 14 hours ago - Stars: 0 - Forks: 0

Circuit-Overtime/jackeyBot

A free text2image discord bot service on discord, implementing discord.js

Language: CSS - Size: 68.4 KB - Last synced at: about 16 hours ago - Pushed at: about 17 hours ago - Stars: 3 - Forks: 0

screenshothis/screenshothis

The screenshot API for modern applications. Automate web captures and generate visuals instantly.

Language: TypeScript - Size: 11 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 28 - Forks: 5

zanvari/stable-diffusion-lab

Hands-on tutorials for generating and editing images using Stable Diffusion with Hugging Face Diffusers — includes text-to-image, inpainting, and image-to-image pipelines.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 21 hours ago - Pushed at: about 21 hours ago - Stars: 0 - Forks: 0

keshik6/grafting

Exploring Diffusion Transformer Designs via Grafting

Language: Jupyter Notebook - Size: 2.78 MB - Last synced at: about 22 hours ago - Pushed at: about 23 hours ago - Stars: 18 - Forks: 1

3587jjh/LSRNA

Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models (CVPR 2025)

Language: Python - Size: 49.9 MB - Last synced at: about 22 hours ago - Pushed at: about 23 hours ago - Stars: 23 - Forks: 0

invoke-ai/InvokeAI

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.

Language: TypeScript - Size: 329 MB - Last synced at: about 24 hours ago - Pushed at: 1 day ago - Stars: 25,314 - Forks: 2,586

mudler/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

Language: Go - Size: 20 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 33,253 - Forks: 2,552

hammad2006sid/Komiko

Komiko - Create comics, manhwa, manga, webtoon, and anime with AI - AI Comic Factory

Size: 4.88 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

mhohamad/Deep-Research-AI-Agent

Build a powerful Deep Research AI agent like Gemini or ChatGPT. Using Next.js, Vercel AI SDK, and Exa Search API, An intelligent system that generates follow-up questions, crafts optimal search queries, and compiles comprehensive research reports.

Language: TypeScript - Size: 34.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 9 - Forks: 4

lunaniro/Neural.Image_Genv3.0

Neural.Image_Genv3.0 is web app. hacker-inspired interface. Powered by reverse engineered API!!

Size: 1000 Bytes - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 6 - Forks: 0

GraphiteEditor/Graphite

2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.

Language: Rust - Size: 37.8 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 12,964 - Forks: 613

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Language: Python - Size: 68.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 29,382 - Forks: 6,032

valle123321123/AlphaChat

AlphaChat is a chatbot with a GUI and console interface for real-time conversations, Q&A management, and AI-driven features like voice interaction, image generation, and PDF summarization.

Size: 2.93 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 1

MiniMax-AI/MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

Language: Python - Size: 113 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 570 - Forks: 62

Hedlen/awesome-segment-anything

Tracking and collecting papers/projects/others related to Segment Anything.

Size: 11.7 MB - Last synced at: about 9 hours ago - Pushed at: 3 months ago - Stars: 1,623 - Forks: 133

jhj0517/finetuning-notebooks

Language: Jupyter Notebook - Size: 148 KB - Last synced at: about 22 hours ago - Pushed at: 3 months ago - Stars: 69 - Forks: 6

egeyavuzcan/diffusion-flow-models-research

A comprehensive collection of research papers and resources on diffusion&flow based models, systematically organized by application and architecture. It highlights cutting-edge advances in flow-guided diffusion techniques for image, video, and multimodal generation.

Size: 4.88 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

Tiefflieger06/comfyui-simple-frontend

A basic web frontend for ComfyUI with the goal of being as simple to use as possible

Language: HTML - Size: 164 KB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

davidelobba/TEMU-VTOFF

Official implementation of the paper "Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals"

Language: Python - Size: 1.06 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 9 - Forks: 3

Revanrie/Variational-Autoencoder-on-FashionMNIST

This repository features a Variational Autoencoder (VAE) built with PyTorch, designed to compress and generate FashionMNIST images. Explore the model's latent space and gain insights into its workings while leveraging the power of deep learning! 🐙💻

Language: Jupyter Notebook - Size: 2.08 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

tryAGI/Recraft

C# SDK based on official Recraft OpenAPI specification

Language: C# - Size: 341 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

jamilll999/Solid_Color_PNG_Generators

# Solid Color PNG GeneratorsThis Python script generates solid color PNG images, allowing users to customize dimensions and colors. It's open-source and easy to modify, making it a useful tool for learning about image generation. 🖼️✨

Language: Python - Size: 10.7 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

suzukimain/auto_diffusers

diffusers with search engine

Language: Python - Size: 924 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 10 - Forks: 1

tryAGI/Leonardo

Generated C# SDK based on Leonardo AI OpenAPI specification

Language: C# - Size: 771 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 3

tryAGI/Ideogram

Generated C# SDK based on official Ideogram OpenAPI specification

Language: C# - Size: 534 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

pinkpixel-dev/MCPollinations

A Model Context Protocol (MCP) server that enables AI assistants to generate images, text, and audio through the Pollinations APIs. Supports customizable parameters, image saving, and multiple model options.

Language: JavaScript - Size: 42 KB - Last synced at: 1 day ago - Pushed at: 12 days ago - Stars: 21 - Forks: 4

ApexGen-X/MergeVQ

[CVPR] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization

Language: Python - Size: 9.75 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 28 - Forks: 3

mcmonkeyprojects/SwarmUI

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

Language: C# - Size: 30.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,673 - Forks: 241

ALEEEHU/World-Simulator

Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" and Awesome-Text2X-Resources. Watch this repository for the latest updates! 🔥

Size: 18.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 254 - Forks: 14

VectorSpaceLab/OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Language: Jupyter Notebook - Size: 399 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 4,125 - Forks: 356

UCSC-VLAA/story-adapter

A Training-free Iterative Framework for Long Story Visualization

Language: Python - Size: 280 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 899 - Forks: 128

LUKMASTER12-12/fashion-ai-studio

AI fashion model photoshoot SaaS web app.

Language: TypeScript - Size: 1.04 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 1

Phile779/tech-explorer-hub

Create the best technical resources for developers to build a strong foundation for professional growth.

Language: HTML - Size: 57.6 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

JoePenna/Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 3,218 - Forks: 553

BetaKors/worley-noise-rs

Simple worley noise implementation in Rust.

Language: Rust - Size: 5.86 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Lunashia/Solid_Color_PNG_Generators

Language: Python - Size: 8.79 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

sayakpaul/caption-upsampling

This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.

Language: Python - Size: 45.9 KB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 152 - Forks: 2

ashbuilds/payload-ai

AI Plugin is a powerful extension for the Payload CMS, integrating advanced AI capabilities to enhance content creation and management.

Language: TypeScript - Size: 82.3 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 241 - Forks: 38

The-Martyr/Awesome-Multimodal-Reasoning

Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models

Size: 174 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 29 - Forks: 0

XBastille/AIComicX

AIComicX is an AI-powered platform that transforms user-uploaded stories into comics, allowing users to either post their own stories or generate new ones using AI. With multiple comic styles and intelligent storytelling features, it revolutionizes comic creation.

Language: Python - Size: 9.47 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0

taylordotfish/plumage

A colorful picture generator

Language: Rust - Size: 646 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 1

KanekilTheAogiri/awesome-gpt4o-images

Awesome curated collection of GPT-4o images & prompts. Explore diverse AI-generated art styles (Ghibli, 3D, etc.) from OpenAI's latest model.

Size: 20.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 1

AnotherWorkingNerd/LatentEye

LatentEye - Browse AI generated images and reveal the hidden metadata in them. With advanced features for viewing, metadata handling, and clipboard operations. Great with ComfyUI or Stable Diffusion tools

Language: Python - Size: 16 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

MartynasJakutis/ChroMS_GUI

Graphical user interface (GUI) for HPLC-MS data analysis. Enables HPLC and MS data file processing and the visualization of the data. Used for data generated by LabSolutions data analysis software.

Language: Tcl - Size: 432 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

cuixing158/Awesome-CV-MasterHub

:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works

Size: 37.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 396 - Forks: 27

SverreNystad/gpt-dungeon-master

Welcome to the GPT Dungeon Master repository! This project harnesses the power of GPT models to create a dynamic and responsive Dungeon Master (DM) for tabletop role-playing games (RPGs). Whether you're a seasoned player looking for a quick rule reference or a group in need of an AI-driven DM for your next adventure, the GPT Dungeon Master is here

Language: Python - Size: 18.6 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 42 - Forks: 4

huanngzh/MV-Adapter

[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Texture Synthesis] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"

Language: Python - Size: 17.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,025 - Forks: 57

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

Size: 7.74 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 630 - Forks: 19

MiniMax-AI/MiniMax-MCP-JS

Official MiniMax Model Context Protocol (MCP) JavaScript implementation that provides seamless integration with MiniMax's powerful AI capabilities including image generation, video generation, text-to-speech, and voice cloning APIs.

Language: TypeScript - Size: 219 KB - Last synced at: 3 days ago - Pushed at: 20 days ago - Stars: 46 - Forks: 10

karanIPS/claude-deep-research

Claude Deep Research config for Claude Code.

Size: 10.7 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 6 - Forks: 0

sinhajiya/image-through-waves

Demonstration of how each image is simply a combination of different waves using MatLab.

Language: MATLAB - Size: 30.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

DarrenPan/Awesome-NeurIPS2023-Low-Level-Vision

A Collection of Papers and Codes in NeurIPS2022/2021 related to Low-Level Vision

Size: 57.6 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 20 - Forks: 0

vuongkiranocode2004/ai-blog-poster

This repo provides a RESTful API for automatically generating SEO-optimized blog posts and AI-generated hero images using OpenAI models. It supports metadata configuration, frontmatter customization, and structured local file output with markdown formatting.

Language: Python - Size: 39.1 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

caiyuanhao1998/PNGAN

"Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training" (NeurIPS 2021)

Language: Python - Size: 44.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 142 - Forks: 21

lapismyt/pyAIHorde

Simple library for interacting with AI Horde API.

Language: Python - Size: 45.9 KB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 5 - Forks: 0

elegantapp/pwa-asset-generator

Automates PWA asset generation and image declaration. Automatically generates icon and splash screen images, favicons and mstile images. Updates manifest.json and index.html files with the generated images according to Web App Manifest specs and Apple Human Interface guidelines.

Language: TypeScript - Size: 39.5 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 2,852 - Forks: 145

nikhom14/Real-Time-Chatbot

# Real-Time Chatbot with Emotion-Based ResponsesThis project features a real-time chatbot that uses rule-based logic and emotion triggers for meaningful conversations. Users can interact through quick buttons or type messages, enhancing their experience. 🐙✨

Language: CSS - Size: 35.2 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

majestikmagik/Automation_Create_post_Instagram_Facebook

This Make.com scenario automates social media posts by pulling prompts from Google Sheets to generate AI images, then automatically posting those images to Instagram and Facebook.

Size: 73.2 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

HanaokaYuzu/Gemini-API

✨ Elegant async Python API for Google Gemini web app

Language: Python - Size: 334 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 762 - Forks: 108

Amitgajare2/imgenius

Discover, copy, and create. Your ultimate hub for free AI image prompts.

Language: JavaScript - Size: 2.41 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

kane50613/takumi

High-performance Rust library for generating images with CSS Flexbox-like layouts.

Language: Rust - Size: 2.28 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

Capsize-Games/airunner

Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows

Language: Python - Size: 27.9 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 1,195 - Forks: 95

FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python - Size: 5.35 MB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 1,774 - Forks: 78

intelligentnode/IntelliServer

AI models as scalable microservices, enabling evaluation of LLMs and offering end-to-end functions such as chatbot, semantic search, image generation and beyond.

Language: JavaScript - Size: 2.89 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 29 - Forks: 3

meterhub/meter-viewer

View meter dataset and process them.

Language: Python - Size: 21.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

aws-samples/generative-ai-use-cases

Application implementation with business use cases for safely utilizing generative AI in business operations

Language: TypeScript - Size: 141 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,060 - Forks: 271

Kobaayyy/Awesome-ICCV2021-Low-Level-Vision

A Collection of Papers and Codes for ICCV2021 Low Level Vision and Image Generation

Size: 173 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 215 - Forks: 97

mikeesto/gif4o

Create animated GIFs from GPT-4o-generated image grids

Language: HTML - Size: 6.84 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

khoj-ai/khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

Language: Python - Size: 110 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 30,297 - Forks: 1,711

YoannDev90/AlphaLLM

A Discord Bot using LLMs

Language: Python - Size: 340 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 5 - Forks: 1

TashonBraganca/ImaGen

Local LCM Image Generation : A lightweight, open-source, free image generator that anyone can run locally to create stunning AI-generated digital art.

Language: Python - Size: 1.68 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

KnpLabs/snappy

PHP library allowing thumbnail, snapshot or PDF generation from a url or a html page. Wrapper for wkhtmltopdf/wkhtmltoimage

Language: PHP - Size: 569 KB - Last synced at: 3 days ago - Pushed at: 14 days ago - Stars: 4,447 - Forks: 438

Thelyoncrypt/openai-image-1-ideogram-mcp

OpenAI Image 1 Ideogram MCP - Enterprise-grade Model Context Protocol server for Ideogram v3.0 API with Style References, Rendering Speed Control, and Enhanced Features

Language: TypeScript - Size: 1.94 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

HaoyuanYang-2023/ImagineFSL

Official implementation of "ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning" [CVPR 2025]

Language: Python - Size: 8.95 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 4 - Forks: 0

6Morpheus6/Fooocus-API

Fooocus powered by FastAPI (Supported on all OS and GPU's)

Language: JavaScript - Size: 21.5 KB - Last synced at: 2 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

yohasebe/openai-chat-api-workflow

🎩 An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT models 🤖💬 It also allows image generation/editing/understanding 🖼️, speech-to-text conversion 🎤, and text-to-speech synthesis 🔈

Size: 113 MB - Last synced at: 2 days ago - Pushed at: 8 days ago - Stars: 315 - Forks: 9

jamez-bondos/awesome-gpt4o-images

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capabilities.

Language: JavaScript - Size: 141 MB - Last synced at: 7 days ago - Pushed at: 24 days ago - Stars: 6,328 - Forks: 569

sethbang/venice-ai

🐍 Python client for Venice.ai. Seamlessly integrate powerful GenAI: chat 💬, image gen 🖼️, audio TTS 🔊, embeddings & more. Supports sync/async, streaming & robust error handling.

Language: Python - Size: 875 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

Echsecutor/gen_ai_container

Container running Invoke AI webservice for AI image generation

Language: Shell - Size: 1.36 MB - Last synced at: 4 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Kobaayyy/Awesome-CVPR2025-CVPR2024-ECCV2024-AIGC

A Collection of Papers and Codes for CVPR2025/CVPR2024/ECCV2024 AIGC

Size: 351 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 556 - Forks: 14

techthoughts2/pwshBedrock

pwshBedrock is a PowerShell module designed to simplify interaction with Amazon Bedrock foundation models. It enables users to send messages, retrieve responses, manage conversation contexts, generate images and videos, and estimate costs. Supporting both InvokeModel and Converse API, it streamlines AI integration in PowerShell workflows.

Language: PowerShell - Size: 5.37 MB - Last synced at: 4 days ago - Pushed at: 8 days ago - Stars: 7 - Forks: 1

FotographerAI/ZenCtrl

In-context subject-driven image generation while preserving foreground fidelity

Language: Python - Size: 5.58 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 278 - Forks: 23

neggles/animatediff-cli

a CLI utility/library for AnimateDiff stable diffusion generation

Language: Python - Size: 123 KB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 261 - Forks: 133

Zyriix/D2O

Official implemention for Diffusion Models Are Innate One-Step Generators

Language: Python - Size: 125 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 22 - Forks: 1

TIGER-AI-Lab/VIEScore

Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024)

Language: Python - Size: 21.6 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 46 - Forks: 1

mokira3d48/CVA_NET

This repository contents different scripts for different models to allow you to train a model of images classification.

Language: Python - Size: 163 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

zachdwight/image-gen-huggingface-example

AI Image Generation with HuggingFace Diffusers

Language: Python - Size: 20.5 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

ai-dock/onetrainer

OneTrainer docker images for use in GPU cloud and local environments. Includes AI-Dock KDE Plasma desktop with GPU acceleration and audio for authentication and improved user experience.

Language: Shell - Size: 661 KB - Last synced at: about 20 hours ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 3

MyHoldFast/uebekbot

Best tg bot in the world

Language: Python - Size: 330 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 7 - Forks: 0

zcemycl/Matlab-GAN

MATLAB implementations of Generative Adversarial Networks -- from GAN to Pixel2Pixel, CycleGAN

Language: MATLAB - Size: 124 MB - Last synced at: 4 days ago - Pushed at: about 2 years ago - Stars: 199 - Forks: 85

usetrmnl/byos_node_lite

Image server for TRMNL built with Node.js, JSX and HTML

Language: TypeScript - Size: 1.2 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 3 - Forks: 0

Vahab95/comfyui_HiDream-Sampler

ComfyUI Wrapper for HiDream

Language: Python - Size: 7.43 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Related Keywords
image-generation 1,846 ai 356 deep-learning 285 stable-diffusion 272 pytorch 241 python 235 gan 186 diffusion-models 164 computer-vision 149 generative-adversarial-network 141 machine-learning 138 generative-ai 132 openai 130 text-to-image 124 image-processing 123 artificial-intelligence 118 diffusion 82 image-manipulation 73 image 71 llm 68 image-generator 65 tensorflow 63 chatgpt 62 video-generation 61 ai-art 61 gans 58 dall-e 58 text-generation 57 generative-art 57 api 51 generative-model 51 text2image 51 chatbot 50 openai-api 48 image-editing 47 python3 46 javascript 44 huggingface 42 txt2img 38 react 38 diffusers 36 nodejs 35 typescript 34 image-synthesis 33 nextjs 32 gpt 32 cyclegan 31 generative-models 31 dalle2 30 vae 29 dcgan 29 gradio 29 img2img 28 telegram-bot 28 stable-diffusion-webui 28 discord-bot 28 computer-graphics 28 text-to-speech 27 image-to-image-translation 27 cnn 27 variational-autoencoder 27 reactjs 27 flux 26 pillow 26 image-classification 25 open-source 25 docker 24 image2image 24 prompt-engineering 24 convolutional-neural-networks 24 keras 24 neural-networks 23 style-transfer 23 deep-neural-networks 23 comfyui 22 image-translation 22 sdxl 22 midjourney 22 latent-diffusion 22 neural-network 21 pix2pix 21 torch 20 transformer 20 transformers 20 bot 20 image-generator-using-openai-api 20 image-generation-ai 20 flask 20 gpt-4 19 discord 19 large-language-models 19 inpainting 19 nlp 19 streamlit 19 art 18 gemini 18 automation 17 dalle 17 lora 17 text-to-image-generation 17