GitHub topics: image-generation
omeregev/click2mask
[AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper.
Language: Python - Size: 62.8 MB - Last synced at: about 2 hours ago - Pushed at: about 2 hours ago - Stars: 17 - Forks: 2

saadamir1/vae-gan-comparison
Comprehensive comparison of VAE vs GAN architectures for image generation on CIFAR-10 dataset with quantitative evaluation metrics
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 0 - Forks: 0

TechyCSR/AdvAITelegramBot
Telegram Advance AI ChatBot: GPT-4.1, Qwen-3, DeepSeek-R1, Dall-E-3, Flux, Flux-Pro, Dall-E Model, OCR and Google Voice2Text.
Language: Python - Size: 8.1 MB - Last synced at: about 8 hours ago - Pushed at: about 8 hours ago - Stars: 10 - Forks: 2

CUHK-AIM-Group/Polyp-Gen
[ICRA 2025] Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
Language: Python - Size: 1.94 MB - Last synced at: about 8 hours ago - Pushed at: about 9 hours ago - Stars: 20 - Forks: 2

mzattera/predictive-powers
A Java library to (easily) create GenAI-powered autonomous agents
Language: Java - Size: 32.4 MB - Last synced at: about 8 hours ago - Pushed at: about 9 hours ago - Stars: 10 - Forks: 0

jiuntian/interactdiffusion
[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".
Language: Python - Size: 14.8 MB - Last synced at: about 8 hours ago - Pushed at: about 10 hours ago - Stars: 119 - Forks: 9

Huanst/img_generator-vue3
文字生成图片,硅基流动 API,用的是快手的 kolors模型
Language: Vue - Size: 134 KB - Last synced at: about 10 hours ago - Pushed at: about 11 hours ago - Stars: 0 - Forks: 1

BaronAlviar/stable-diffusion-3.5-lora-finetuning
Language: Python - Size: 41 KB - Last synced at: about 14 hours ago - Pushed at: about 14 hours ago - Stars: 0 - Forks: 0

Circuit-Overtime/jackeyBot
A free text2image discord bot service on discord, implementing discord.js
Language: CSS - Size: 68.4 KB - Last synced at: about 16 hours ago - Pushed at: about 17 hours ago - Stars: 3 - Forks: 0

screenshothis/screenshothis
The screenshot API for modern applications. Automate web captures and generate visuals instantly.
Language: TypeScript - Size: 11 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 28 - Forks: 5

zanvari/stable-diffusion-lab
Hands-on tutorials for generating and editing images using Stable Diffusion with Hugging Face Diffusers — includes text-to-image, inpainting, and image-to-image pipelines.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 21 hours ago - Pushed at: about 21 hours ago - Stars: 0 - Forks: 0

keshik6/grafting
Exploring Diffusion Transformer Designs via Grafting
Language: Jupyter Notebook - Size: 2.78 MB - Last synced at: about 22 hours ago - Pushed at: about 23 hours ago - Stars: 18 - Forks: 1

3587jjh/LSRNA
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models (CVPR 2025)
Language: Python - Size: 49.9 MB - Last synced at: about 22 hours ago - Pushed at: about 23 hours ago - Stars: 23 - Forks: 0

invoke-ai/InvokeAI
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
Language: TypeScript - Size: 329 MB - Last synced at: about 24 hours ago - Pushed at: 1 day ago - Stars: 25,314 - Forks: 2,586

mudler/LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
Language: Go - Size: 20 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 33,253 - Forks: 2,552

hammad2006sid/Komiko
Komiko - Create comics, manhwa, manga, webtoon, and anime with AI - AI Comic Factory
Size: 4.88 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

mhohamad/Deep-Research-AI-Agent
Build a powerful Deep Research AI agent like Gemini or ChatGPT. Using Next.js, Vercel AI SDK, and Exa Search API, An intelligent system that generates follow-up questions, crafts optimal search queries, and compiles comprehensive research reports.
Language: TypeScript - Size: 34.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 9 - Forks: 4

lunaniro/Neural.Image_Genv3.0
Neural.Image_Genv3.0 is web app. hacker-inspired interface. Powered by reverse engineered API!!
Size: 1000 Bytes - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 6 - Forks: 0

GraphiteEditor/Graphite
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Language: Rust - Size: 37.8 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 12,964 - Forks: 613

huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language: Python - Size: 68.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 29,382 - Forks: 6,032

valle123321123/AlphaChat
AlphaChat is a chatbot with a GUI and console interface for real-time conversations, Q&A management, and AI-driven features like voice interaction, image generation, and PDF summarization.
Size: 2.93 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 1

MiniMax-AI/MiniMax-MCP
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
Language: Python - Size: 113 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 570 - Forks: 62

Hedlen/awesome-segment-anything
Tracking and collecting papers/projects/others related to Segment Anything.
Size: 11.7 MB - Last synced at: about 9 hours ago - Pushed at: 3 months ago - Stars: 1,623 - Forks: 133

jhj0517/finetuning-notebooks
Language: Jupyter Notebook - Size: 148 KB - Last synced at: about 22 hours ago - Pushed at: 3 months ago - Stars: 69 - Forks: 6

egeyavuzcan/diffusion-flow-models-research
A comprehensive collection of research papers and resources on diffusion&flow based models, systematically organized by application and architecture. It highlights cutting-edge advances in flow-guided diffusion techniques for image, video, and multimodal generation.
Size: 4.88 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

Tiefflieger06/comfyui-simple-frontend
A basic web frontend for ComfyUI with the goal of being as simple to use as possible
Language: HTML - Size: 164 KB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

davidelobba/TEMU-VTOFF
Official implementation of the paper "Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals"
Language: Python - Size: 1.06 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 9 - Forks: 3

Revanrie/Variational-Autoencoder-on-FashionMNIST
This repository features a Variational Autoencoder (VAE) built with PyTorch, designed to compress and generate FashionMNIST images. Explore the model's latent space and gain insights into its workings while leveraging the power of deep learning! 🐙💻
Language: Jupyter Notebook - Size: 2.08 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

tryAGI/Recraft
C# SDK based on official Recraft OpenAPI specification
Language: C# - Size: 341 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

jamilll999/Solid_Color_PNG_Generators
# Solid Color PNG GeneratorsThis Python script generates solid color PNG images, allowing users to customize dimensions and colors. It's open-source and easy to modify, making it a useful tool for learning about image generation. 🖼️✨
Language: Python - Size: 10.7 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

suzukimain/auto_diffusers
diffusers with search engine
Language: Python - Size: 924 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 10 - Forks: 1

tryAGI/Leonardo
Generated C# SDK based on Leonardo AI OpenAPI specification
Language: C# - Size: 771 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 3

tryAGI/Ideogram
Generated C# SDK based on official Ideogram OpenAPI specification
Language: C# - Size: 534 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

pinkpixel-dev/MCPollinations
A Model Context Protocol (MCP) server that enables AI assistants to generate images, text, and audio through the Pollinations APIs. Supports customizable parameters, image saving, and multiple model options.
Language: JavaScript - Size: 42 KB - Last synced at: 1 day ago - Pushed at: 12 days ago - Stars: 21 - Forks: 4

ApexGen-X/MergeVQ
[CVPR] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization
Language: Python - Size: 9.75 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 28 - Forks: 3

mcmonkeyprojects/SwarmUI
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Language: C# - Size: 30.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,673 - Forks: 241

ALEEEHU/World-Simulator
Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" and Awesome-Text2X-Resources. Watch this repository for the latest updates! 🔥
Size: 18.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 254 - Forks: 14

VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Language: Jupyter Notebook - Size: 399 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 4,125 - Forks: 356

UCSC-VLAA/story-adapter
A Training-free Iterative Framework for Long Story Visualization
Language: Python - Size: 280 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 899 - Forks: 128

LUKMASTER12-12/fashion-ai-studio
AI fashion model photoshoot SaaS web app.
Language: TypeScript - Size: 1.04 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 1

Phile779/tech-explorer-hub
Create the best technical resources for developers to build a strong foundation for professional growth.
Language: HTML - Size: 57.6 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

JoePenna/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.
Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 3,218 - Forks: 553

BetaKors/worley-noise-rs
Simple worley noise implementation in Rust.
Language: Rust - Size: 5.86 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Lunashia/Solid_Color_PNG_Generators
Language: Python - Size: 8.79 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

sayakpaul/caption-upsampling
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
Language: Python - Size: 45.9 KB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 152 - Forks: 2

ashbuilds/payload-ai
AI Plugin is a powerful extension for the Payload CMS, integrating advanced AI capabilities to enhance content creation and management.
Language: TypeScript - Size: 82.3 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 241 - Forks: 38

The-Martyr/Awesome-Multimodal-Reasoning
Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models
Size: 174 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 29 - Forks: 0

XBastille/AIComicX
AIComicX is an AI-powered platform that transforms user-uploaded stories into comics, allowing users to either post their own stories or generate new ones using AI. With multiple comic styles and intelligent storytelling features, it revolutionizes comic creation.
Language: Python - Size: 9.47 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0

taylordotfish/plumage
A colorful picture generator
Language: Rust - Size: 646 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 1

KanekilTheAogiri/awesome-gpt4o-images
Awesome curated collection of GPT-4o images & prompts. Explore diverse AI-generated art styles (Ghibli, 3D, etc.) from OpenAI's latest model.
Size: 20.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 1

AnotherWorkingNerd/LatentEye
LatentEye - Browse AI generated images and reveal the hidden metadata in them. With advanced features for viewing, metadata handling, and clipboard operations. Great with ComfyUI or Stable Diffusion tools
Language: Python - Size: 16 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

MartynasJakutis/ChroMS_GUI
Graphical user interface (GUI) for HPLC-MS data analysis. Enables HPLC and MS data file processing and the visualization of the data. Used for data generated by LabSolutions data analysis software.
Language: Tcl - Size: 432 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

cuixing158/Awesome-CV-MasterHub
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
Size: 37.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 396 - Forks: 27

SverreNystad/gpt-dungeon-master
Welcome to the GPT Dungeon Master repository! This project harnesses the power of GPT models to create a dynamic and responsive Dungeon Master (DM) for tabletop role-playing games (RPGs). Whether you're a seasoned player looking for a quick rule reference or a group in need of an AI-driven DM for your next adventure, the GPT Dungeon Master is here
Language: Python - Size: 18.6 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 42 - Forks: 4

huanngzh/MV-Adapter
[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Texture Synthesis] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"
Language: Python - Size: 17.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,025 - Forks: 57

ChaofanTao/Autoregressive-Models-in-Vision-Survey
[TMLR 2025🔥] A survey for the autoregressive models in vision.
Size: 7.74 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 630 - Forks: 19

MiniMax-AI/MiniMax-MCP-JS
Official MiniMax Model Context Protocol (MCP) JavaScript implementation that provides seamless integration with MiniMax's powerful AI capabilities including image generation, video generation, text-to-speech, and voice cloning APIs.
Language: TypeScript - Size: 219 KB - Last synced at: 3 days ago - Pushed at: 20 days ago - Stars: 46 - Forks: 10

karanIPS/claude-deep-research
Claude Deep Research config for Claude Code.
Size: 10.7 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 6 - Forks: 0

sinhajiya/image-through-waves
Demonstration of how each image is simply a combination of different waves using MatLab.
Language: MATLAB - Size: 30.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

DarrenPan/Awesome-NeurIPS2023-Low-Level-Vision
A Collection of Papers and Codes in NeurIPS2022/2021 related to Low-Level Vision
Size: 57.6 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 20 - Forks: 0

vuongkiranocode2004/ai-blog-poster
This repo provides a RESTful API for automatically generating SEO-optimized blog posts and AI-generated hero images using OpenAI models. It supports metadata configuration, frontmatter customization, and structured local file output with markdown formatting.
Language: Python - Size: 39.1 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

caiyuanhao1998/PNGAN
"Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training" (NeurIPS 2021)
Language: Python - Size: 44.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 142 - Forks: 21

lapismyt/pyAIHorde
Simple library for interacting with AI Horde API.
Language: Python - Size: 45.9 KB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 5 - Forks: 0

elegantapp/pwa-asset-generator
Automates PWA asset generation and image declaration. Automatically generates icon and splash screen images, favicons and mstile images. Updates manifest.json and index.html files with the generated images according to Web App Manifest specs and Apple Human Interface guidelines.
Language: TypeScript - Size: 39.5 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 2,852 - Forks: 145

nikhom14/Real-Time-Chatbot
# Real-Time Chatbot with Emotion-Based ResponsesThis project features a real-time chatbot that uses rule-based logic and emotion triggers for meaningful conversations. Users can interact through quick buttons or type messages, enhancing their experience. 🐙✨
Language: CSS - Size: 35.2 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

majestikmagik/Automation_Create_post_Instagram_Facebook
This Make.com scenario automates social media posts by pulling prompts from Google Sheets to generate AI images, then automatically posting those images to Instagram and Facebook.
Size: 73.2 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

HanaokaYuzu/Gemini-API
✨ Elegant async Python API for Google Gemini web app
Language: Python - Size: 334 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 762 - Forks: 108

Amitgajare2/imgenius
Discover, copy, and create. Your ultimate hub for free AI image prompts.
Language: JavaScript - Size: 2.41 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

kane50613/takumi
High-performance Rust library for generating images with CSS Flexbox-like layouts.
Language: Rust - Size: 2.28 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

Capsize-Games/airunner
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
Language: Python - Size: 27.9 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 1,195 - Forks: 95

FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language: Python - Size: 5.35 MB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 1,774 - Forks: 78

intelligentnode/IntelliServer
AI models as scalable microservices, enabling evaluation of LLMs and offering end-to-end functions such as chatbot, semantic search, image generation and beyond.
Language: JavaScript - Size: 2.89 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 29 - Forks: 3

meterhub/meter-viewer
View meter dataset and process them.
Language: Python - Size: 21.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

aws-samples/generative-ai-use-cases
Application implementation with business use cases for safely utilizing generative AI in business operations
Language: TypeScript - Size: 141 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,060 - Forks: 271

Kobaayyy/Awesome-ICCV2021-Low-Level-Vision
A Collection of Papers and Codes for ICCV2021 Low Level Vision and Image Generation
Size: 173 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 215 - Forks: 97

mikeesto/gif4o
Create animated GIFs from GPT-4o-generated image grids
Language: HTML - Size: 6.84 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

khoj-ai/khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
Language: Python - Size: 110 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 30,297 - Forks: 1,711

YoannDev90/AlphaLLM
A Discord Bot using LLMs
Language: Python - Size: 340 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 5 - Forks: 1

TashonBraganca/ImaGen
Local LCM Image Generation : A lightweight, open-source, free image generator that anyone can run locally to create stunning AI-generated digital art.
Language: Python - Size: 1.68 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

KnpLabs/snappy
PHP library allowing thumbnail, snapshot or PDF generation from a url or a html page. Wrapper for wkhtmltopdf/wkhtmltoimage
Language: PHP - Size: 569 KB - Last synced at: 3 days ago - Pushed at: 14 days ago - Stars: 4,447 - Forks: 438

Thelyoncrypt/openai-image-1-ideogram-mcp
OpenAI Image 1 Ideogram MCP - Enterprise-grade Model Context Protocol server for Ideogram v3.0 API with Style References, Rendering Speed Control, and Enhanced Features
Language: TypeScript - Size: 1.94 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

HaoyuanYang-2023/ImagineFSL
Official implementation of "ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning" [CVPR 2025]
Language: Python - Size: 8.95 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 4 - Forks: 0

6Morpheus6/Fooocus-API
Fooocus powered by FastAPI (Supported on all OS and GPU's)
Language: JavaScript - Size: 21.5 KB - Last synced at: 2 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

yohasebe/openai-chat-api-workflow
🎩 An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT models 🤖💬 It also allows image generation/editing/understanding 🖼️, speech-to-text conversion 🎤, and text-to-speech synthesis 🔈
Size: 113 MB - Last synced at: 2 days ago - Pushed at: 8 days ago - Stars: 315 - Forks: 9

jamez-bondos/awesome-gpt4o-images
Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capabilities.
Language: JavaScript - Size: 141 MB - Last synced at: 7 days ago - Pushed at: 24 days ago - Stars: 6,328 - Forks: 569

sethbang/venice-ai
🐍 Python client for Venice.ai. Seamlessly integrate powerful GenAI: chat 💬, image gen 🖼️, audio TTS 🔊, embeddings & more. Supports sync/async, streaming & robust error handling.
Language: Python - Size: 875 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

Echsecutor/gen_ai_container
Container running Invoke AI webservice for AI image generation
Language: Shell - Size: 1.36 MB - Last synced at: 4 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Kobaayyy/Awesome-CVPR2025-CVPR2024-ECCV2024-AIGC
A Collection of Papers and Codes for CVPR2025/CVPR2024/ECCV2024 AIGC
Size: 351 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 556 - Forks: 14

techthoughts2/pwshBedrock
pwshBedrock is a PowerShell module designed to simplify interaction with Amazon Bedrock foundation models. It enables users to send messages, retrieve responses, manage conversation contexts, generate images and videos, and estimate costs. Supporting both InvokeModel and Converse API, it streamlines AI integration in PowerShell workflows.
Language: PowerShell - Size: 5.37 MB - Last synced at: 4 days ago - Pushed at: 8 days ago - Stars: 7 - Forks: 1

FotographerAI/ZenCtrl
In-context subject-driven image generation while preserving foreground fidelity
Language: Python - Size: 5.58 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 278 - Forks: 23

neggles/animatediff-cli
a CLI utility/library for AnimateDiff stable diffusion generation
Language: Python - Size: 123 KB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 261 - Forks: 133

Zyriix/D2O
Official implemention for Diffusion Models Are Innate One-Step Generators
Language: Python - Size: 125 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 22 - Forks: 1

TIGER-AI-Lab/VIEScore
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024)
Language: Python - Size: 21.6 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 46 - Forks: 1

mokira3d48/CVA_NET
This repository contents different scripts for different models to allow you to train a model of images classification.
Language: Python - Size: 163 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

zachdwight/image-gen-huggingface-example
AI Image Generation with HuggingFace Diffusers
Language: Python - Size: 20.5 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

ai-dock/onetrainer
OneTrainer docker images for use in GPU cloud and local environments. Includes AI-Dock KDE Plasma desktop with GPU acceleration and audio for authentication and improved user experience.
Language: Shell - Size: 661 KB - Last synced at: about 20 hours ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 3

MyHoldFast/uebekbot
Best tg bot in the world
Language: Python - Size: 330 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 7 - Forks: 0

zcemycl/Matlab-GAN
MATLAB implementations of Generative Adversarial Networks -- from GAN to Pixel2Pixel, CycleGAN
Language: MATLAB - Size: 124 MB - Last synced at: 4 days ago - Pushed at: about 2 years ago - Stars: 199 - Forks: 85

usetrmnl/byos_node_lite
Image server for TRMNL built with Node.js, JSX and HTML
Language: TypeScript - Size: 1.2 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 3 - Forks: 0

Vahab95/comfyui_HiDream-Sampler
ComfyUI Wrapper for HiDream
Language: Python - Size: 7.43 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0
