GitHub topics: vqgan

Repositories

fishaudio/fish-speech

SOTA Open Source TTS

Language: Python - Size: 18 MB - Last synced at: 1 day ago - Pushed at: 8 days ago - Stars: 20,735 - Forks: 1,640

sczhou/CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language: Python - Size: 16.8 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 16,914 - Forks: 3,524

markweberdev/maskbit

Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"

Language: Jupyter Notebook - Size: 73.9 MB - Last synced at: 8 days ago - Pushed at: 10 days ago - Stars: 61 - Forks: 4

chaofengc/FeMaSR

PyTorch codes for "Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors", ACM MM2022 (Oral)

Language: Python - Size: 167 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 220 - Forks: 15

olaviinha/NeuralTextToImage

Colabs for text prompt steered image generators

Language: Jupyter Notebook - Size: 271 KB - Last synced at: 16 days ago - Pushed at: 6 months ago - Stars: 16 - Forks: 5

andrew264/ImageExpts

Doing devious stuff with images

Language: Jupyter Notebook - Size: 5.39 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

JiauZhang/binary-latent-diffusion

Implementation of Binary Latent Diffusion

Language: Python - Size: 799 KB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 51 - Forks: 3

RQ-Wu/RIDCP_dehazing

[CVPR 2023] | RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors

Language: Python - Size: 99.1 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 223 - Forks: 24

SerezD/vqvae-vqgan-pytorch-lightning

VQ-VAE/GAN implementation in pytorch-lightning

Language: Python - Size: 4.06 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 44 - Forks: 4

Art generation using VQGAN + CLIP using docker containers. A simplified, updated, and expanded upon version of Kevin Costa's work. This project tries to make generating art as easy as possible for anyone with a GPU by providing a simple web UI.

Language: Python - Size: 2.23 MB - Last synced at: 6 days ago - Pushed at: almost 3 years ago - Stars: 18 - Forks: 4

khanhvu207/vqgan

An unofficial PyTorch implementation of VQGAN

Language: HTML - Size: 5.66 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

zbr17/OptVQ

Towards training VQ-VAE models robustly!

Language: Python - Size: 28.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 42 - Forks: 0

youngsheen/SimVQ

SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Language: Python - Size: 4.86 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 184 - Forks: 5

sbmagar13/VQGAN-CLIP-Text-to-Image

Text-to-Image Synthesis using Multimodal (VQGAN + CLIP) Architectures

Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 8 - Forks: 3

kcosta42/VQGAN-CLIP-Docker

Zero-Shot Text-to-Image Generation VQGAN+CLIP Dockerized

Language: Python - Size: 2.14 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 17

affromero/NTIRE22_Inpainting

NTIRE 2022 - Image Inpainting Challenge

Language: Jupyter Notebook - Size: 101 MB - Last synced at: 17 days ago - Pushed at: almost 3 years ago - Stars: 46 - Forks: 5

Rivera-ai/VQGAN-pytorch-Inference Fork of dome272/VQGAN-pytorch

Branch of the original Project "dome272/VQGAN-pytorch" adding an inference file for the VQGAN (Not for the VQGAN Transformers)

Language: Python - Size: 17.6 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

ericyangchen/MaskGIT-for-image-inpainting_VQGAN-with-Transformer

Implementing MaskGIT for image inpainting with PyTorch

Language: Python - Size: 12.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

joanrod/paper2figure-dataset

Pipeline to create Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer Vision)

Language: Python - Size: 2.03 MB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

hhguo/MSMC-TTS

Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS

Language: Python - Size: 1.15 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 14

mehdidc/vqgan_nodep

VQGAN from LDM without hell of dependencies

Language: Python - Size: 99.6 KB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

LIAGM/DAEFR

[ICLR 2024] DAEFR: Dual Associated Encoder for Face Restoration

Language: Python - Size: 2.49 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 0

sayedmohamedscu/VQGAN

Vector-Quantized Generative Adversarial Networks

Language: Python - Size: 6.16 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

pytti-tools/pytti-notebook

Start here

Language: Jupyter Notebook - Size: 338 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 108 - Forks: 26

rkhamilton/vqgan-clip-generator

Implements VQGAN+CLIP for image and video generation, and style transfers, based on text and image prompts. Emphasis on ease-of-use, documentation, and smooth video creation.

Language: Jupyter Notebook - Size: 106 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 110 - Forks: 28

mehdidc/feed_forward_vqgan_clip

Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt

Language: Python - Size: 2.4 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 136 - Forks: 18

CasualGANPapers/Make-A-Scene

Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

Language: Python - Size: 125 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 300 - Forks: 21

robobeebop/VQGAN-CLIP-Video

Traditional deepdream with VQGAN+CLIP and optical flow. Ready to use in Google Colab.

Language: Python - Size: 9.46 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 22 - Forks: 5

Kurokabe/GANime

Video generation of anime content based on the first and last frame

Language: Jupyter Notebook - Size: 111 MB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 1

rosinality/taming-transformers-pytorch

Implementation of Taming Transformers for High-Resolution Image Synthesis (https://arxiv.org/abs/2012.09841) in PyTorch

Size: 1.95 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 15 - Forks: 0

mahalrs/newsgen

Multi-Modal Image Generation for News Stories

Language: Python - Size: 356 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Qiyuan-Ge/PaintMind

Fast and controllable text-to-iamge model.

Language: Python - Size: 24.8 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 18 - Forks: 4

joanrod/ocr-vqgan

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers

Language: Python - Size: 2.76 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 44 - Forks: 1

happy-jihye/Streamlit-Tutorial

Streamlit Tutorial (ex: stock price dashboard, cartoon-stylegan, vqgan-clip, stylemixing, styleclip, sefa)

Language: Python - Size: 180 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 30 - Forks: 15

TabuaTambalam/DalleWebms

Language: Jupyter Notebook - Size: 187 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 0

KR-HappyFace/KoDALLE

🇰🇷 Text to Image in Korean

Language: Python - Size: 6.55 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 74 - Forks: 21

sborquez/VQGAN_CLIP_docker

Docker for VQGAN+CLIP (z+quantize method)

Language: Python - Size: 3.99 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 14 - Forks: 7

Xibanya/VQGAN-CLIP Fork of nerdyrodent/VQGAN-CLIP

yet another VQGAN-CLIP variation

Language: Python - Size: 38 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

benckx/baudelaire-on-vqgan

Experiments with Baudelaire and a text-to-image GAN.

Language: HTML - Size: 5.12 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

dredwardhyde/vqgan_clip_local

Language: Python - Size: 22.8 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

Related Keywords

vqgan 40 pytorch 12 vqgan-clip 8 text-to-image 7 gan 7 image-generation 7 deep-learning 6 clip 5 vq-vae 4 taming-transformers 4 generative-adversarial-network 4 vqvae 4 machine-learning 3 codebook 3 docker 3 deep-neural-networks 2 ai 2 image-processing 2 image 2 python 2 pytorch-lightning 2 google-colab 2 cuda 2 generative-model 2 vector-quantization 2 artificial-neural-networks 2 generative-art 2 transformers 2 gans 2 paper2fig100k 2 paper2fig 2 ocr-vqgan 2 deep-generative-model 2 dataset 2 super-resolution 2 face-restoration 2 tts 2 transformer 2 stable-diffusion 2 text2image 2 stylegan2 1 art 1 stylemixing 1 style-transfer 1 min-dalle 1 image-manipulation 1 animation 1 encoder-decoder-model 1 ncnn 1 daefr 1 ncnn-model 1 portfolio 1 dalle 1 korean 1 ldm 1 latent-diffusion-models 1 jupyter-notebook 1 vocoder 1 text-to-speech 1 speech-synthesis 1 demo 1 gallery 1 image-reconstruction 1 ocr 1 vit-vqgan 1 muse-maskgit 1 text-reconstruction 1 text-image 1 cartoon-stylegan 1 multi-modal 1 dalle-mini 1 sefa 1 video-generation 1 stock 1 tensorflow2 1 gpt2 1 anime 1 video 1 streamlit 1 streamlit-application 1 optical-flow 1 google-colab-notebook 1 streamlit-dashboard 1 deepdream 1 openai-clip 1 streamlit-tutorial 1 streamlit-webapp 1 styleclip 1 dehaze 1 cvpr2023 1 latent-diffusion 1 unet 1 txt2img 1 rudalle 1 openai 1 neural-networks 1 neural-network 1 laion 1 dalle2 1 dall-e 1