GitHub topics: vqgan
fishaudio/fish-speech
SOTA Open Source TTS
Language: Python - Size: 18 MB - Last synced at: 1 day ago - Pushed at: 8 days ago - Stars: 20,735 - Forks: 1,640

sczhou/CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Language: Python - Size: 16.8 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 16,914 - Forks: 3,524

markweberdev/maskbit
Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"
Language: Jupyter Notebook - Size: 73.9 MB - Last synced at: 8 days ago - Pushed at: 10 days ago - Stars: 61 - Forks: 4

chaofengc/FeMaSR
PyTorch codes for "Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors", ACM MM2022 (Oral)
Language: Python - Size: 167 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 220 - Forks: 15

olaviinha/NeuralTextToImage
Colabs for text prompt steered image generators
Language: Jupyter Notebook - Size: 271 KB - Last synced at: 16 days ago - Pushed at: 6 months ago - Stars: 16 - Forks: 5

andrew264/ImageExpts
Doing devious stuff with images
Language: Jupyter Notebook - Size: 5.39 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

JiauZhang/binary-latent-diffusion
Implementation of Binary Latent Diffusion
Language: Python - Size: 799 KB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 51 - Forks: 3

RQ-Wu/RIDCP_dehazing
[CVPR 2023] | RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors
Language: Python - Size: 99.1 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 223 - Forks: 24

SerezD/vqvae-vqgan-pytorch-lightning
VQ-VAE/GAN implementation in pytorch-lightning
Language: Python - Size: 4.06 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 44 - Forks: 4

overshard/ai-art
Art generation using VQGAN + CLIP using docker containers. A simplified, updated, and expanded upon version of Kevin Costa's work. This project tries to make generating art as easy as possible for anyone with a GPU by providing a simple web UI.
Language: Python - Size: 2.23 MB - Last synced at: 6 days ago - Pushed at: almost 3 years ago - Stars: 18 - Forks: 4

khanhvu207/vqgan
An unofficial PyTorch implementation of VQGAN
Language: HTML - Size: 5.66 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

zbr17/OptVQ
Towards training VQ-VAE models robustly!
Language: Python - Size: 28.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 42 - Forks: 0

youngsheen/SimVQ
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Language: Python - Size: 4.86 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 184 - Forks: 5

sbmagar13/VQGAN-CLIP-Text-to-Image
Text-to-Image Synthesis using Multimodal (VQGAN + CLIP) Architectures
Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 8 - Forks: 3

kcosta42/VQGAN-CLIP-Docker
Zero-Shot Text-to-Image Generation VQGAN+CLIP Dockerized
Language: Python - Size: 2.14 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 17

affromero/NTIRE22_Inpainting
NTIRE 2022 - Image Inpainting Challenge
Language: Jupyter Notebook - Size: 101 MB - Last synced at: 17 days ago - Pushed at: almost 3 years ago - Stars: 46 - Forks: 5

Rivera-ai/VQGAN-pytorch-Inference Fork of dome272/VQGAN-pytorch
Branch of the original Project "dome272/VQGAN-pytorch" adding an inference file for the VQGAN (Not for the VQGAN Transformers)
Language: Python - Size: 17.6 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

ericyangchen/MaskGIT-for-image-inpainting_VQGAN-with-Transformer
Implementing MaskGIT for image inpainting with PyTorch
Language: Python - Size: 12.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

joanrod/paper2figure-dataset
Pipeline to create Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer Vision)
Language: Python - Size: 2.03 MB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

hhguo/MSMC-TTS
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
Language: Python - Size: 1.15 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 14

mehdidc/vqgan_nodep
VQGAN from LDM without hell of dependencies
Language: Python - Size: 99.6 KB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

LIAGM/DAEFR
[ICLR 2024] DAEFR: Dual Associated Encoder for Face Restoration
Language: Python - Size: 2.49 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 0

sayedmohamedscu/VQGAN
Vector-Quantized Generative Adversarial Networks
Language: Python - Size: 6.16 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

pytti-tools/pytti-notebook
Start here
Language: Jupyter Notebook - Size: 338 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 108 - Forks: 26

rkhamilton/vqgan-clip-generator
Implements VQGAN+CLIP for image and video generation, and style transfers, based on text and image prompts. Emphasis on ease-of-use, documentation, and smooth video creation.
Language: Jupyter Notebook - Size: 106 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 110 - Forks: 28

mehdidc/feed_forward_vqgan_clip
Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt
Language: Python - Size: 2.4 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 136 - Forks: 18

CasualGANPapers/Make-A-Scene
Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Language: Python - Size: 125 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 300 - Forks: 21

robobeebop/VQGAN-CLIP-Video
Traditional deepdream with VQGAN+CLIP and optical flow. Ready to use in Google Colab.
Language: Python - Size: 9.46 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 22 - Forks: 5

Kurokabe/GANime
Video generation of anime content based on the first and last frame
Language: Jupyter Notebook - Size: 111 MB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 1

rosinality/taming-transformers-pytorch
Implementation of Taming Transformers for High-Resolution Image Synthesis (https://arxiv.org/abs/2012.09841) in PyTorch
Size: 1.95 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 15 - Forks: 0

mahalrs/newsgen
Multi-Modal Image Generation for News Stories
Language: Python - Size: 356 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Qiyuan-Ge/PaintMind
Fast and controllable text-to-iamge model.
Language: Python - Size: 24.8 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 18 - Forks: 4

joanrod/ocr-vqgan
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers
Language: Python - Size: 2.76 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 44 - Forks: 1

happy-jihye/Streamlit-Tutorial
Streamlit Tutorial (ex: stock price dashboard, cartoon-stylegan, vqgan-clip, stylemixing, styleclip, sefa)
Language: Python - Size: 180 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 30 - Forks: 15

TabuaTambalam/DalleWebms
Language: Jupyter Notebook - Size: 187 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 0

KR-HappyFace/KoDALLE
🇰🇷 Text to Image in Korean
Language: Python - Size: 6.55 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 74 - Forks: 21

sborquez/VQGAN_CLIP_docker
Docker for VQGAN+CLIP (z+quantize method)
Language: Python - Size: 3.99 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 14 - Forks: 7

Xibanya/VQGAN-CLIP Fork of nerdyrodent/VQGAN-CLIP
yet another VQGAN-CLIP variation
Language: Python - Size: 38 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

benckx/baudelaire-on-vqgan
Experiments with Baudelaire and a text-to-image GAN.
Language: HTML - Size: 5.12 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

dredwardhyde/vqgan_clip_local
Language: Python - Size: 22.8 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1
