taming-transformers | Topic | Ecosyste.ms: Repos

Topic: "taming-transformers"

joanrod/ocr-vqgan

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers

Language: Python - Size: 2.76 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 44 - Forks: 1

rosinality/taming-transformers-pytorch

Implementation of Taming Transformers for High-Resolution Image Synthesis (https://arxiv.org/abs/2012.09841) in PyTorch

Size: 1.95 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 16 - Forks: 1

sbmagar13/VQGAN-CLIP-Text-to-Image

Text-to-Image Synthesis using Multimodal (VQGAN + CLIP) Architectures

Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: 8 days ago - Pushed at: 7 months ago - Stars: 8 - Forks: 3

mehdidc/vqgan_nodep

VQGAN from LDM without hell of dependencies

Language: Python - Size: 99.6 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

krishnakaushik25/VQGAN-CLIP

Gradio Web app for running VQGAN-CLIP locally

Language: Jupyter Notebook - Size: 52.3 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 6

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos