An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: text-reconstruction

joanrod/ocr-vqgan

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in the Paper2Fig100k dataset. Implements an OCR perceptual loss for clear text-within-image generation. Forked from VQGAN in CompVis/taming-transformers.

Language: Python - Size: 2.76 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 44 - Forks: 1