GitHub topics: vqvae
fishaudio/fish-speech
SOTA Open Source TTS
Language: Python - Size: 18 MB - Last synced at: about 3 hours ago - Pushed at: 10 days ago - Stars: 20,785 - Forks: 1,642

MissMeriel/PreFixer
Learning universal transformations between perception datasets to overcome sensor hardware versioning
Language: Python - Size: 2.09 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Language: Python - Size: 45.4 MB - Last synced at: 14 days ago - Pushed at: about 1 month ago - Stars: 7,049 - Forks: 1,117

FoundationVision/OmniTokenizer
[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.
Language: Python - Size: 68.9 MB - Last synced at: 15 days ago - Pushed at: 10 months ago - Stars: 287 - Forks: 7

v-iashin/SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
Language: Jupyter Notebook - Size: 163 MB - Last synced at: 13 days ago - Pushed at: 9 months ago - Stars: 360 - Forks: 39

haoliuhl/language-quantized-autoencoders
Language Quantized AutoEncoders
Language: Python - Size: 37.1 KB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 103 - Forks: 5

mahmoodlab/SISH
Fast and scalable search of whole-slide images via self-supervised deep learning - Nature Biomedical Engineering
Language: Python - Size: 963 KB - Last synced at: 23 days ago - Pushed at: almost 2 years ago - Stars: 100 - Forks: 27

amzn/sparse-vqvae
Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper
Language: Python - Size: 64.5 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 34 - Forks: 14

k2kobayashi/crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Language: Python - Size: 12.4 MB - Last synced at: 16 days ago - Pushed at: 9 months ago - Stars: 171 - Forks: 31

BhanuPrakashPebbeti/Image-Generation-Using-VQVAE
Image Generation using VQVAE and GPT Models
Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 13 - Forks: 2

unshun0120/Apply-FederatedLearning-into-Autoencoder
Using Federated Learning to train Autoencoder and its variants' models in pytorch
Language: Python - Size: 900 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ZhengdiYu/SignAvatars
(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
Language: Python - Size: 130 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 77 - Forks: 4

SerezD/vqvae-vqgan-pytorch-lightning
VQ-VAE/GAN implementation in pytorch-lightning
Language: Python - Size: 4.06 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 44 - Forks: 4

Vermeille/Torchelie
Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.
Language: Python - Size: 1.7 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 110 - Forks: 11

SnowYJ/T5VQVAE
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders
Language: Python - Size: 4.12 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

vsimkus/vae-voice-conversion
Voice conversion (VC) investigation using three variants of VAE
Language: Python - Size: 33.2 MB - Last synced at: 20 days ago - Pushed at: over 5 years ago - Stars: 57 - Forks: 11

zbr17/OptVQ
Towards training VQ-VAE models robustly!
Language: Python - Size: 28.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 42 - Forks: 0

MIMICLab/BITTERS
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Language: Python - Size: 15 MB - Last synced at: 9 months ago - Pushed at: about 2 years ago - Stars: 21 - Forks: 2

comibear/Autoencoders
Language: Jupyter Notebook - Size: 29.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition
Inverse DALL-E for Optical Character Recognition
Language: Python - Size: 6.9 MB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 38 - Forks: 6

mehdidc/vqvae_lightning
Language: Python - Size: 155 KB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

jaywalnut310/Vector-Quantized-Autoencoders
Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"
Language: Python - Size: 497 KB - Last synced at: 23 days ago - Pushed at: about 6 years ago - Stars: 14 - Forks: 3

mehdidc/vqgan_nodep
VQGAN from LDM without hell of dependencies
Language: Python - Size: 99.6 KB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

lionelblonde/vq-compression-pytorch
Compression via Vector Quantization in PyTorch
Language: Python - Size: 126 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

explainingai-code/VQVAE-Pytorch
This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample numbers using the encoder outputs of trained VQVAE
Language: Python - Size: 35.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 1

rogertrullo/VQVAE_Pytorch
implementation of VQVAE in pytorch
Language: Jupyter Notebook - Size: 108 KB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

aillaud/VQVAE_Flax
Implementation of basic autoencodeur, VAE and VQVAE in Flax
Language: Jupyter Notebook - Size: 1.24 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hqyyqh888/RobustSemanComm
Demo of robust semantic communication against semantic noise
Language: Python - Size: 84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 8

aillaud/Diffusion-models
State of the art of generative models and in-depth study of diffusion models
Language: Jupyter Notebook - Size: 127 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

viviaxenov/text_to_image_with_transformer Fork of stankevich-mipt/text_to_image_with_transformer
An educational project dedicated to text-to-image generation with neural networks. VQVAE and BPE autoencoders are used to learn the embedding of text and image respectively. A transformer-based model then is trained to predict the next token in the concatenated sequence of image and text tokens and used for generation.
Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

fostiropoulos/dvq
Applying multiple VQ along the feature axis
Language: Jupyter Notebook - Size: 45.5 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 2

DongyaoZhu/Real-Time-Accent-Conversion
Real Time Foreign Accent Conversion
Language: Python - Size: 1.03 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 46 - Forks: 8

maj34/facegram Fork of face-gram/facegram
[GDSC Solution Challenge] Facegram All-Part Merged Repository
Language: Jupyter Notebook - Size: 5.22 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

lupalab/posterior-matching
Official code for the NeurIPS 2022 paper "Posterior Matching for Arbitrary Conditioning".
Language: Python - Size: 607 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

DYZhang09/VQ-VAE
naive pytorch implementation of VQ-VAE
Language: Python - Size: 45.9 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 1
