GitHub topics: vqvae

Repositories

fishaudio/fish-speech

SOTA Open Source TTS

Language: Python - Size: 18.7 MB - Last synced at: 2 days ago - Pushed at: 14 days ago - Stars: 22,000 - Forks: 1,800

FoundationVision/OmniTokenizer

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Language: Python - Size: 68.9 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 295 - Forks: 6

AntixK/PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language: Python - Size: 45.4 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 7,151 - Forks: 1,130

mahmoodlab/SISH

Fast and scalable search of whole-slide images via self-supervised deep learning - Nature Biomedical Engineering

Language: Python - Size: 963 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 102 - Forks: 27

MissMeriel/PreFixer

Learning universal transformations between perception datasets to overcome sensor hardware versioning

Language: Python - Size: 2.09 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

v-iashin/SpecVQGAN

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

Language: Jupyter Notebook - Size: 163 MB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 360 - Forks: 39

haoliuhl/language-quantized-autoencoders

Language Quantized AutoEncoders

Language: Python - Size: 37.1 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 103 - Forks: 5

amzn/sparse-vqvae

Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper

Language: Python - Size: 64.5 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 34 - Forks: 14

k2kobayashi/crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

Language: Python - Size: 12.4 MB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 171 - Forks: 31

BhanuPrakashPebbeti/Image-Generation-Using-VQVAE

Image Generation using VQVAE and GPT Models

Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 13 - Forks: 2

unshun0120/Apply-FederatedLearning-into-Autoencoder

Using Federated Learning to train Autoencoder and its variants' models in pytorch

Language: Python - Size: 900 KB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ZhengdiYu/SignAvatars

(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark

Language: Python - Size: 130 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 77 - Forks: 4

SerezD/vqvae-vqgan-pytorch-lightning

VQ-VAE/GAN implementation in pytorch-lightning

Language: Python - Size: 4.06 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 44 - Forks: 4

Vermeille/Torchelie

Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.

Language: Python - Size: 1.7 MB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 110 - Forks: 11

SnowYJ/T5VQVAE

Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders

Language: Python - Size: 4.12 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

vsimkus/vae-voice-conversion

Voice conversion (VC) investigation using three variants of VAE

Language: Python - Size: 33.2 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 57 - Forks: 11

zbr17/OptVQ

Towards training VQ-VAE models robustly!

Language: Python - Size: 28.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 42 - Forks: 0

MIMICLab/BITTERS

Large-Scale Bidirectional Training for Zero-Shot Image Captioning

Language: Python - Size: 15 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 2

comibear/Autoencoders

Language: Jupyter Notebook - Size: 29.5 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition

Inverse DALL-E for Optical Character Recognition

Language: Python - Size: 6.9 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 38 - Forks: 6

jaywalnut310/Vector-Quantized-Autoencoders

Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"

Language: Python - Size: 497 KB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 14 - Forks: 3

mehdidc/vqgan_nodep

VQGAN from LDM without hell of dependencies

Language: Python - Size: 99.6 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

lionelblonde/vq-compression-pytorch

Compression via Vector Quantization in PyTorch

Language: Python - Size: 126 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

explainingai-code/VQVAE-Pytorch

This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample numbers using the encoder outputs of trained VQVAE

Language: Python - Size: 35.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

rogertrullo/VQVAE_Pytorch

implementation of VQVAE in pytorch

Language: Jupyter Notebook - Size: 108 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

aillaud/VQVAE_Flax

Implementation of basic autoencodeur, VAE and VQVAE in Flax

Language: Jupyter Notebook - Size: 1.24 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hqyyqh888/RobustSemanComm

Demo of robust semantic communication against semantic noise

Language: Python - Size: 84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 8

aillaud/Diffusion-models

State of the art of generative models and in-depth study of diffusion models

Language: Jupyter Notebook - Size: 127 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

viviaxenov/text_to_image_with_transformer Fork of stankevich-mipt/text_to_image_with_transformer

An educational project dedicated to text-to-image generation with neural networks. VQVAE and BPE autoencoders are used to learn the embedding of text and image respectively. A transformer-based model then is trained to predict the next token in the concatenated sequence of image and text tokens and used for generation.

Size: 4.88 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

fostiropoulos/dvq

Applying multiple VQ along the feature axis

Language: Jupyter Notebook - Size: 45.5 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 2

DongyaoZhu/Real-Time-Accent-Conversion

Real Time Foreign Accent Conversion

Language: Python - Size: 1.03 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 46 - Forks: 8

maj34/facegram Fork of face-gram/facegram

[GDSC Solution Challenge] Facegram All-Part Merged Repository

Language: Jupyter Notebook - Size: 5.22 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

vqvae 35 pytorch 13 deep-learning 9 vae 8 transformer 5 vector-quantization 4 vqgan 4 vq-vae 4 autoencoder 3 image-generation 3 pytorch-lightning 3 autoencoders 2 computer-vision 2 text-to-image 2 multimodal 2 neural-networks 2 stable-diffusion 2 gan 2 dalle 2 vocoder 2 voice-conversion 2 melgan 2 ai 2 machine-learning 2 variational-autoencoder 2 image-captioning 2 loss 1 perceptual 1 torch 1 ldm 1 latent-diffusion-models 1 wmt 1 tensorflow 1 transformers 1 utils 1 language-model 1 vae-pytorch 1 optical-character-recognition 1 optimal-transport 1 ocr 1 nlp 1 image-to-text 1 huggingface 1 gpt2 1 autoencoder-mnist 1 vector-quantized 1 bitters 1 ge2e 1 generalised-end-to-end 1 multiband-melgan 1 speaker-embedding 1 speech-recognition 1 voice-cloning 1 backend 1 face-generation 1 frontend 1 koclip 1 arbitrary-conditioning 1 density-estimation 1 imputation 1 inpainting 1 posterior-matching 1 vade 1 vdvae 1 pytorch-vqvae 1 taming-transformers 1 bigearthnet 1 compression 1 earth-observation 1 learned-compression 1 convolutional-neural-networks 1 pixelcnn 1 flax 1 mask 1 semantic-communication 1 semantic-noise 1 diffusion-models 1 gans 1 bpe 1 accent 1 acoustic-model 1 domain-transfer 1 foreign-accent-conversion 1 llama 1 vae-implementation 1 variational-autoencoders 1 wae 1 bioimage-analysis 1 bioimage-informatics 1 fish 1 histology 1 histopathology 1 image-retrieval 1 image-search-engine 1 mahmoodlab 1 pathology 1 wsi-images 1 beamng 1 dave2-architecture 1 hardware-migration 1