Topic: "vq-vae"
openai/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
Language: Python - Size: 2.68 MB - Last synced at: 17 days ago - Pushed at: 10 months ago - Stars: 7,965 - Forks: 1,444

clementchadebec/benchmark_VAE
Unifying Variational Autoencoder (VAE) implementations in Pytorch (NeurIPS 2022)
Language: Python - Size: 42.5 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 1,899 - Forks: 174

rosinality/vq-vae-2-pytorch
Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch
Language: Python - Size: 6.72 MB - Last synced at: 18 days ago - Pushed at: about 2 years ago - Stars: 1,713 - Forks: 278

ritheshkumar95/pytorch-vqvae
Vector Quantized VAEs - PyTorch Implementation
Language: Python - Size: 5.13 MB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 791 - Forks: 132

Mael-zys/T2M-GPT
(CVPR 2023) Pytorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”
Language: Python - Size: 19.1 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 663 - Forks: 54

zalandoresearch/pytorch-vq-vae
PyTorch implementation of VQ-VAE by Aäron van den Oord et al.
Language: Jupyter Notebook - Size: 977 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 559 - Forks: 103

nadavbh12/VQ-VAE
Minimalist implementation of VQ-VAE in Pytorch
Language: Python - Size: 514 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 429 - Forks: 82

EugenHotaj/pytorch-generative
Easy generative modeling in PyTorch.
Language: Python - Size: 53.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 366 - Forks: 59

bshall/ZeroSpeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
Language: Python - Size: 17.3 MB - Last synced at: 20 days ago - Pushed at: almost 2 years ago - Stars: 334 - Forks: 45

rese1f/Awesome-VQVAE
A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application
Size: 134 KB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 268 - Forks: 10

swasun/VQ-VAE-Speech 📦
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Language: Python - Size: 82.4 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 247 - Forks: 52

sony/sqvae
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
Language: Python - Size: 158 KB - Last synced at: 18 days ago - Pushed at: almost 3 years ago - Stars: 187 - Forks: 23

USTC-JialunPeng/Diverse-Structure-Inpainting
CVPR 2021: "Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE"
Language: Python - Size: 9.58 MB - Last synced at: 10 months ago - Pushed at: almost 4 years ago - Stars: 175 - Forks: 19

hhguo/MSMC-TTS
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
Language: Python - Size: 1.15 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 14

vvvm23/vqvae-2
PyTorch implementation of VQ-VAE-2 from "Generating Diverse High-Fidelity Images with VQ-VAE-2"
Language: Python - Size: 79.6 MB - Last synced at: 11 days ago - Pushed at: about 2 years ago - Stars: 152 - Forks: 20

bshall/VectorQuantizedCPC
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
Language: Python - Size: 10.4 MB - Last synced at: 18 days ago - Pushed at: over 4 years ago - Stars: 141 - Forks: 23

awei669/VQ-Font
[ICCV 2023] Few shot font generation via transferring similarity guided global and quantization local styles
Language: Jupyter Notebook - Size: 6.73 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 112 - Forks: 2

MIMICLab/L-Verse
L-Verse: Bidirectional Generation Between Image and Text
Language: Python - Size: 1.83 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 108 - Forks: 6

dhgrs/chainer-VQ-VAE
A Chainer implementation of VQ-VAE.
Language: Python - Size: 457 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 82 - Forks: 19

bshall/VectorQuantizedVAE
A PyTorch implementation of "Continuous Relaxation Training of Discrete Latent Variable Image Models"
Language: Jupyter Notebook - Size: 975 KB - Last synced at: 18 days ago - Pushed at: about 5 years ago - Stars: 73 - Forks: 16

Kyushik/Generative-Model
Repository for implementation of generative models with Tensorflow 1.x
Language: Jupyter Notebook - Size: 209 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 69 - Forks: 8

hyperion-ml/hyperion
Python toolkit for speech processing
Language: Python - Size: 150 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 61 - Forks: 18

samxuxiang/hnc-cad
[ICML 2023] Official PyTorch Implementation of "Hierarchical Neural Coding for Controllable CAD Model Generation".
Language: Python - Size: 3.59 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 59 - Forks: 9

1Konny/VQ-VAE
Pytorch Implementation of "Neural Discrete Representation Learning"
Language: Jupyter Notebook - Size: 4.51 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 57 - Forks: 10

DongyaoZhu/Real-Time-Accent-Conversion
Real Time Foreign Accent Conversion
Language: Python - Size: 1.03 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 46 - Forks: 8

zbr17/OptVQ
Towards training VQ-VAE models robustly!
Language: Python - Size: 28.1 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 42 - Forks: 0

SonyCSLParis/interactive-spectrogram-inpainting
Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published at the 2020 Joint Conference on AI Music Creativity.
Language: Python - Size: 113 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 33 - Forks: 4

HenningBuhl/VQ-VAE_Keras_Implementation
Keras Implementation of Vector Quantizer Variational AutoEncoder (VQ-VAE)
Language: Jupyter Notebook - Size: 330 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 29 - Forks: 8

shaojinding/GroupLatentEmbedding Fork of mkotha/WaveRNN
Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Interspeech 2019]
Language: Python - Size: 38.8 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 29 - Forks: 11

jiazhao97/VQ-VAE_withPixelCNNprior
Implementation of Vector Quantised VAE (VQ-VAE) with PixelCNN prior in Tensorflow.
Language: Python - Size: 8.61 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 29 - Forks: 9

tomhosking/hercules
Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)
Language: Python - Size: 81.4 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 4

Tayjsl97/MusER
This is the official implementation of MusER (AAAI'24).
Language: Python - Size: 1.52 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 20 - Forks: 0

pier-maker92/bachsformer
A Bach music generator with Artificial Intelligence. This model is made by a VQ-VAE + Transformer (decoder-only). Sequences of midi 1 quarter length are compressed into 16 codebooks via VQ-VAE and a transformer learns how to generate the codebooks sequence to obtain a midi score.
Language: Python - Size: 21.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 1

swasun/VQ-VAE-Images 📦
PyTorch implementation of VQ-VAE applied on CIFAR10 dataset
Language: Python - Size: 23 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 17 - Forks: 6

danieltudosiu/nmpevqvae
Neuromorphologicaly Preserving Volumetric Data Encoding Using VQ-VAE
Language: Python - Size: 89.8 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 14 - Forks: 6

mehta-lab/dynamorph 📦
Learn morphological states of dynamic cells
Language: Jupyter Notebook - Size: 1.58 GB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 13 - Forks: 1

AndrewBoessen/VQ-VAE
A PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE) with EMA updates, pretrained encoder, and K-means initialization. Efficient discrete representation learning for various data types.
Language: Python - Size: 136 MB - Last synced at: 27 days ago - Pushed at: 4 months ago - Stars: 9 - Forks: 0

IDSIA/kohonen-vae
Official repository for the paper "Topological Neural Discrete Representation Learning à la Kohonen" (ICML 2023 Workshop on Sampling and Optimization in Discrete Space)
Language: Python - Size: 77.1 KB - Last synced at: 21 days ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 1

JiwanSeo/RAQ-VAE
Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models
Language: Python - Size: 199 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 8 - Forks: 1

fostiropoulos/Depthwise-Quantization
code for paper: https://arxiv.org/pdf/2203.08080.pdf
Language: Python - Size: 482 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 0

explainingai-code/VQVAE-Pytorch
This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample numbers using the encoder outputs of trained VQVAE
Language: Python - Size: 35.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 1

andylolu2/jax-vqvae-gpt
Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.
Language: Python - Size: 315 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 2

marksgraham/transformer-ood
Official PyTorch code for "Transformer-based out-of-distribution detection for clinically safe segmentation"
Language: Python - Size: 53.7 KB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

tomhosking/hiro
Language: HTML - Size: 160 MB - Last synced at: 22 days ago - Pushed at: 9 months ago - Stars: 5 - Forks: 3

samedii/perceptor
Modular image generation library
Language: Python - Size: 4.02 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

mehdihosseinimoghadam/Signal-Processing
Signal Processing with Python and Librosa
Language: Jupyter Notebook - Size: 46.6 MB - Last synced at: 22 days ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

TheEngineerProgrammer/my_masters_thesis
cVAE, VQ-VAE, VQ-VAE2, cVAE-cGAN, PixelCNN and Gated PixelCNN in tensorflow 2.x and keras
Language: Jupyter Notebook - Size: 8.8 MB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

AmigoLab/nmpevqvae
Neuromorphologicaly Preserving Volumetric Data Encoding Using VQ-VAE
Language: Python - Size: 89.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

CompletedProjects/Multimodal_VQ-VAE
Implementation of a multi-modal VQ-VAE
Language: Python - Size: 181 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 1

AndrewBoessen/neural-game-engine
Neural network approach for modeling interactive game environments using Vector Quantized Variational Auto-encoder (VQ-VAE) and Spatio-Temporal Transformers. Trained on Atari Skiing gameplay data.
Language: Python - Size: 6.21 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

SekiroRong/KAN-AutoEncoder
KAE : KAN-based AutoEncoder (AE, VAE, VQ-VAE, RVQ, etc.)
Language: Jupyter Notebook - Size: 2.02 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

sayedmohamedscu/VQGAN
Vector-Quantized Generative Adversarial Networks
Language: Python - Size: 6.16 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

tomouellette/autoencodersplz
Generative modeling and representation learning through reconstruction
Language: Python - Size: 27.8 MB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

carlosholivan/audiolm-google-torch
Implementation of the AudioLM model by Google in Pytorch
Size: 420 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

Ighina/VQ-VAE_Topic
An implementation of the paper [Vector-Quantization-Based Topic Modeling](https://dl.acm.org/doi/10.1145/3450946), providing a series of VQ-VAE models for topic modelling. The model reaches state-of-the-art performance on Ng20 and enables the extraction of dense topic vectors for downstream tasks.
Language: Jupyter Notebook - Size: 2.96 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 2

imironhead/ml_representation
replicate representation learning papers
Language: Python - Size: 3.78 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

AdityaNG/nD_VQ_VAE
PyTorch Implementation of an N-Dimensional VQ-VAE
Language: Python - Size: 7.41 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Rishikesh-Jadhav/Video-Compression-and-Future-Prediction-Using-GPT
This repository presents a project focused on advanced video compression and future prediction using Generative Pre-trained Transformer (GPT) and other state-of-the-art techniques.
Language: Jupyter Notebook - Size: 70.2 MB - Last synced at: 14 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 1

Rivera-ai/VQGAN-pytorch-Inference Fork of dome272/VQGAN-pytorch
Branch of the original Project "dome272/VQGAN-pytorch" adding an inference file for the VQGAN (Not for the VQGAN Transformers)
Language: Python - Size: 17.6 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

pranavAL/DART
Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024
Size: 1.87 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

rmgogogo/nano-aigc
Generative models nano version for fun. No STOA here, nano first.
Language: Jupyter Notebook - Size: 2.1 MB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

reshalfahsi/medical-image-latent-space-visualization
Medical Image Latent Space Visualization Using VQ-VAE
Language: Jupyter Notebook - Size: 257 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

aju22/VQ-GANs
This is a simplified implementation of VQ-GANs written in PyTorch. The architecture is borrowed from the paper "Taming Transformers for High-Resolution Image Synthesis".
Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

proger/world-model
Discrete world modeling by recording Coppelia simulations with ROS
Language: Python - Size: 6.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

SiavashShams/Generating-Stuff-with-GANs-and-VAEs
Generating images in different contexts using GANs and Variational Autoencoders
Language: Jupyter Notebook - Size: 14.2 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ondrejbiza/vae
Variational autoencoders implemented in Tensorflow.
Language: Python - Size: 884 KB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

unshun0120/Apply-FederatedLearning-into-Autoencoder
Using Federated Learning to train Autoencoder and its variants' models in pytorch
Language: Python - Size: 900 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

vidursatija/vq-vae
PyTorch implementation for Neural Discrete Representation Learning https://arxiv.org/abs/1711.00937
Language: Jupyter Notebook - Size: 20.6 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

SajjadHm/VQ-VAE
Vector Quantized Variational Auto-Encoder (VQ-VAE): Neural Discrete Representation Learning
Language: Jupyter Notebook - Size: 18 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

zimeng44/Foley-Gen
A generative machine learning model that generates noval foley sounds
Language: Python - Size: 653 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

zoppellarielena/Paper-Presentation-for-Natural-Language-Processing
This presentation, conducted for the "Natural Language Processing" course, delves into the paper's content, which addresses the challenge of generating images for a story using a text-to-image framework. The paper can be accessed at https://arxiv.org/abs/2210.08465.
Size: 0 Bytes - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

howsmyanimeprofilepicture/image-tokenizer-for-mbrl
VQ-VAE-based image tokenizer for model-based RL
Language: Jupyter Notebook - Size: 8.25 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

zzmtsvv/ml_sandbox
Language: Jupyter Notebook - Size: 422 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

wx-b/VideoGPT Fork of wilson1yan/VideoGPT
Size: 3.88 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

mmcenta/discrete-embeddings
Diverse VAE experiments and implementations in TensorFlow.
Language: Python - Size: 39.2 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
