Implementing MaskGIT for image inpainting with PyTorch
-
Updated
May 20, 2024 - Python
Implementing MaskGIT for image inpainting with PyTorch
yet another VQGAN-CLIP variation
Multi-Modal Image Generation for News Stories
Pipeline to create Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer Vision)
Vector-Quantized Generative Adversarial Networks
VQGAN from LDM without hell of dependencies
Art generation using VQGAN + CLIP using docker containers. A simplified, updated, and expanded upon version of Kevin Costa's work. This project tries to make generating art as easy as possible for anyone with a GPU by providing a simple web UI.
Traditional deepdream with VQGAN+CLIP and optical flow. Ready to use in Google Colab.
[ICLR 2024] DAEFR: Dual Associated Encoder for Face Restoration
VQ-VAE/GAN implementation in pytorch-lightning
Fast and controllable text-to-image model.
Streamlit Tutorial (ex: stock price dashboard, cartoon-stylegan, vqgan-clip, stylemixing, styleclip, sefa)
Implementation of Binary Latent Diffusion
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers
Zero-Shot Text-to-Image Generation VQGAN+CLIP Dockerized
Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
Add a description, image, and links to the vqgan topic page so that developers can more easily learn about it.
To associate your repository with the vqgan topic, visit your repo's landing page and select "manage topics."