GAN-Diffusion-Model-for-Colorization

This repository contains the code for Image Colorization using Generative Adversarial Network and Guided-Diffusion Model.

Generative Adversarial Network Model:

The colorization GAN code is available as Jupyter Notebook and is quite straight forward. This GAN model is trained over Stanford Cars, COCO dataset and Landscapes dataset for a fair number of epochs.

Link to GAN checkpoint: 256x256_GAN_checkpoint.pt

Guided-Diffusion Model:

This code is a replica of the OpenAI's Guided-Diffusion code Original Repo, where we have added changes necessary for Image Colorization using Class Conditioned Diffusion.

This model is trained only for a few iterations over just Stanford Cars Dataset.

Link to trained model: 64x64_cond_diffusion.pt

You might want to run this command before getting started.

pip install -e .

Sampling

The below command can be used to get the images colorized. Here in our code we have trained the diffusion only to enhance the images from GAN model. So in our case the conditioning is over GAN output images and not gray scale images directly.

mpiexec -n 1 python scripts/colorize_sample.py --attention_resolutions 32,16,8 --class_cond True --diffusion_steps 1000 --dropout 0.1 --image_size 64 --noise_schedule linear --num_channels 128 --num_head_channels -1 --num_res_blocks 2 --resblock_updown True --use_fp16 False --use_scale_shift_norm True --model_path models/64x64_cond_diffusion.pt --base_samples test_image.npz --batch_size 1 --num_samples 1 --timestep_respacing 250 --learn_sigma True

For sampling the images need to be stored as a .npz file. Utility code to convert the images of .jpg or .png or any other image formats to .npz format is included in the utility directory.

Training

The below command can be used to train the colorization model on a machine with a single GPU of atleast 6GB VRAM.

mpiexec -n 1 python scripts/colorize_train.py --data_dir "path/to/orig_dataset" --batch_size 1 --lr 3e-4 --save_interval 100 --log_interval 100 --weight_decay 0.05 --image_size 64 --attention_resolutions 32,16,8 --resblock_updown True --use_scale_shift_norm True --learn_sigma True --num_channels 128 --noise_schedule linear --class_cond True

As this is a class conditioned model we will be concatenating the gray scale image or an equivalent to the noise during the sampling phase. So we would need the gray scale or equivalent images in another directory where the original data is present. eg. "path/to/dataset"

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
Images		Images
datasets		datasets
evaluations		evaluations
guided_diffusion		guided_diffusion
scripts		scripts
utility		utility
GAN.ipynb		GAN.ipynb
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

GAN-Diffusion-Model-for-Colorization

Generative Adversarial Network Model:

Guided-Diffusion Model:

Sampling

Training

Sample Outputs

Sample Outputs from GAN

Sample Outputs from Diffusion

About

Uh oh!

Releases

Packages

Languages

manojkumar202/Diffusion-Model-for-Colorization

Folders and files

Latest commit

History

Repository files navigation

GAN-Diffusion-Model-for-Colorization

Generative Adversarial Network Model:

Guided-Diffusion Model:

Sampling

Training

Sample Outputs

Sample Outputs from GAN

Sample Outputs from Diffusion

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages