BiOcularGAN: Bimodal Synthesis and Annotation of Ocular Images

Darian Tomašević, Peter Peer, Vitomir Štruc
https://ieeexplore.ieee.org/document/10007982

Abstract: Current state-of-the-art segmentation techniques for ocular images are critically dependent on large-scale annotated datasets, which are labor-intensive to gather and often raise privacy concerns. In this paper, we present a novel framework, called BiOcularGAN, capable of generating synthetic large-scale datasets of photorealistic (visible light and near infrared) ocular images, together with corresponding segmentation labels to address these issues. At its core, the framework relies on a novel Dual-Branch StyleGAN2 (DB-StyleGAN2) model that facilitates bimodal image generation, and a Semantic Mask Generator (SMG) that produces semantic annotations by exploiting DB-StyleGAN2's feature space. We evaluate BiOcularGAN through extensive experiments across five diverse ocular datasets and analyze the effects of bimodal data generation on image quality and the produced annotations. Our experimental results show that BiOcularGAN is able to produce high-quality matching bimodal images and annotations (with minimal manual intervention) that can be used to train highly competitive (deep) segmentation models that perform well across multiple real-world datasets.

Release Notes:

The BiOcularGAN PyTorch framework allows for the generation of matching bimodal ocular images along with corresponding annotations. The framework consists of a Dual-Branch StyleGAN2, based on the StyleGAN2-ADA implementation, and a Style Interpreter, based on the DatasetGAN implementation, updated for use with StyleGAN2.

This repository follows the Nvidia Source Code License.

Requirements and Setup:

  • Linux and Windows are supported, but we recommend Linux for performance and compatibility reasons.
  • 1–8 high-end NVIDIA GPUs with at least 12 GB of memory. We have tested our implementation on an NVIDIA RTX 3060 GPU and an NVIDIA RTX 3090 GPU. Parallelization across multiple GPUs is also supported for training the DB-StyleGAN2 network.
  • We highly recommend using Docker to set up the environment. Please use the provided Dockerfile to build an image with the required library dependencies. (The Docker image requires NVIDIA driver release r455.23 or later.)
  • Otherwise, the requirements are the same as for StyleGAN2-ADA: 64-bit Python 3.7, PyTorch 1.7.1, and CUDA toolkit 11.0 or later. Use at least CUDA 11.1 if running on an RTX 3090. Check the linked repository if you run into any problems. (A quick environment check is sketched below the list.)
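
A quick way to verify that the environment matches the requirements above is a short check with PyTorch (a minimal sketch; the expected values are those listed in this section):

# Environment sanity check for the versions listed above.
import torch

print("PyTorch:", torch.__version__)        # expected: 1.7.1
print("CUDA toolkit:", torch.version.cuda)  # expected: 11.0 or later (11.1+ for an RTX 3090)
print("GPU available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("Device:", torch.cuda.get_device_name(0))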

How to build Docker environment:

docker build --tag sg2ada:latest .

How to Run (using Docker):

To run the BiOcularGAN framework, use the main_BiOcularGAN.ipynb Jupyter Notebook, or follow the steps below:

Step 1. Train the Dual-Branch StyleGAN2 network:

./docker_run.sh python train_DB_StyleGAN2.py --cfg="auto" --snap=20  --data="DATASETS/CrossEyed_256/train/images" --resume="ffhq256" --gpus=1 --mirror=1 --outdir="IJCB_EXPERIMENTS/DB_SG2/experiments_CrossEyed_NIR_RGB_256"

Here the --data argument should point to a directory of RGB images, structured like the example in the IJCB_EXPERIMENTS directory. A sibling directory must contain the corresponding NIR (grayscale) images; the expected layout is sketched below.
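
The layout therefore looks roughly as follows (a sketch based on the --data path used above; the name of the NIR directory is illustrative, so match it to the example in the IJCB_EXPERIMENTS directory):

DATASETS/CrossEyed_256/train/
├── images/        RGB images (passed via --data)
└── images_NIR/    matching NIR (grayscale) images (illustrative name)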

For details on other arguments and the configurations check the StyleGAN2-ADA documentation.

Step 1.5 Prepare for Style Interpreter steps:

First, save the final trained DB-StyleGAN2 model under IJCB_EXPERIMENTS/checkpoints/. Then create the required experiment directory in IJCB_EXPERIMENTS/interpreter/ with the two .json configuration files (generate.json, train_datagan.json). To construct these, use the available template in IJCB_EXPERIMENTS/interpreter/CrossEyed_NIR_RGB/. For more details, check the DatasetGAN documentation.

Step 2. Generate examples of training image pairs:

./docker_run.sh python make_training_data_DB_SG2.py --exp="IJCB_EXPERIMENTS/interpreter/CrossEyed_NIR_RGB/generate.json" --sv_path="IJCB_EXPERIMENTS/interpreter/CrossEyed_NIR_RGB"

This generates image pairs in the images_to_annotate directory.

Step 2.1. Annotate:

Annotate the desired number of images (8 in our experiments) with the desired number of regions (4 and 10 in our experiments). You can use GIMP or any other image-editing software for this. Save the annotations and the original images to the eyes_GIMP directory, as seen in the example.

Step 2.2. Preprocess annotations:

Preprocess the annotations so that the class labels lie in the range {0, 1, 2, ... num_classes} and the annotations are saved as .npy files. You can use the preprocess_annotated_images_and_latents.ipynb Jupyter Notebook for this; however, make sure that the values correspond to your annotations.
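
If you prefer a stand-alone script over the notebook, the following sketch shows the idea, assuming the annotations were exported as grayscale PNGs in the eyes_GIMP directory (the filenames and the value-to-class mapping are hypothetical and must be adapted to your own annotations):

# Minimal preprocessing sketch: remap annotation pixel values to
# contiguous class indices and save the result as a .npy file.
import numpy as np
from PIL import Image

VALUE_TO_CLASS = {0: 0, 85: 1, 170: 2, 255: 3}  # hypothetical gray-value mapping

def preprocess(png_path, npy_path):
    mask = np.array(Image.open(png_path).convert("L"))  # load annotation as grayscale
    out = np.zeros(mask.shape, dtype=np.int64)
    for value, cls in VALUE_TO_CLASS.items():
        out[mask == value] = cls                        # remap to class indices
    np.save(npy_path, out)

preprocess("eyes_GIMP/image_0.png", "eyes_GIMP/image_mask_0.npy")  # hypothetical names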

Step 3. Train the Style Interpreter:

./docker_run.sh python train_interpreter_DB_SG2.py --exp "IJCB_EXPERIMENTS/interpreter/CrossEyed_NIR_RGB/train_datagan.json"

Step 4. Generate dataset of matching RGB and NIR images with corresponding annotations:

./docker_run.sh python train_interpreter_DB_SG2.py --generate_data True --num_sample=500 --exp "IJCB_EXPERIMENTS/interpreter/CrossEyed_NIR_RGB/train_datagan.json" --resume "IJCB_EXPERIMENTS/interpreter/CrossEyed_NIR_RGB"
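
A quick sanity check on the generated annotations (a sketch; the output path below is illustrative, so check where train_interpreter_DB_SG2.py actually writes its results):

# Verify that a generated label map contains only the expected class indices.
import numpy as np

label = np.load("IJCB_EXPERIMENTS/interpreter/CrossEyed_NIR_RGB/label_0.npy")  # hypothetical path
print("Label shape:", label.shape)
print("Classes present:", np.unique(label))  # should lie in {0, ..., num_classes}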

Notes:

  • Pre-trained networks are stored as *.pkl files. These contain the Generator 'G' and Discriminator 'D', as well as 'G_ema', which represents a moving average of the generator weights over several training steps. The generator consists of two submodules, G.mapping and G.synthesis, which can be executed separately and support various additional options, such as truncation (see the sketch after this list). For further examples, check StyleGAN2-ADA.
  • To use BiOcularGAN without Docker, simply remove ./docker_run.sh from the above commands.
  • An example experiment is available under IJCB_EXPERIMENTS/interpreter/CrossEyed_NIR_RGB/.
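
The note on *.pkl files above translates into the following generation sketch, based on the standard StyleGAN2-ADA interface (the checkpoint filename is illustrative; unpickling requires this repository's source on the Python path, and the bimodal DB-StyleGAN2 synthesis output may be laid out differently than in vanilla StyleGAN2-ADA):

# Load a trained generator and synthesize an image via mapping + synthesis.
import pickle
import torch

with open("IJCB_EXPERIMENTS/checkpoints/network-snapshot.pkl", "rb") as f:  # hypothetical name
    G = pickle.load(f)["G_ema"].cuda()        # moving-average generator

z = torch.randn([1, G.z_dim]).cuda()          # random latent code
w = G.mapping(z, None, truncation_psi=0.7)    # map to W space with truncation
img = G.synthesis(w, noise_mode="const")      # NCHW image tensor in [-1, 1]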

License

Copyright © 2021, NVIDIA Corporation. All rights reserved.

This work is made available under the Nvidia Source Code License.

Citation

If you use code or results from this repository, please cite the following publication:

@inproceedings{tomasevic2022bioculargan,
  title={BiOcularGAN: Bimodal Synthesis and Annotation of Ocular Images},
  author={Toma{\v{s}}evi{\'c}, Darian and Peer, Peter and {\v{S}}truc, Vitomir},
  booktitle={IEEE International Joint Conference on Biometrics (IJCB)},
  pages={1--10},
  year={2022},
}

Acknowledgements

Supported in part by the Slovenian Research Agency (ARRS) through the Research Programmes P2-0250 (B) "Metrology and Biometric System" and P2-0214 (A) "Computer Vision", the ARRS Project J2-2501 (A) "DeepBeauty", and the ARRS junior researcher program.
