
YAI-Alchera Project : Wildfire Image Generator

Repository : https://github.com/YAI-alcheragan/YAI-alcheraGAN

Credits

YAI 10th 2022 Summer Project - AlcheraGAN Team

  • Team Leader : Gihyun Kim - Yonsei Univ. Dept. of Artificial Intelligence.

  • Team Member : Donggeon Bae - Yonsei Univ. Dept. of Electrical and Electronic Engineering.

  • Team Member : Jungbin Cho - Yonsei Univ. Dept. of Electrical and Electronic Engineering.

  • Team Member : Minjae Lee - Yonsei Univ. Dept. of Creative Technology Management.

  • Team Member : Sangheon Lee - Yonsei Univ. Dept. of Electrical and Electronic Engineering.

  • Team Member : Subin Kang - Yonsei Univ. Dept. of Computer Science.


Installation

  1. Create an anaconda environment.

     conda create -n <env_name>
     conda activate <env_name>

  2. Install PyTorch.

     conda install pytorch torchvision cudatoolkit=11.3 -c pytorch

  3. Clone the repository.

     git clone https://github.com/YAI-alcheragan/YAI-alcheraGAN.git

  4. Install pip packages.

     pip install -r requirements.txt

Dataset

Wildfire Dataset Taken from Alchera Wildfire Detection : https://alchera.ai/

Data Description

This dataset contains wildfire images provided as detection sets by Alchera, an AI technology company whose products include face recognition and anomalous-situation detection (e.g. wildfire).

In total the dataset contains 20k wild-forest images of size 1920x1080 as JPG files: 10k confirmed (detected as a wildfire situation) and 10k skipped (nothing detected). Each confirmed image comes with a JSON label giving the bounding-box position of the area classified as a wildfire situation.

We generated a segmentation-mask folder for the confirmed dataset mentioned above, where each mask shares its label (image name) with its segmentation target. This directory contains binary masks of size 1920x1080, matching the original target images. The actual smoke area is 128x128, with its center position derived from the given image's bounding box.
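The mask layout described above can be sketched as follows. `make_mask` is a hypothetical helper, not the repository's actual code, and the real masks contain the segmented smoke shape rather than a solid square of ones:

```python
import numpy as np

def make_mask(bbox, image_size=(1080, 1920), patch=128):
    """Build a binary 1920x1080 mask with a patch x patch region of ones
    centered on the bbox center. bbox follows the label format
    [left, up, right, bottom]."""
    left, up, right, bottom = bbox
    cy = (up + bottom) // 2       # vertical center of the bbox
    cx = (left + right) // 2      # horizontal center of the bbox
    half = patch // 2
    mask = np.zeros(image_size, dtype=np.uint8)
    # clip the patch so it stays inside the image
    y0, y1 = max(cy - half, 0), min(cy + half, image_size[0])
    x0, x1 = max(cx - half, 0), min(cx + half, image_size[1])
    mask[y0:y1, x0:x1] = 1
    return mask
```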

An additional dataset, Smoke5K, was used to train the segmentation model (UNet++: A Nested U-Net Architecture for Medical Image Segmentation). It contains 5,400 images: a mixture of real (1,400) and synthetic (4,000) smoke images. To prevent the segmentation model from overfitting and to keep the segmented smoke looking natural on general images, we chose this independent smoke set for training.

The files within each fold directory (excluding the additional dataset) are:

  • confirmed/#/images/ - directory contains 10k of 1920x1080 jpg images

  • confirmed/#/labels/ - directory containing 10k JSON labels ([left, up, right, bottom]) for the images above

  • confirmed/#/masks/ - directory contains 10k of segmented masks for the directory mentioned above

  • skipped/ - directory contains 10k of 1920x1080 normal wild forest images

Data Preview

To run inference, create a datasets folder consisting of images, labels, and masks per fold #. Make sure to arrange the files in the following structure:

└── YAI-alcheraGAN/
    ├── datasets/
    │   ├── confirmed/
    │   │   ├── 1/
    │   │   │   ├── images/
    │   │   │   │   └── (labelfilenames)
    │   │   │   ├── labels/
    │   │   │   │   └── (labelfilenames)
    │   │   │   └── masks/
    │   │   │       └── (labelfilenames)
    │   │   └── ...
    │   └── skipped/
    │       ├── 1/
    │       │   └── images/(labelfilenames)
    │       └── ...
    ├── train_segmentation
    ├── gp_gan.py
    └── ...

The masks folders are generated by train_segmentation.

Origin


Model Architecture

Pipeline

GP-GAN: Backbone

Wildfire Segmentation: Generate Mask

Color Match: Determine Mask Position Depending on Color Distribution

Metrics

  1. FID

$$ \text{FID} = ||\mu_x-\mu_y||^2 + \text{Tr}\left(\Sigma_x + \Sigma_y - 2\sqrt{\Sigma_x\Sigma_y}\right) $$
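Given pre-extracted Inception features for the two image sets, the formula above can be computed directly. This is a minimal NumPy/SciPy sketch, not the repository's metric.py implementation:

```python
import numpy as np
from scipy import linalg

def fid(feats_x, feats_y):
    """Frechet Inception Distance between two feature sets
    (rows = samples, columns = Inception features)."""
    mu_x, mu_y = feats_x.mean(axis=0), feats_y.mean(axis=0)
    cov_x = np.cov(feats_x, rowvar=False)
    cov_y = np.cov(feats_y, rowvar=False)
    # matrix square root of the covariance product
    covmean = linalg.sqrtm(cov_x @ cov_y)
    if np.iscomplexobj(covmean):      # drop tiny imaginary parts
        covmean = covmean.real
    return float(((mu_x - mu_y) ** 2).sum()
                 + np.trace(cov_x + cov_y - 2 * covmean))
```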

  2. KID

$$ \text{KID} = E_{x,x^{\prime}\sim p}[K(x,x^{\prime})]+E_{x,x^{\prime}\sim q}[K(x,x^{\prime})]-2E_{x\sim p,\,x^{\prime}\sim q}[K(x,x^{\prime})] $$
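KID is the squared MMD between the real feature distribution p and the generated one q. A minimal sketch using the cubic polynomial kernel commonly used for KID (again, not the repository's metric.py):

```python
import numpy as np

def poly_kernel(a, b):
    """Cubic polynomial kernel K(x, x') = (x.x'/d + 1)^3."""
    d = a.shape[1]
    return (a @ b.T / d + 1.0) ** 3

def kid(x, y):
    """Unbiased squared-MMD estimate between feature matrices
    x ~ p and y ~ q (rows = samples)."""
    m, n = x.shape[0], y.shape[0]
    k_xx, k_yy, k_xy = poly_kernel(x, x), poly_kernel(y, y), poly_kernel(x, y)
    # drop diagonal terms for the unbiased within-set estimates
    sum_xx = (k_xx.sum() - np.trace(k_xx)) / (m * (m - 1))
    sum_yy = (k_yy.sum() - np.trace(k_yy)) / (n * (n - 1))
    return float(sum_xx + sum_yy - 2 * k_xy.mean())
```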

Density and Coverage:

  1. Density

$$ \text{Density} := \frac{1}{kM}\sum_{j=1}^{M}\sum_{i=1}^{N}1_{Y_j\in B(X_i,\text{NND}_k(X_i))} $$

  2. Coverage

$$ \text{Coverage} := \frac{1}{N}\sum_{i=1}^{N}1_{\exists\text{ }j\text{ s.t. } Y_j\in B(X_i,\text{NND}_k(X_i))} $$
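Both formulas can be evaluated with brute-force pairwise distances, where the X_i are real features, the Y_j are generated features, and B(X_i, NND_k(X_i)) is the ball around X_i with radius equal to the distance to its k-th nearest real neighbour. A minimal NumPy sketch:

```python
import numpy as np

def density_coverage(real, fake, k=5):
    """Density and Coverage from raw feature arrays
    (rows = samples). Returns (density, coverage)."""
    # pairwise distances among real samples -> k-NN radii
    d_rr = np.linalg.norm(real[:, None] - real[None], axis=-1)
    radii = np.sort(d_rr, axis=1)[:, k]   # index 0 is distance to self
    # distances from each fake sample to each real sample
    d_fr = np.linalg.norm(fake[:, None] - real[None], axis=-1)
    inside = d_fr < radii[None, :]        # inside[j, i]: Y_j in B(X_i, r_i)
    density = inside.sum() / (k * fake.shape[0])
    coverage = inside.any(axis=0).mean()  # fraction of covered real points
    return density, coverage
```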


Results

Source Example

Segmentation Example

Target Example

Final Output

Experiment

Experiment description

Target Example

Experiment examples

Source Example


Pretrained models

GP-GAN: Blending GAN G(x)

Wildfire Segmentation: UNet++


Full Source Code

All supporting code is modularized as a package. See the full code in the GitHub repository.

To train,


Wildfire Segmentation: train_segmentation

GP-GAN: train_blending_gan

To run inference,


  1. Create Segmentation Mask Images

Wildfire Segmentation: inference_segmentation

  2. Run the GP-GAN model:
python inference_gpgan.py --root ./datasets --result_folder ./results --experiment_folder ./experiments --g_path ./blending_gan.npz 

To compute metrics,


  1. Set up a folder of true (real) images and a folder of false (generated) images.
└── true_data_root/
    ├── true_image1.jpg
    ├── true_image2.jpg
    └── ...
└── false_data_root/
    ├── false_image1.jpg
    ├── false_image2.jpg
    └── ...
  2. Compute metrics.
python metric.py --true_data_root "real image root" --false_data_root "generated image root" --calculate_num "number of images used to compute metrics"

About

YAI 10th x Alchera : GAN for Wildfire
