# mapping-challenge-mask_rcnn-training
![CrowdAI-Logo](https://github.com/crowdAI/crowdai/raw/master/app/assets/images/misc/crowdai-logo-smile.svg?sanitize=true)

This notebook contains the baseline code for training a vanilla [Mask RCNN](https://arxiv.org/abs/1703.06870) model for the [crowdAI Mapping Challenge](https://www.crowdai.org/challenges/mapping-challenge).

This code is adapted from the Mask RCNN TensorFlow implementation available at [https://github.com/matterport/Mask_RCNN](https://github.com/matterport/Mask_RCNN).

First, we import all the necessary dependencies:

In [1]:
import os
import sys
import time
import numpy as np

# Download and install the Python COCO tools from https://github.com/waleedka/coco
# That's a fork from the original https://github.com/pdollar/coco with a bug
# fix for Python 3.
# I submitted a pull request https://github.com/cocodataset/cocoapi/pull/50
# If the PR is merged then use the original repo.
# Note: Edit PythonAPI/Makefile and replace "python" with "python3".
#  
# A quick one liner to install the library 
# !pip install git+https://github.com/waleedka/coco.git#subdirectory=PythonAPI

from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval
from pycocotools import mask as maskUtils

from mrcnn.evaluate import build_coco_results, evaluate_coco
from mrcnn.dataset import MappingChallengeDataset

import zipfile
import urllib.request
import shutil


## Dataset location 
Download all the files in the datasets section and extract them into the following structure:
```
├── data
│   ├── pretrained_weights.h5 (already included in this repository)
│   ├── test
│   │   ├── images/
│   │   └── annotation.json
│   ├── train
│   │   ├── images/
│   │   └── annotation.json
│   └── val
│       ├── images/
│       └── annotation.json
```
Note that `pretrained_weights.h5` (available at [https://www.crowdai.org/challenges/mapping-challenge/dataset_files](https://www.crowdai.org/challenges/mapping-challenge/dataset_files)) contains the weights used for the baseline submission, obtained by running the learning schedule described later in this notebook. The initial weights used in that experiment can be found [here](https://github.com/matterport/Mask_RCNN/releases/download/v2.1/mask_rcnn_balloon.h5).
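
Before running the rest of the notebook, it can help to confirm that the extracted files match the tree above. The following is a minimal sketch using only the standard library; the helper name `missing_dataset_paths` is hypothetical and not part of this repository, and it assumes the layout is exactly the one shown above.

```python
from pathlib import Path

# Splits taken from the directory tree shown above.
EXPECTED_SPLITS = ("train", "val", "test")


def missing_dataset_paths(data_dir):
    """Return the expected dataset paths that do not exist under `data_dir`.

    Hypothetical helper for illustration; not part of this repository.
    """
    data_dir = Path(data_dir)
    expected = [data_dir / "pretrained_weights.h5"]
    for split in EXPECTED_SPLITS:
        expected.append(data_dir / split / "images")
        expected.append(data_dir / split / "annotation.json")
    # Keep only the paths that are absent on disk.
    return [str(path) for path in expected if not path.exists()]
```

Calling `missing_dataset_paths("data")` from the repository root returns an empty list when everything is in place, and otherwise lists the paths that still need to be downloaded or extracted.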

In [2]:
ROOT_DIR = os.getcwd()

# Import Mask RCNN
sys.path.append(ROOT_DIR)  # To find local version of the library
from mrcnn.config import Config
from mrcnn import model as modellib, utils


PRETRAINED_MODEL_PATH = os.path.join(ROOT_DIR, "data", "pretrained_weights.h5")
LOGS_DIRECTORY = os.path.join(ROOT_DIR, "logs")

Using TensorFlow backend.


## Experiment Configuration

In [3]:
class MappingChallengeConfig(Config):
    """Configuration for training on data in MS COCO format.
    Derives from the base Config class and overrides values specific
    to the COCO dataset.
    """
    # Give the configuration a recognizable name
    NAME = "crowdai-mapping-challenge"

    # Number of images per GPU. The effective batch size is
    # IMAGES_PER_GPU * GPU_COUNT. Adjust down if you use a smaller GPU.
    IMAGES_PER_GPU = 5

    # Train on 8 GPUs (default is 1). Set this to the number of GPUs available.
    GPU_COUNT = 8

    # Number of classes (including background)
    NUM_CLASSES = 1 + 1  # 1 Background + 1 Building

    STEPS_PER_EPOCH=90
    VALIDATION_STEPS=20

    IMAGE_MAX_DIM=320
    IMAGE_MIN_DIM=320
    
    MEAN_PIXEL = [81.16231469, 86.53528546, 64.72005973]

config = MappingChallengeConfig()
config.display()


Configurations:
BACKBONE                       resnet101
BACKBONE_STRIDES               [4, 8, 16, 32, 64]
BATCH_SIZE                     40
BBOX_STD_DEV                   [0.1 0.1 0.2 0.2]
COMPUTE_BACKBONE_SHAPE         None
DETECTION_MAX_INSTANCES        100
DETECTION_MIN_CONFIDENCE       0.7
DETECTION_NMS_THRESHOLD        0.3
FPN_CLASSIF_FC_LAYERS_SIZE     1024
GPU_COUNT                      8
GRADIENT_CLIP_NORM             5.0
IMAGES_PER_GPU                 5
IMAGE_CHANNEL_COUNT            3
IMAGE_MAX_DIM                  320
IMAGE_META_SIZE                14
IMAGE_MIN_DIM                  320
IMAGE_MIN_SCALE                0
IMAGE_RESIZE_MODE              square
IMAGE_SHAPE                    [320 320   3]
LEARNING_MOMENTUM              0.9
LEARNING_RATE                  0.001
LOSS_WEIGHTS                   {'mrcnn_mask_loss': 1.0, 'rpn_bbox_loss': 1.0, 'mrcnn_class_loss': 1.0, 'mrcnn_bbox_loss': 1.0, 'rpn_class_loss': 1.0}
MASK_POOL_SIZE                 14
MASK_SHAPE            
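
The `BATCH_SIZE` of 40 reported above follows from `IMAGES_PER_GPU` and `GPU_COUNT`. As a rough sketch of the arithmetic (values copied from the configuration cell; the variable names below are only for illustration), the number of images consumed per epoch can be estimated as follows, assuming each training and validation step consumes one full batch:

```python
# Values copied from MappingChallengeConfig above.
IMAGES_PER_GPU = 5
GPU_COUNT = 8
STEPS_PER_EPOCH = 90
VALIDATION_STEPS = 20

# Effective batch size; matches BATCH_SIZE in the config.display() output.
batch_size = IMAGES_PER_GPU * GPU_COUNT

# Assumption: every step consumes exactly one batch of images.
train_images_per_epoch = batch_size * STEPS_PER_EPOCH
val_images_per_epoch = batch_size * VALIDATION_STEPS

print(batch_size, train_images_per_epoch, val_images_per_epoch)
# prints: 40 3600 800
```

If you reduce `GPU_COUNT` or `IMAGES_PER_GPU`, you may want to raise `STEPS_PER_EPOCH` so that each epoch still covers a comparable number of images.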

## Instantiate Model

In [4]:
model = modellib.MaskRCNN(mode="training", config=config, model_dir=LOGS_DIRECTORY)
# Load pretrained weights
model_path = PRETRAINED_MODEL_PATH
model.load_weights(model_path, by_name=True)

Instructions for updating:
Colocations handled automatically by placer.


## Load Training and Validation Dataset

In [5]:
# Load training dataset
dataset_train = MappingChallengeDataset()
dataset_train.load_dataset(dataset_dir=os.path.join("data", "train"), load_small=False)
dataset_train.prepare()

# Load validation dataset
dataset_val = MappingChallengeDataset()
val_coco = dataset_val.load_dataset(dataset_dir=os.path.join("data", "val"), load_small=False, return_coco=True)
dataset_val.prepare()

Annotation Path  data/train/annotation.json
Image Dir  data/train/images
loading annotations into memory...
Done (t=2.93s)
creating index...
index created!
Annotation Path  data/val/annotation.json
Image Dir  data/val/images
loading annotations into memory...
Done (t=0.47s)
creating index...
index created!
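
The annotation files are loaded with `pycocotools`, so they are expected to follow the MS COCO layout (`images`, `annotations`, `categories`). A quick way to sanity-check such a file without any extra dependencies is sketched below. The function `summarize_coco` is a hypothetical helper, and the example record is tiny synthetic data, not taken from the challenge dataset; the category id and name in it are placeholders.

```python
import json
from collections import Counter


def summarize_coco(annotation):
    """Summarize a COCO-style annotation dict (already parsed from JSON).

    Hypothetical helper for illustration; not part of this repository.
    """
    # Count how many annotations reference each image id.
    per_image = Counter(ann["image_id"] for ann in annotation["annotations"])
    num_images = len(annotation["images"])
    return {
        "images": num_images,
        "annotations": len(annotation["annotations"]),
        "categories": [cat["name"] for cat in annotation["categories"]],
        "images_without_annotations": num_images - len(per_image),
    }


# Tiny synthetic example, NOT real challenge data.
example = json.loads("""
{
  "images": [{"id": 1, "file_name": "a.jpg", "width": 300, "height": 300},
             {"id": 2, "file_name": "b.jpg", "width": 300, "height": 300}],
  "annotations": [{"id": 10, "image_id": 1, "category_id": 100,
                   "bbox": [0, 0, 10, 10], "area": 100, "iscrowd": 0,
                   "segmentation": [[0, 0, 10, 0, 10, 10, 0, 10]]}],
  "categories": [{"id": 100, "name": "building"}]
}
""")
print(summarize_coco(example))
```

For a real file, replace the synthetic dict with `json.load(open("data/train/annotation.json"))`; note that the full training annotations can take a few seconds to parse.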


## Train

In [6]:
# *** This training schedule is an example. Update to your needs ***

# # Training - Stage 1
# print("Training network heads")
# model.train(dataset_train, dataset_val,
#             learning_rate=config.LEARNING_RATE,
#             epochs=10,
#             layers='heads')

# # Training - Stage 2
# # Finetune layers from ResNet stage 4 and up
# print("Fine tune Resnet stage 4 and up")
# model.train(dataset_train, dataset_val,
#             learning_rate=config.LEARNING_RATE,
#             epochs=10,
#             layers='4+')

# Training - Stage 3
# Fine tune all layers
print("Fine tune all layers")
model.train(dataset_train, dataset_val,
            learning_rate=config.LEARNING_RATE / 10,
            epochs=20,
            layers='all')

Training network heads

Starting at epoch 0. LR=0.001

Checkpoint Path: /home/yoninachmany/crowdai-mapping-challenge-mask-rcnn/logs/crowdai-mapping-challenge20190428T1455/mask_rcnn_crowdai-mapping-challenge_{epoch:04d}.h5
Selecting layers to train
fpn_c5p5               (Conv2D)
fpn_c4p4               (Conv2D)
fpn_c3p3               (Conv2D)
fpn_c2p2               (Conv2D)
fpn_p5                 (Conv2D)
fpn_p2                 (Conv2D)
fpn_p3                 (Conv2D)
fpn_p4                 (Conv2D)
In model:  rpn_model
    rpn_conv_shared        (Conv2D)
    rpn_class_raw          (Conv2D)
    rpn_bbox_pred          (Conv2D)
mrcnn_mask_conv1       (TimeDistributed)
mrcnn_mask_bn1         (TimeDistributed)
mrcnn_mask_conv2       (TimeDistributed)
mrcnn_mask_bn2         (TimeDistributed)
mrcnn_class_conv1      (TimeDistributed)
mrcnn_class_bn1        (TimeDistributed)
mrcnn_mask_conv3       (TimeDistributed)
mrcnn_mask_bn3         (TimeDistributed)
mrcnn_class_conv2      (TimeDistributed

  "Converting sparse IndexedSlices to a dense Tensor of unknown shape. "


Epoch 1/10
Epoch 2/10
Epoch 3/10
Epoch 4/10
Epoch 5/10
Epoch 6/10
Epoch 7/10
Epoch 8/10
Epoch 9/10

Epoch 10/10
 4/90 [>.............................] - ETA: 1:28 - loss: 2.2443 - rpn_class_loss: 0.1604 - rpn_bbox_loss: 0.9683 - mrcnn_class_loss: 0.2980 - mrcnn_bbox_loss: 0.3818 - mrcnn_mask_loss: 0.4358

 6/90 [=>............................] - ETA: 1:24 - loss: 2.2338 - rpn_class_loss: 0.1596 - rpn_bbox_loss: 0.9648 - mrcnn_class_loss: 0.2962 - mrcnn_bbox_loss: 0.3786 - mrcnn_mask_loss: 0.4348

 8/90 [=>............................] - ETA: 1:22 - loss: 2.2251 - rpn_class_loss: 0.1587 - rpn_bbox_loss: 0.9613 - mrcnn_class_loss: 0.2946 - mrcnn_bbox_loss: 0.3764 - mrcnn_mask_loss: 0.4341

 9/90 [==>...........................] - ETA: 1:20 - loss: 2.2221 - rpn_class_loss: 0.1583 - rpn_bbox_loss: 0.9597 - mrcnn_class_loss: 0.2940 - mrcnn_bbox_loss: 0.3760 - mrcnn_mask_loss: 0.4339

10/90 [==>...........................] - ETA: 1:19 - loss: 2.2181 - rpn_class_loss: 0.1580 - rpn_bbox_loss: 0.9583 - mrcnn_class_loss: 0.2939 - mrcnn_bbox_loss: 0.3747 - mrcnn_mask_loss: 0.4332

11/90 [==>...........................] - ETA: 1:18 - loss: 2.2154 - rpn_class_loss: 0.1576 - rpn_bbox_loss: 0.9571 - mrcnn_class_loss: 0.2936 - mrcnn_bbox_loss: 0.3741 - mrcnn_mask_loss: 0.4331

12/90 [===>..........................] - ETA: 1:17 - loss: 2.2117 - rpn_class_loss: 0.1572 - rpn_bbox_loss: 0.9558 - mrcnn_class_loss: 0.2930 - mrcnn_bbox_loss: 0.3729 - mrcnn_mask_loss: 0.4328

13/90 [===>..........................] - ETA: 1:15 - loss: 2.2070 - rpn_class_loss: 0.1568 - rpn_bbox_loss: 0.9543 - mrcnn_class_loss: 0.2922 - mrcnn_bbox_loss: 0.3715 - mrcnn_mask_loss: 0.4323

14/90 [===>..........................] - ETA: 1:14 - loss: 2.2027 - rpn_class_loss: 0.1564 - rpn_bbox_loss: 0.9528 - mrcnn_class_loss: 0.2911 - mrcnn_bbox_loss: 0.3705 - mrcnn_mask_loss: 0.4320

15/90 [====>.........................] - ETA: 1:13 - loss: 2.1999 - rpn_class_loss: 0.1561 - rpn_bbox_loss: 0.9515 - mrcnn_class_loss: 0.2912 - mrcnn_bbox_loss: 0.3695 - mrcnn_mask_loss: 0.4316

16/90 [====>.........................] - ETA: 1:11 - loss: 2.1978 - rpn_class_loss: 0.1557 - rpn_bbox_loss: 0.9505 - mrcnn_class_loss: 0.2903 - mrcnn_bbox_loss: 0.3697 - mrcnn_mask_loss: 0.4316

17/90 [====>.........................] - ETA: 1:10 - loss: 2.1958 - rpn_class_loss: 0.1553 - rpn_bbox_loss: 0.9494 - mrcnn_class_loss: 0.2905 - mrcnn_bbox_loss: 0.3692 - mrcnn_mask_loss: 0.4314

18/90 [=====>........................] - ETA: 1:09 - loss: 2.1935 - rpn_class_loss: 0.1550 - rpn_bbox_loss: 0.9482 - mrcnn_class_loss: 0.2902 - mrcnn_bbox_loss: 0.3690 - mrcnn_mask_loss: 0.4311

19/90 [=====>........................] - ETA: 1:08 - loss: 2.1901 - rpn_class_loss: 0.1546 - rpn_bbox_loss: 0.9470 - mrcnn_class_loss: 0.2898 - mrcnn_bbox_loss: 0.3681 - mrcnn_mask_loss: 0.4306

20/90 [=====>........................] - ETA: 1:07 - loss: 2.1869 - rpn_class_loss: 0.1543 - rpn_bbox_loss: 0.9456 - mrcnn_class_loss: 0.2893 - mrcnn_bbox_loss: 0.3674 - mrcnn_mask_loss: 0.4302

Fine tune Resnet stage 4 and up

Starting at epoch 10. LR=0.001

Checkpoint Path: /home/yoninachmany/crowdai-mapping-challenge-mask-rcnn/logs/crowdai-mapping-challenge20190428T1455/mask_rcnn_crowdai-mapping-challenge_{epoch:04d}.h5
Selecting layers to train
res4a_branch2a         (Conv2D)
bn4a_branch2a          (BatchNorm)
res4a_branch2b         (Conv2D)
bn4a_branch2b          (BatchNorm)
res4a_branch2c         (Conv2D)
res4a_branch1          (Conv2D)
bn4a_branch2c          (BatchNorm)
bn4a_branch1           (BatchNorm)
res4b_branch2a         (Conv2D)
bn4b_branch2a          (BatchNorm)
res4b_branch2b         (Conv2D)
bn4b_branch2b          (BatchNorm)
res4b_branch2c         (Conv2D)
bn4b_branch2c          (BatchNorm)
res4c_branch2a         (Conv2D)
bn4c_branch2a          (BatchNorm)
res4c_branch2b         (Conv2D)
bn4c_branch2b          (BatchNorm)
res4c_branch2c         (Conv2D)
bn4c_branch2c          (BatchNorm)
res4d_branch2a         (Conv2D)
bn4d_branch2a          (BatchNorm)
res4d

Epoch 11/20
Epoch 12/20
Epoch 13/20
Epoch 14/20
Epoch 15/20
Epoch 16/20
Epoch 17/20
Epoch 18/20
Epoch 19/20

Epoch 20/20
 3/90 [>.............................] - ETA: 2:06 - loss: 1.9534 - rpn_class_loss: 0.1279 - rpn_bbox_loss: 0.7961 - mrcnn_class_loss: 0.2910 - mrcnn_bbox_loss: 0.3252 - mrcnn_mask_loss: 0.4133

 4/90 [>.............................] - ETA: 2:08 - loss: 1.9535 - rpn_class_loss: 0.1275 - rpn_bbox_loss: 0.7940 - mrcnn_class_loss: 0.2896 - mrcnn_bbox_loss: 0.3280 - mrcnn_mask_loss: 0.4144

 5/90 [>.............................] - ETA: 2:02 - loss: 1.9528 - rpn_class_loss: 0.1271 - rpn_bbox_loss: 0.7920 - mrcnn_class_loss: 0.2901 - mrcnn_bbox_loss: 0.3291 - mrcnn_mask_loss: 0.4143

 6/90 [=>............................] - ETA: 2:03 - loss: 1.9505 - rpn_class_loss: 0.1268 - rpn_bbox_loss: 0.7901 - mrcnn_class_loss: 0.2893 - mrcnn_bbox_loss: 0.3300 - mrcnn_mask_loss: 0.4143

 7/90 [=>............................] - ETA: 1:59 - loss: 1.9205 - rpn_class_loss: 0.1253 - rpn_bbox_loss: 0.7757 - mrcnn_class_loss: 0.2839 - mrcnn_bbox_loss: 0.3235 - mrcnn_mask_loss: 0.4121

 9/90 [==>...........................] - ETA: 1:54 - loss: 1.9211 - rpn_class_loss: 0.1248 - rpn_bbox_loss: 0.7751 - mrcnn_class_loss: 0.2842 - mrcnn_bbox_loss: 0.3247 - mrcnn_mask_loss: 0.4122

10/90 [==>...........................] - ETA: 1:50 - loss: 1.9204 - rpn_class_loss: 0.1246 - rpn_bbox_loss: 0.7743 - mrcnn_class_loss: 0.2843 - mrcnn_bbox_loss: 0.3248 - mrcnn_mask_loss: 0.4124

11/90 [==>...........................] - ETA: 1:49 - loss: 1.9184 - rpn_class_loss: 0.1243 - rpn_bbox_loss: 0.7735 - mrcnn_class_loss: 0.2841 - mrcnn_bbox_loss: 0.3241 - mrcnn_mask_loss: 0.4124

12/90 [===>..........................] - ETA: 1:46 - loss: 1.9174 - rpn_class_loss: 0.1240 - rpn_bbox_loss: 0.7725 - mrcnn_class_loss: 0.2847 - mrcnn_bbox_loss: 0.3240 - mrcnn_mask_loss: 0.4121

13/90 [===>..........................] - ETA: 1:43 - loss: 1.9167 - rpn_class_loss: 0.1237 - rpn_bbox_loss: 0.7715 - mrcnn_class_loss: 0.2854 - mrcnn_bbox_loss: 0.3241 - mrcnn_mask_loss: 0.4120

14/90 [===>..........................] - ETA: 1:42 - loss: 1.9163 - rpn_class_loss: 0.1234 - rpn_bbox_loss: 0.7705 - mrcnn_class_loss: 0.2861 - mrcnn_bbox_loss: 0.3239 - mrcnn_mask_loss: 0.4123

15/90 [====>.........................] - ETA: 1:40 - loss: 1.9150 - rpn_class_loss: 0.1231 - rpn_bbox_loss: 0.7696 - mrcnn_class_loss: 0.2862 - mrcnn_bbox_loss: 0.3238 - mrcnn_mask_loss: 0.4123

16/90 [====>.........................] - ETA: 1:37 - loss: 1.9133 - rpn_class_loss: 0.1228 - rpn_bbox_loss: 0.7686 - mrcnn_class_loss: 0.2861 - mrcnn_bbox_loss: 0.3235 - mrcnn_mask_loss: 0.4122

17/90 [====>.........................] - ETA: 1:35 - loss: 1.9123 - rpn_class_loss: 0.1225 - rpn_bbox_loss: 0.7677 - mrcnn_class_loss: 0.2867 - mrcnn_bbox_loss: 0.3233 - mrcnn_mask_loss: 0.4121

18/90 [=====>........................] - ETA: 1:32 - loss: 1.9105 - rpn_class_loss: 0.1222 - rpn_bbox_loss: 0.7667 - mrcnn_class_loss: 0.2869 - mrcnn_bbox_loss: 0.3228 - mrcnn_mask_loss: 0.4120

19/90 [=====>........................] - ETA: 1:32 - loss: 1.9257 - rpn_class_loss: 0.1252 - rpn_bbox_loss: 0.7736 - mrcnn_class_loss: 0.2874 - mrcnn_bbox_loss: 0.3261 - mrcnn_mask_loss: 0.4135

20/90 [=====>........................] - ETA: 1:30 - loss: 1.9224 - rpn_class_loss: 0.1247 - rpn_bbox_loss: 0.7721 - mrcnn_class_loss: 0.2868 - mrcnn_bbox_loss: 0.3255 - mrcnn_mask_loss: 0.4133



You can now monitor the training by running:
```
tensorboard --logdir=logs/[path-to-your-experiment-logdir]
```
If everything works, you should see something like:
![loss-plot](images/loss-plot.png)
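
Each run writes to its own timestamped sub-directory of `logs/` (see the checkpoint path in the training output). To avoid typing that path by hand, one option is to pick the most recently modified sub-directory. This is a minimal sketch; `latest_run_dir` is a hypothetical helper, and it assumes the newest directory is the run you want to inspect.

```python
from pathlib import Path


def latest_run_dir(logs_dir):
    """Return the most recently modified sub-directory of `logs_dir`, or None.

    Hypothetical helper for illustration; not part of this repository.
    Raises FileNotFoundError if `logs_dir` itself does not exist.
    """
    run_dirs = [path for path in Path(logs_dir).iterdir() if path.is_dir()]
    if not run_dirs:
        return None
    # Compare by modification time; the newest directory wins.
    return max(run_dirs, key=lambda path: path.stat().st_mtime)
```

The result of `latest_run_dir("logs")` can then be passed to `tensorboard --logdir=...`.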

# Author
Sharada Mohanty [sharada.mohanty@epfl.ch](mailto:sharada.mohanty@epfl.ch)