
Road segmentation - ML Project 2

Model used

To determine which parts of an image belong to the road class and which to the background class, a 4-layer convolutional neural network (CNN) is used.

The square input images (400 px wide) are split into square patches 16 px wide. Each patch is analyzed together with its context, that is, 16 px of neighboring pixels on each side of the patch, so the total window considered is 48x48 px.
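The patch arithmetic above can be illustrated with a minimal sketch. It only computes window coordinates and is not the project's own code: the function name `patch_windows` is hypothetical, and padding the image by the context size on every side (so that border patches also get a full 48x48 window) is an assumption, since the border handling is not described here.

```python
IMG_SIZE = 400            # width of the square input image, in pixels
IMG_PATCH_SIZE = 16       # width of the patch being classified
PATCH_CONTEXT_SIZE = 16   # context added on each side of the patch


def patch_windows(img_size, patch_size, context_size):
    """Yield (row, col, top, left, bottom, right) for every patch.

    Bounds are given in the coordinates of an image padded by
    `context_size` pixels on every side (an assumption of this sketch),
    so each window has width patch_size + 2 * context_size.
    """
    window = patch_size + 2 * context_size
    for top in range(0, img_size, patch_size):
        for left in range(0, img_size, patch_size):
            row, col = top // patch_size, left // patch_size
            yield (row, col, top, left, top + window, left + window)


windows = list(patch_windows(IMG_SIZE, IMG_PATCH_SIZE, PATCH_CONTEXT_SIZE))
print(len(windows))   # 625 patches per image (25 x 25)
print(windows[0])     # (0, 0, 0, 0, 48, 48)
```

With these values, each 400 px image yields a 25 x 25 grid of patches, and every patch is classified from a 48x48 px window.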

The various settings can be selected in the file, and their meaning is explained below.

The model is defined in the module, and can be trained using the module. The computations are performed with TensorFlow.

Before anything else, the data (the training and test_set_images folders) must be placed in the data folder at the root of the repository.
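As a quick sanity check before training, the expected layout can be verified with a few lines of Python. This is a hedged sketch rather than part of the project: the helper `missing_data_folders` is hypothetical, and it only assumes the two folder names mentioned above.

```python
from pathlib import Path

# Folder names taken from the description above.
REQUIRED_FOLDERS = ("training", "test_set_images")


def missing_data_folders(root="."):
    """Return the required folders that are absent from <root>/data."""
    data_dir = Path(root) / "data"
    return [name for name in REQUIRED_FOLDERS if not (data_dir / name).is_dir()]


missing = missing_data_folders()
if missing:
    print("Missing in data/:", ", ".join(missing))
else:
    print("Data folders found.")
```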

Model settings

These parameters can be adjusted as needed in the file (only those that require an explanation are listed in the table):

| Variable name | Effect |
| --- | --- |
| `NUM_CHANNELS` | Number of channels of the input images (e.g. 3 for RGB) |
| `PIXEL_DEPTH` | Number of colour levels of a pixel |
| `TEST_PROP` | Proportion of the dataset used to test the model |
| `SEED` | Seed used for the random number generators |
| `BATCH_SIZE` | Number of patches processed at once by TensorFlow |
| `NUM_EPOCHS` | Number of times the whole dataset is processed by the learning algorithm |
| `ADAM_INITIAL_LEARNING_RATE` | Initial learning rate of the Adam optimizer |
| `USE_DROPOUT` | If enabled, a random subset of the neurons is deactivated (and therefore not updated) during each training iteration |
| `USE_L2_REGULARIZATION` | Whether to use L2 regularization |
| `USE_LEAKY_RELU` | Whether to use a leaky rectified linear unit (leaky ReLU) activation function |
| `DROPOUT_KEEP_RATE` | Proportion of the neurons that are not "dropped out". Used in conjunction with the `USE_DROPOUT` setting |
| `IMG_PATCH_SIZE` | Width of an image patch |
| `PATCH_CONTEXT_SIZE` | Width of the band of neighboring pixels considered together with the image patch |
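To make the effect of `USE_LEAKY_RELU`, `USE_DROPOUT` and `DROPOUT_KEEP_RATE` more concrete, here is a minimal pure-Python sketch of the two techniques. It is illustrative only and does not reproduce the project's TensorFlow code: the values of `DROPOUT_KEEP_RATE` and `LEAKY_SLOPE` are made up, `LEAKY_SLOPE` is a hypothetical name, and the dropout variant shown (survivors rescaled by `1 / keep_rate`, often called inverted dropout) is an assumption.

```python
import random

USE_LEAKY_RELU = True
USE_DROPOUT = True
DROPOUT_KEEP_RATE = 0.8   # illustrative value, not taken from the project
LEAKY_SLOPE = 0.1         # hypothetical constant for the negative slope


def activation(x, leaky=USE_LEAKY_RELU, slope=LEAKY_SLOPE):
    """ReLU, or leaky ReLU when `leaky` is true."""
    if x > 0:
        return x
    return slope * x if leaky else 0.0


def dropout(values, keep_rate, rng):
    """Keep each value with probability `keep_rate`, zero the others.

    Kept values are divided by `keep_rate` so the expected sum is unchanged.
    """
    return [v / keep_rate if rng.random() < keep_rate else 0.0 for v in values]


rng = random.Random(0)  # seeded, in the spirit of the SEED setting
pre_activations = [-2.0, -0.5, 0.0, 1.0, 3.0]
outputs = [activation(x) for x in pre_activations]
if USE_DROPOUT:
    outputs = dropout(outputs, DROPOUT_KEEP_RATE, rng)
print(outputs)
```

Dropout is only applied during training; at evaluation time all neurons are kept.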

Training the model

Because training involves updating millions of parameters, we used remote Amazon AWS instances, which gave us access to Nvidia Tesla K80 GPUs and sped up the model training.

Before training, the data must be serialized and partitioned into training, test and validation sets (our best model uses 100% of the data). To do this, run `python -m utils.dataset_partitioner` from the root of the repository. To train the model, use: python
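The idea of a reproducible partition can be sketched as follows. This is not the code of `utils.dataset_partitioner`, whose internals are not described here: the function `partition` and the `val_prop` parameter are hypothetical, and serialization is left out. Only the roles of a test proportion and a seed are taken from the settings table.

```python
import random


def partition(items, test_prop, val_prop, seed):
    """Shuffle `items` with a fixed seed and split into train/val/test."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_prop)
    n_val = int(len(shuffled) * val_prop)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test


image_ids = list(range(100))  # stand-in for the training image indices
train, val, test = partition(image_ids, test_prop=0.2, val_prop=0.1, seed=42)
print(len(train), len(val), len(test))  # 70 10 20
```

Setting both proportions to 0 puts every sample in the training set, which corresponds to training on 100% of the data.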

Obtain the Submission

To obtain the submission, use: python. This runs the best model obtained, cnnb-full-175_model (trained for 175 epochs), on the images in test_set_images. The model is located in the models folder, and the submission file is created in the submissions folder.
