GitHub - alec-ng/fully-convolutional-network-semantic-segmentation: Implementation of the paper Fully Convolutional Networks for Semantic Segmentation, used to segment a person from a background in an RGB image

This repository is an implementation of the paper Fully Convolutional Networks for Semantic Segmentation, used to segment humans from a single RGB image.

Tested with...

Ubuntu 16.04.01 LTS running on VirtualBox with 5551 MB Memory, 2 Processor Cores, 35 GB Storage, Hyper-V acceleration
Intel i7-4510U CPU @ 2.00 GHz 2.60 GHz
Caffe 1.0.0-rc3
Python 3.5.3
OpenCV 3.1.0

Algorithm

Preprocess the test image by downsizing it to a maximum of 750px (a hardware limitation) and applying CLAHE histogram equalization
Run test image through the FCN network to obtain a prediction numpy array, corresponding to a pixel-wise prediction of classes
Iterate through the prediction array and original test image together.
- Using the VOC 2012 class definitions, if the prediction array contains a value that is not classified as 'PERSON', white out all 3 colour channels on the original image ("discarding" it)
Compute precision, recall, and F1 scores on the final segmented image by comparing it to a ground truth image
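
The masking and scoring steps above can be sketched roughly as follows. This is a minimal illustration, not the code in eval.py: the function names `keep_person` and `precision_recall_f1` are hypothetical, the person label is assumed to be index 15 (its position in the standard 21-class VOC 2012 list, where 0 is background), and the scoring assumes both the prediction and the ground truth have already been reduced to boolean "person" masks.

```python
import numpy as np

# Assumption: 'person' is index 15 in the 21-class VOC 2012 label list.
PERSON = 15


def keep_person(image, prediction, person_label=PERSON):
    """Return a copy of `image` with every non-person pixel set to white.

    image:      H x W x 3 uint8 array
    prediction: H x W array of per-pixel class indices
    """
    out = image.copy()
    # The boolean mask selects pixels; all 3 channels of each are set to 255.
    out[prediction != person_label] = 255
    return out


def precision_recall_f1(pred_mask, truth_mask):
    """Pixel-wise precision, recall and F1 for two boolean person masks."""
    pred = np.asarray(pred_mask, dtype=bool)
    truth = np.asarray(truth_mask, dtype=bool)
    tp = np.count_nonzero(pred & truth)    # person predicted, person in truth
    fp = np.count_nonzero(pred & ~truth)   # person predicted, background in truth
    fn = np.count_nonzero(~pred & truth)   # background predicted, person in truth
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1
```

Using a boolean mask instead of an explicit pixel loop gives the same result as iterating over both arrays together, and is usually much faster in numpy.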

The above system specifications were not powerful enough to handle input images larger than roughly 750x750px. Larger images produced errors such as the following:
Check failed: *ptr host allocation of size xxxxx failed : check memory availability of system
blob size exceeds INT_MAX : limitation of HDF5DataLayer. Possibly caused by image dimensions that are too large for the net to handle
killed : most probable cause is system running out of RAM or GPU memory. Reducing batch size may be a fix, or increasing memory / computation resource of system, or rescaling image to be smaller
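
One way to stay under such a limit is to compute a target size whose longest side does not exceed the clip size while preserving the aspect ratio. The helper below is a sketch under that assumption; the name `clipped_size` and the default of 750 are illustrative and are not taken from eval.py.

```python
def clipped_size(width, height, max_side=750):
    """Return (width, height) scaled so the longest side is at most max_side.

    Images that already fit are returned unchanged (never upscaled).
    """
    longest = max(width, height)
    if longest <= max_side:
        return width, height
    scale = max_side / longest
    # Round to whole pixels and keep each side at least 1px.
    return max(1, round(width * scale)), max(1, round(height * scale))


# The result can be passed to an image library's resize call, for example
# cv2.resize(image, (new_w, new_h), interpolation=cv2.INTER_AREA) in OpenCV.
new_w, new_h = clipped_size(1500, 1000)
```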

In src/, download the .caffemodel file from the URL included in src/caffemodel-url. Place the .caffemodel file in src/
Sequentially label your test images and put them in images/test
Put your corresponding truth images for your test images in images/truth
- The filenames for the test and truth must be identical
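
Because each test image is matched to its truth image by filename, it can help to check the two folders before running the evaluation. The following is a hypothetical helper, not part of this repository; `unmatched_images` is an illustrative name, and it assumes both folders contain only image files.

```python
from pathlib import Path


def unmatched_images(test_dir, truth_dir):
    """Return (test files without a truth image, truth files without a test image)."""
    test_names = {p.name for p in Path(test_dir).iterdir() if p.is_file()}
    truth_names = {p.name for p in Path(truth_dir).iterdir() if p.is_file()}
    return sorted(test_names - truth_names), sorted(truth_names - test_names)
```

For example, `unmatched_images("images/test", "images/truth")` returns two empty lists when every test image has a truth image with the same filename.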

Run python eval.py /images [clipSize]

Output:
