Semantic segmentation sample

This sample demonstrates DL model compression capabilities for semantic segmentation problem

Features:

UNet and ICNet with implementations as close as possible to the original papers
Loaders for CamVid, Cityscapes (20-class), Mapillary Vistas(20-class), Pascal VOC (reuses the loader integrated into torchvision)
Configuration file examples for sparsity, quantization, filter pruning and quantization with sparsity
Export to ONNX compatible with OpenVINO
DataParallel and DistributedDataParallel modes
Tensorboard output

Installation

At this point it is assumed that you have already installed nncf. You can find information on downloading nncf here.

To work with the sample you should install the corresponding Python package dependencies:

pip install -r examples/torch/requirements.txt

Quantize FP32 pretrained model

This scenario demonstrates quantization with fine-tuning of UNet on Mapillary Vistas dataset.

Dataset preparation

Obtain a copy of Mapillary Vistas train/val data here

Run semantic segmentation sample

If you did not install the package then add the repository root folder to the PYTHONPATH environment variable
Navigate to the examples/torch/segmentation folder

(Optional) Before compressing a model, it is highly recommended checking the accuracy of the pretrained model, use the following command:

python main.py \
--mode=test \
--config=configs/unet_mapillary_int8.json \
--weights=<path_to_fp32_model_checkpoint> \
--data=<path_to_dataset> \
--batch-size=1 \
--disable-compression

Run the following command to start compression with fine-tuning on GPUs: python main.py -m train --config configs/unet_mapillary_int8.json --data <path_to_dataset> --weights <path_to_fp32_model_checkpoint>

It may take a few epochs to get the baseline accuracy results.

Use --multiprocessing-distributed flag to run in the distributed mode.
Use --resume flag with the path to a model from the previous experiment to resume training.
Use -b <number> option to specify the total batch size across GPUs
Use the --weights flag with the path to a compatible PyTorch checkpoint in order to load all matching weights from the checkpoint into the model - useful if you need to start compression-aware training from a previously trained uncompressed (FP32) checkpoint instead of performing compression-aware training fr om scratch.

Validate your model checkpoint

To estimate the test scores of your trained model checkpoint use the following command: python main.py -m test --config=configs/unet_mapillary_int8.json --resume <path_to_trained_model_checkpoint> If you want to validate an FP32 model checkpoint, make sure the compression algorithm settings are empty in the configuration file or pretrained=True is set.

WARNING: The samples use torch.load functionality for checkpoint loading which, in turn, uses pickle facilities by default which are known to be vulnerable to arbitrary code execution attacks. Only load the data you trust

Export compressed model

To export trained model to ONNX format use the following command: python main.py --mode export --config configs/unet_mapillary_int8.json --data <path_to_dataset> --resume <path_to_compressed_model_checkpoint> --to-onnx unet_int8.onnx

Export to OpenVINO Intermediate Representation (IR)

To export a model to OpenVINO IR and run it using Intel Deep Learning Deployment Toolkit please refer to this tutorial.

Results

Model	Compression algorithm	Dataset	Accuracy (Drop) %	NNCF config file	PyTorch checkpoint
UNet	None	CamVid	71.95	unet_camvid.json	Link
UNet	INT8	CamVid	71.8 (0.15)	unet_camvid_int8.json	Link
UNet	INT8 + Sparsity 60% (Magnitude)	CamVid	72.03 (-0.08)	unet_camvid_magnitude_sparsity_int8.json	Link
ICNet	None	CamVid	67.89	icnet_camvid.json	Link
ICNet	INT8	CamVid	67.86 (0.03)	icnet_camvid_int8.json	Link
ICNet	INT8 + Sparsity 60% (Magnitude)	CamVid	67.18 (0.71)	icnet_camvid_magnitude_sparsity_int8.json	Link
UNet	None	Mapillary	56.23	unet_mapillary.json	Link
UNet	INT8	Mapillary	55.87 (0.36)	unet_mapillary_int8.json	Link
UNet	INT8 + Sparsity 60% (Magnitude)	Mapillary	55.65 (0.58)	unet_mapillary_magnitude_sparsity_int8.json	Link

Results for filter pruning

Model	Compression algorithm	Dataset	Accuracy (Drop) %	GFLOPS	MParams	NNCF config file	PyTorch checkpoint
UNet	None	Mapillary	56.23	875.0 (100%)	31.0 (100%)	Link	Link
UNet	Filter pruning 25%, geometric median criterion	Mapillary	55.62 (0.61)	589.4 (67.36%)	18.6 (60.00%)	Link	Link

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Semantic segmentation sample

Features:

Installation

Quantize FP32 pretrained model

Dataset preparation

Run semantic segmentation sample

Validate your model checkpoint

Export compressed model

Export to OpenVINO Intermediate Representation (IR)

Results

Results for filter pruning

Files

README.md

Latest commit

History

README.md

File metadata and controls

Semantic segmentation sample

Features:

Installation

Quantize FP32 pretrained model

Dataset preparation

Run semantic segmentation sample

Validate your model checkpoint

Export compressed model

Export to OpenVINO Intermediate Representation (IR)

Results

Results for filter pruning