Semantic Segmentation

This is a sub-project of pytorch-cv (split out for convenience).

Supported models:

  • FCN
  • PSPNet
  • DeepLabv3
  • DANet
  • OCNet

Environment

  • PyTorch 1.1

Performance

Pascal VOC 2012

Here we use the train (10582), val (1449), and test (1456) splits, as most papers do (see DeepLabv3 for more detail). Performance is evaluated at a single scale.

  • Base Size 540, Crop Size 480
| Model | backbone | Paper | OHEM | aux | dilated | JPU | Epoch | val (crop) | val |
|---|---|---|---|---|---|---|---|---|---|
| FCN | ResNet101-v1s | / | | | | | 50 | 94.54/78.31 | 94.50/76.89 |
| PSPNet | ResNet101-v1s | / | | | | | 50 | 94.87/80.13 | 94.88/78.57 |
| PSPNet | ResNet101-v1s | / | | | | | 50 | 94.89/79.90 | 94.77/78.48 |
| DeepLabv3 | ResNet101-v1s | no/77.02 | | | | | 50 | 95.17/81.00 | 94.81/78.75 |
| DANet | ResNet101-v1s | / | | | | | 50 | 94.98/80.49 | 94.85/78.72 |
| OCNet-Base | ResNet101-v1s | / | | | | | 50 | 94.91/80.33 | 94.86/79.07 |
| OCNet-ASP | ResNet101-v1s | / | | | | | 50 | | |
  1. The metric is pixAcc/mIoU (a sketch of the computation follows the notes).
  2. aux_weight=0.5.
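
For reference, a minimal sketch of how pixAcc and mIoU are typically computed from a confusion matrix (the function names here are illustrative, not this repo's actual API):

import numpy as np

def confusion_matrix(pred, label, num_classes, ignore_index=-1):
    # Accumulate a (num_classes x num_classes) histogram of (label, pred)
    # pairs, skipping ignored pixels.
    mask = label != ignore_index
    return np.bincount(
        num_classes * label[mask].astype(int) + pred[mask].astype(int),
        minlength=num_classes ** 2,
    ).reshape(num_classes, num_classes)

def pix_acc_miou(hist):
    # pixAcc: correctly classified pixels over all labeled pixels.
    pix_acc = np.diag(hist).sum() / hist.sum()
    # mIoU: per-class intersection over union, averaged over classes.
    iou = np.diag(hist) / (hist.sum(1) + hist.sum(0) - np.diag(hist))
    return pix_acc, np.nanmean(iou)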

Cityscapes

Here we only use the fine train (2975) and val (500) splits, as most papers do (see DeepLabv3 for more detail). Performance is evaluated at a single scale.

  • Base Size 1024, Crop Size 768
| Model | backbone | Paper(*) | OHEM | aux | dilated | JPU | Epoch | val (crop) | val |
|---|---|---|---|---|---|---|---|---|---|
| FCN | ResNet101-v1s | no/75.96 | | | | | 120 | 96.29/73.60 | 96.18/78.61 |
| PSPNet | ResNet101-v1s | no/78.56 | | | | | 120 | 96.21/73.64 | 96.09/78.62 |
| DeepLabv3 | ResNet101-v1s | no/78.90 | | | | | 120 | 96.25/73.44 | 96.23/79.03 |
| DANet | ResNet101-v1s | no/78.83 | | | | | | | |
| OCNet-Base | ResNet101-v1s | no/79.67 | | | | | 120 | 96.30/74.18 | TODO |
| OCNet-ASP | ResNet101-v1s | | | | | | | | |

Note:

  1. Paper(*) means results taken from openseg.pytorch (single-scale results without crop); the training strategy there differs slightly from ours.

Demo

Run segmentation on a given image. (Please download the pre-trained model to ~/.torch/models first; if you put the pre-trained model in another folder, change --root accordingly.)

$ python demo_segmentation_pil.py [--model fcn_resnet101_voc] [--input-pic <image>.jpg] [--cuda true] [--aux true] [--jpu true] [--dilated false]

Note:

  1. If --input-pic is not given, the default image we provide is used.
  2. Whether to set aux, jpu, and dilated depends on your model (see the sketch after these notes).
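
Roughly, the demo loads a pre-trained network, normalizes the image with ImageNet statistics, and takes a per-pixel argmax over the class scores. A minimal sketch, using torchvision's fcn_resnet101 as a stand-in for this repo's model loader:

import torch
from PIL import Image
from torchvision import transforms
from torchvision.models.segmentation import fcn_resnet101

transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])
img = transform(Image.open('example.jpg').convert('RGB')).unsqueeze(0)

model = fcn_resnet101(pretrained=True).eval()  # stand-in for this repo's models
with torch.no_grad():
    scores = model(img)['out']               # (1, num_classes, H, W) class scores
    pred = scores.argmax(dim=1).squeeze(0)   # (H, W) predicted class per pixel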

Evaluation

The default data root is ~/.torch/datasets (you can download the datasets elsewhere and make a soft link there).

$ python eval_segmentation_pil.py [--model_name fcn_resnet101_voc] [--dataset pascal_paper] [--split val] [--mode testval|val] [--base-size 540] [--crop-size 480] [--aux true] [--jpu true] [--dilated false] [--cuda true]

Note:

  1. If you choose mode=testval, you cannot set base-size and crop-size (see the sketch after these notes).
  2. Whether to set aux, jpu, and dilated depends on your model.
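
The difference between the two modes, roughly: val resizes the shorter side to base-size and center-crops to crop-size, while testval scores each image at its original resolution, which is why base-size and crop-size do not apply there. A small sketch of the val-style preprocessing (these exact transforms are an assumption, not pulled from the repo):

from PIL import Image
from torchvision import transforms

base_size, crop_size = 540, 480

# 'val' mode: shorter side resized to base_size, then a center crop of crop_size.
val_transform = transforms.Compose([
    transforms.Resize(base_size),
    transforms.CenterCrop(crop_size),
])

img = Image.open('example.jpg').convert('RGB')
val_input = val_transform(img)   # fixed crop_size x crop_size input
testval_input = img              # 'testval' mode: original resolution, no resize/crop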

Train

Download the pre-trained backbone and put it in ~/.torch/models.

We recommend using distributed training.

$ export NGPUS=4
$ python -m torch.distributed.launch --nproc_per_node=$NGPUS train_segmentation_pil.py [--model fcn] [--backbone resnet101] [--dataset pascal_voc] [--batch-size 8] [--base-size 540] [--crop-size 480] [--aux true] [--jpu true] [--dilated false] [--log-step 10]

The exact settings behind our training results can be found in train.sh; a sketch of the distributed setup follows.
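
Under the hood this is the standard torch.distributed recipe: one process per GPU, each wrapping the model in DistributedDataParallel and sharding the data with a DistributedSampler. A minimal sketch (build_model and train_dataset are placeholders for this repo's factories, not its real names):

import argparse
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel
from torch.utils.data import DataLoader
from torch.utils.data.distributed import DistributedSampler

parser = argparse.ArgumentParser()
parser.add_argument('--local_rank', type=int, default=0)  # set by torch.distributed.launch
args = parser.parse_args()

torch.cuda.set_device(args.local_rank)
dist.init_process_group(backend='nccl', init_method='env://')

model = build_model().cuda()                  # placeholder for this repo's model factory
model = DistributedDataParallel(model, device_ids=[args.local_rank])

sampler = DistributedSampler(train_dataset)   # train_dataset assumed defined
loader = DataLoader(train_dataset, batch_size=8, sampler=sampler)

for epoch in range(50):
    sampler.set_epoch(epoch)  # reshuffle each process's shard every epoch
    # ... forward/backward as usual; with an aux head the total loss is
    # main_loss + 0.5 * aux_loss (aux_weight=0.5, per the note above).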

Prepare data

VOC2012

mkdir data
cd data
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar
tar -xf VOCtrainval_11-May-2012.tar
cd VOCdevkit/VOC2012/
wget http://cs.jhu.edu/~cxliu/data/SegmentationClassAug.zip
wget http://cs.jhu.edu/~cxliu/data/SegmentationClassAug_Visualization.zip
wget http://cs.jhu.edu/~cxliu/data/list.zip
unzip SegmentationClassAug.zip
unzip SegmentationClassAug_Visualization.zip
unzip list.zip

You can then make a soft link to ~/.torch/datasets/voc

Cityscapes

Download leftImg8bit_trainvaltest.zip and gtFine_trainvaltest.zip from the Cityscapes website first, then:

unzip leftImg8bit_trainvaltest.zip
unzip gtFine_trainvaltest.zip
git clone https://github.com/mcordts/cityscapesScripts.git
mv cityscapesScripts/cityscapesscripts ./

You can then make a soft link to ~/.torch/datasets/citys

Download

Backbone

| resnet50-v1s | resnet101-v1s |
|---|---|
| GoogleDrive | GoogleDrive |
