GitHub - FreakTheMighty/fcn.berkeleyvision.org: Fully Convolutional Networks for Semantic Segmentation by Jonathan Long*, Evan Shelhamer*, and Trevor Darrell. CVPR 2015.

This is the reference implementation of the models and code for the fully convolutional networks (FCNs) in the PAMI FCN and CVPR FCN papers:

Fully Convolutional Models for Semantic Segmentation
Evan Shelhamer*, Jonathan Long*, Trevor Darrell
PAMI (accepted May, 2016)
arXiv:1605.06211

Fully Convolutional Models for Semantic Segmentation
Jonathan Long*, Evan Shelhamer*, Trevor Darrell
CVPR 2015
arXiv:1411.4038

Note that this is a work in progress and the final, reference version is coming soon. Please ask Caffe and FCN usage questions on the caffe-users mailing list.

These models are compatible with BVLC/caffe:master @ 8c66fa5 with the merge of PRs BVLC/caffe#3613 and BVLC/caffe#3570. The code and models here are available under the same license as Caffe (BSD-2) and the Caffe-bundled models (that is, unrestricted use; see the BVLC model license).

PASCAL VOC models: trained online with high momentum for a ~5 point boost in mean intersection-over-union over the original models. These models are trained using extra data from Hariharan et al., but excluding SBD val. FCN-32s is fine-tuned from the ILSVRC-trained VGG-16 model, and the finer strides are then fine-tuned in turn. The "at-once" FCN-8s is fine-tuned from VGG-16 all-at-once by scaling the skip connections to better condition optimization.

FCN-32s PASCAL: single stream, 32 pixel prediction stride net, scoring 63.6 mIU on seg11valid
FCN-16s PASCAL: two stream, 16 pixel prediction stride net, scoring 65.0 mIU on seg11valid
FCN-8s PASCAL: three stream, 8 pixel prediction stride net, scoring 65.5 mIU on seg11valid and 67.2 mIU on seg12test
FCN-8s PASCAL at-once: all-at-once, three stream, 8 pixel prediction stride net, scoring 65.4 mIU on seg11valid

FCN-AlexNet PASCAL: AlexNet (CaffeNet) architecture, single stream, 32 pixel prediction stride net, scoring 48.0 mIU on seg11valid. Unlike the FCN-32/16/8s models, this network is trained with gradient accumulation, normalized loss, and standard momentum. (Note: when both FCN-32s/FCN-VGG16 and FCN-AlexNet are trained in this same way FCN-VGG16 is far better; see Table 1 of the paper.)

To reproduce the validation scores, use the seg11valid split defined by the paper in footnote 7. Since SBD train and PASCAL VOC 2011 segval intersect, we only evaluate on the non-intersecting set for validation purposes.

NYUDv2 models: trained online with high momentum on color, depth, and HHA features (from Gupta et al. https://github.com/s-gupta/rcnn-depth). These models demonstrate FCNs for multi-modal input.

FCN-32s NYUDv2 Color: single stream, 32 pixel prediction stride net on color/BGR input
FCN-32s NYUDv2 HHA: single stream, 32 pixel prediction stride net on HHA input
FCN-32s NYUDv2 Early Color-Depth: single stream, 32 pixel prediction stride net on early fusion of color and (log) depth for 4-channel input
FCN-32s NYUDv2 Late Color-HHA: single stream, 32 pixel prediction stride net by late fusion of FCN-32s NYUDv2 Color and FCN-32s NYUDv2 HHA

SIFT Flow models: trained online with high momentum for joint semantic class and geometric class segmentation. These models demonstrate FCNs for multi-task output.

FCN-32s SIFT Flow: single stream stream, 32 pixel prediction stride net
FCN-16s SIFT Flow: two stream, 16 pixel prediction stride net
FCN-8s SIFT Flow: three stream, 8 pixel prediction stride net

Note: in this release, the evaluation of the semantic classes is not quite right at the moment due to an issue with missing classes. This will be corrected soon. The evaluation of the geometric classes is fine.

PASCAL-Context models: trained online with high momentum on an object and scene labeling of PASCAL VOC.

FCN-32s PASCAL-Context: single stream, 32 pixel prediction stride net
FCN-16s PASCAL-Context: two stream, 16 pixel prediction stride net
FCN-8s PASCAL-Context: three stream, 8 pixel prediction stride net

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
data		data
nyud-fcn32s-color-d		nyud-fcn32s-color-d
nyud-fcn32s-color-hha		nyud-fcn32s-color-hha
nyud-fcn32s-color		nyud-fcn32s-color
nyud-fcn32s-hha		nyud-fcn32s-hha
pascalcontext-fcn16s		pascalcontext-fcn16s
pascalcontext-fcn32s		pascalcontext-fcn32s
pascalcontext-fcn8s		pascalcontext-fcn8s
siftflow-fcn16s		siftflow-fcn16s
siftflow-fcn32s		siftflow-fcn32s
siftflow-fcn8s		siftflow-fcn8s
voc-fcn-alexnet		voc-fcn-alexnet
voc-fcn16s		voc-fcn16s
voc-fcn32s		voc-fcn32s
voc-fcn8s-atonce		voc-fcn8s-atonce
voc-fcn8s		voc-fcn8s
README.md		README.md
infer.py		infer.py
nyud_layers.py		nyud_layers.py
pascalcontext_layers.py		pascalcontext_layers.py
score.py		score.py
siftflow_layers.py		siftflow_layers.py
surgery.py		surgery.py
voc_layers.py		voc_layers.py

FreakTheMighty/fcn.berkeleyvision.org

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages