Improve semantic segmentation models #14355

NielsRogge · 2021-11-10T12:20:52Z

What does this PR do?

add SegFormer documentation (including a figure)
fix padding (fixes SegformerFeatureExtractor trying to access non-existent .ndim attribute #14332) for SegFormer
add attribute to the configuration of SegFormer and BEiT called semantic_ignore_index, which defaults to 255. The loss functions of semantic segmentation models typically use 255 instead of -100. The reason for this is that some datasets include a 0 in the annotated segmentation maps to indicate "background", however it can be that background is not included in any of the labels of the dataset. e.g. ADE20k has 150 labels, but "background" is not included. Therefore, one reduces the labels of all segmentation maps by 1 value, and replaces the 0 by 255 as shown here. It's only after that that images get resized using PIL. However, if we replace values by -100, PIL can't read these images, and will thrown an error.
add option to pass segmentation_maps to BeitFeatureExtractor, and add corresponding tests.

To do:
SegformerFeatureExtractor currently includes the align, do_random_crop and do_pad arguments at initialization, however I wonder whether it's maybe better to remove those, and only include the bare minimum in the feature extractors (similar to ViTFeatureExtractor) to get started (i.e. resizing, center cropping, normalizing). Things like random cropping and padding is maybe already a bit too much and makes the feature extractors more complex. It's also not easy to determine good default values for this feature extractor; should it randomly crop + pad by default, or not?

remove random cropping and padding from SegformerFeatureExtractor, if one agrees on this.
make tests of SegformerFeatureExtractor and BeitFeatureExtractor consistent.
update the preprocessor_config.json of the semantic segmentation models on the hub.

sgugger

Thanks for adding this, I left w few comments on the names.

src/transformers/models/segformer/configuration_segformer.py

sgugger · 2021-11-11T01:02:25Z

tests/test_feature_extraction_beit.py

@@ -76,6 +77,26 @@ def prepare_feat_extract_dict(self):
        }


+def prepare_semantic_single_inputs():
+    ds = load_dataset("hf-internal-testing/fixtures_ade20k", split="test")


Let's give a better name than just ds, we try to avoid such short names in the codebase.

src/transformers/models/beit/feature_extraction_beit.py

src/transformers/models/beit/configuration_beit.py

LysandreJik

Cool, thanks! Agree with both of you to keep the data augmentation part out of the feature extractor. LGTM!

docs/source/model_doc/segformer.rst

* Improve tests * Improve documentation * Add ignore_index attribute * Add semantic_ignore_index to BEiT model * Add segmentation maps argument to BEiTFeatureExtractor * Simplify SegformerFeatureExtractor and corresponding tests * Improve tests * Apply suggestions from code review * Minor docs improvements * Streamline segmentation map tests of SegFormer and BEiT * Improve reduce_labels docs and test * Fix code quality * Fix code quality again

NielsRogge requested review from sgugger and LysandreJik November 10, 2021 16:53

sgugger approved these changes Nov 11, 2021

View reviewed changes

LysandreJik approved these changes Nov 17, 2021

View reviewed changes

docs/source/model_doc/segformer.rst Show resolved Hide resolved

NielsRogge added 13 commits November 17, 2021 15:12

Improve tests

10ba8d9

Improve documentation

6b4155e

Add ignore_index attribute

fd62f60

Add semantic_ignore_index to BEiT model

fee4e50

Add segmentation maps argument to BEiTFeatureExtractor

80aa6e2

Simplify SegformerFeatureExtractor and corresponding tests

c35a132

Improve tests

a5b1944

Apply suggestions from code review

855061d

Minor docs improvements

62d4e0d

Streamline segmentation map tests of SegFormer and BEiT

14d8209

Improve reduce_labels docs and test

cf6df7f

Fix code quality

4126a6a

Fix code quality again

ed14f3e

NielsRogge force-pushed the fix_segformer branch from 36cd4fa to ed14f3e Compare November 17, 2021 14:12

NielsRogge merged commit a2864a5 into huggingface:master Nov 17, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve semantic segmentation models #14355

Improve semantic segmentation models #14355

NielsRogge commented Nov 10, 2021 •

edited

Loading

sgugger left a comment

sgugger Nov 11, 2021

LysandreJik left a comment

Improve semantic segmentation models #14355

Improve semantic segmentation models #14355

Conversation

NielsRogge commented Nov 10, 2021 • edited Loading

What does this PR do?

sgugger left a comment

Choose a reason for hiding this comment

sgugger Nov 11, 2021

Choose a reason for hiding this comment

LysandreJik left a comment

Choose a reason for hiding this comment

NielsRogge commented Nov 10, 2021 •

edited

Loading