Add semantic segmentation post-processing method to MobileViT #19105

Merged

alaradirik merged 6 commits into huggingface:main from alaradirik:mobilevit-postprocessing

Sep 23, 2022

Contributor

alaradirik commented Sep 19, 2022

What does this PR do?

Adds post_process_semantic_segmentation method to MobileViTFeatureExtractor.

I will open an issue and separate PRs to make sure that:

* Segmentation models (DETR, MaskFormer, SegFormer, etc.) have consistently named post-processing methods, arguments and outputs
* ImageSegmentationPipeline works with all available segmentation models

Before submitting

[ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
[X] Did you read the contributor guideline, Pull Request section?
[ ] Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
[X] Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
[ ] Did you write any new necessary tests?


          add post-processing method for semantic segmentation

485cba8

alaradirik requested review from hollance and sgugger

September 19, 2022 14:25

alaradirik changed the title ~~add post-processing method for semantic segmentation~~ Add semantic segmentation post-processing method to MobileViT

HuggingFaceDocBuilderDev commented Sep 19, 2022

The documentation is not available anymore as the PR was closed or merged.

sgugger reviewed

src/transformers/models/mobilevit/feature_extraction_mobilevit.py Outdated

Comment on lines 160 to 167

+                      Args:
+                      Converts the output of [`MobileViTForSemanticSegmentation`] into semantic segmentation maps. Only supports PyTorch.:
+                          outputs ([`MobileViTForSemanticSegmentation`]):
+                              Raw outputs of the model.
+                          target_sizes (`torch.Tensor` of shape `(batch_size, 2)` or `List[Tuple]` of length `batch_size`,
+                          *optional*):
+                              Torch Tensor (or list) corresponding to the requested final size (h, w) of each prediction. If left to
+                              None, predictions will not be resized.

Collaborator

sgugger Sep 19, 2022

Same problem as in the other PR: the docstring is not properly formatted because the description of the function comes after the Args instead of before.

Contributor Author

alaradirik Sep 20, 2022

Fixed it!


          fix styling

b3516e0

hollance reviewed

src/transformers/models/mobilevit/feature_extraction_mobilevit.py Outdated

@@ @@ -151,3 +154,46 @@ def __call__( @@
                       encoded_inputs = BatchFeature(data=data, tensor_type=return_tensors)
                       return encoded_inputs
+                  def post_process_semantic_segmentation(self, outputs, target_sizes: Union[TensorType, List[Tuple]] = None):

Contributor

hollance Sep 20, 2022

Could this be added to the Mixin instead of the FeatureExtractor?

Contributor Author

alaradirik Sep 21, 2022

@hollance I agree with this but I'd prefer to do this after making sure (1) all post-processing methods of all segmentation models have consistent input arguments and naming and (2) ImageSegmentationPipeline supports all available segmentation models rather than just DETR and MaskFormer.

sgugger reviewed

src/transformers/models/mobilevit/feature_extraction_mobilevit.py Outdated

Comment on lines 163 to 173

+                      Args:
+                          outputs ([`MobileViTForSemanticSegmentation`]):
+                              Raw outputs of the model.
+                          target_sizes (`torch.Tensor` of shape `(batch_size, 2)` or `List[Tuple]` of length `batch_size`,
+                          *optional*):
+                              Torch Tensor (or list) corresponding to the requested final size (h, w) of each prediction. If left to
+                              None, predictions will not be resized.
+                      Returns:
+                          semantic_segmentation: `torch.Tensor` of shape `(batch_size, 2)` or `List[torch.Tensor]` of length
+                          `batch_size`, where each item is a semantic segmentation map of of the corresponding target_sizes entry (w,
+                          h) if `target_sizes` is specified). Each entry of each `torch.Tensor` correspond to a semantic class id.

Collaborator

sgugger Sep 20, 2022

Same comments as in the other PRs.

Contributor Author

alaradirik Sep 21, 2022

Thank you, this is fixed now. Post-processing uses torch for resizing and returns a list of torch tensors of shape (height, width). I also added a test for the post-processing.
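
The behavior described above (resize with torch, then return a list of per-image maps of shape (height, width)) can be sketched roughly as follows. This is a minimal illustration and not necessarily the merged implementation: the function name `semantic_maps_from_logits` is hypothetical, it takes a raw logits tensor of shape `(batch_size, num_labels, height, width)` rather than the model output object, and the choice of bilinear interpolation is an assumption.

```python
from typing import List, Optional, Tuple

import torch
import torch.nn.functional as F


def semantic_maps_from_logits(
    logits: torch.Tensor,
    target_sizes: Optional[List[Tuple[int, int]]] = None,
) -> List[torch.Tensor]:
    """Hypothetical helper: turn (batch, num_labels, h, w) logits into class-id maps."""
    if target_sizes is None:
        # No resizing requested: take the argmax over the label dimension
        # and split the batch into a list of (h, w) tensors.
        return list(logits.argmax(dim=1))

    if len(target_sizes) != logits.shape[0]:
        raise ValueError("target_sizes must contain one (height, width) pair per image")

    maps = []
    for image_logits, size in zip(logits, target_sizes):
        # Resize the logits (not the class ids) so interpolation stays meaningful.
        # Bilinear mode is an assumption made for this sketch.
        resized = F.interpolate(
            image_logits.unsqueeze(0), size=tuple(size), mode="bilinear", align_corners=False
        )
        # Drop the temporary batch dimension and pick the best label per pixel.
        maps.append(resized[0].argmax(dim=0))
    return maps
```

Each returned tensor holds one semantic class id per pixel, so images in the same batch can be resized to different target sizes.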


          add test, ensure consistent postprocessing output

7447d20

sgugger reviewed

src/transformers/models/mobilevit/feature_extraction_mobilevit.py Outdated

+                              List of tuples corresponding to the requested final size (height, width) of each prediction. If left to
+                              None, predictions will not be resized.
+                      Returns:
+                          semantic_segmentation: `List[torch.Tensor]` of length `batch_size`, where each item is a semantic

Collaborator

sgugger Sep 21, 2022

As said previously in the other PRs, the return type should come first, followed by the colon.

Contributor Author

alaradirik Sep 23, 2022

Thanks, fixed it.
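
The two docstring points raised in this review (function description before `Args:`, and the return type first followed by a colon in `Returns:`) could look roughly like the layout below. This is only a sketch of the requested layout: the function name `docstring_layout_sketch` is hypothetical, the body is deliberately unimplemented, and the exact wording of the `Returns:` entry is an assumption rather than the merged text.

```python
from typing import List, Optional, Tuple


def docstring_layout_sketch(outputs, target_sizes: Optional[List[Tuple[int, int]]] = None):
    """
    Converts the output of [`MobileViTForSemanticSegmentation`] into semantic segmentation maps. Only supports
    PyTorch.

    Args:
        outputs ([`MobileViTForSemanticSegmentation`]):
            Raw outputs of the model.
        target_sizes (`List[Tuple]` of length `batch_size`, *optional*):
            List of tuples corresponding to the requested final size (height, width) of each prediction. If left to
            None, predictions will not be resized.

    Returns:
        `List[torch.Tensor]`: A list of length `batch_size`, where each item is a semantic segmentation map of
        shape (height, width). Each entry corresponds to a semantic class id.
    """
    # Layout illustration only: the body is intentionally left unimplemented.
    raise NotImplementedError("docstring layout sketch")
```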


          fix docs style

fdf84e3

sgugger reviewed

src/transformers/models/mobilevit/feature_extraction_mobilevit.py Outdated


          Update src/transformers/models/mobilevit/feature_extraction_mobilevit.py

72be12d

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

sgugger approved these changes


          Merge branch 'huggingface:main' into mobilevit-postprocessing

07ae0e5

alaradirik merged commit 7e84723 into huggingface:main

oneraghavan pushed a commit to oneraghavan/transformers that referenced this pull request


          Add semantic segmentation post-processing method to MobileViT (huggingface#19105)

2eb2bd6

* add post-processing method for semantic segmentation

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>