
Added utility to draw segmentation masks #3330

Merged — 38 commits merged into pytorch:master on Mar 22, 2021
Conversation

@oke-aditya (Contributor) commented Jan 30, 2021:

Closes #3272.

Initial Implementation

  • Adds Code
  • Adds Tests
  • Adds Docs

@oke-aditya oke-aditya marked this pull request as ready for review February 2, 2021 17:53
torchvision/utils.py (outdated review thread):
img_to_draw = torch.from_numpy(np.array(img_to_draw))

# Project the drawn image to the original one
image[: 1] = img_to_draw
Contributor Author (@oke-aditya):

I need some help here, as this projects onto a black background.

My guess is that we need an alpha channel, which would make the masks transparent?

Contributor:

I suppose there are many ways to do it. One that I had in mind originally (which might not be optimal) is to convert img_to_draw from palette to RGBA, replace the background colour with transparent, and then combine it with image to achieve the "projection". It's worth experimenting with the approach because it's likely there is a better way.
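A minimal sketch of that idea (the helper name project_masks_rgba is hypothetical; it assumes image is a uint8 (3, H, W) tensor and img_to_draw is the PIL palette image, with black as the background colour):

import numpy as np
import torch
from PIL import Image

def project_masks_rgba(image: torch.Tensor, img_to_draw: Image.Image, alpha: float = 0.6) -> torch.Tensor:
    # Convert the palette image to RGBA so the background can be made transparent.
    rgba = np.array(img_to_draw.convert("RGBA"))
    # Treat black pixels (the background colour of the palette) as fully transparent.
    background = (rgba[..., :3] == 0).all(axis=-1)
    rgba[background, 3] = 0

    overlay = torch.from_numpy(rgba).permute(2, 0, 1).float()
    weight = (overlay[3:4] / 255.0) * alpha      # per-pixel blend weight from the alpha channel
    # Combine ("project") the drawn masks onto the original image.
    out = image.float() * (1 - weight) + overlay[:3] * weight
    return out.to(torch.uint8)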

Contributor Author (@oke-aditya):

Yep, DETR has very nice visualizations, but they use matplotlib, and I'm unsure how to reproduce them.
As you pointed out before, the Mask R-CNN utils have a nice way to apply masks too.

Contributor:

I just sent it for reference, not necessarily for reproduction. :)

@datumbox (Contributor) left a comment:

@oke-aditya It's going in the right direction. I left a few comments to get your thoughts.

BTW, @fmassa brought to my attention a nice guide they have for DETR, with some visualization utils we might want to look into for inspiration.


@oke-aditya (Contributor, author):

Hi @datumbox, I just resolved the num_classes hardcoding by changing the parameters to probabilities.

@oke-aditya (Contributor, author) commented Feb 4, 2021:

I'm a bit unsure how to proceed now. One way, I think, is to create another util that applies a weighted mask, just like the Mask R-CNN Matterport benchmark.

We can probably use this and apply the mask here.

I'm unsure about the tests; it might be too lengthy and error-prone to write out a 20 x 20 x 3 tensor of probabilities by hand. I thought of just running FCN on torch.full and using its outputs (see the sketch below).

Let me know how to proceed! This part seems tricky.

This does double-check things, but it couples the tests with the models.
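For illustration, a sketch of that idea (the model choice, num_classes, and input size here are assumptions, not the final test code):

import torch
from torchvision.models.segmentation import fcn_resnet50

# Run FCN on a constant image to get realistic per-class probabilities for a test.
model = fcn_resnet50(pretrained=False, num_classes=21).eval()
img = torch.full((1, 3, 20, 20), 0.5)
with torch.no_grad():
    masks = model(img)["out"][0].softmax(0)   # (21, 20, 20) per-class probabilities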

@datumbox (Contributor) commented Feb 4, 2021:

@oke-aditya Great changes, I think we are almost there!

As I commented above, there are multiple ways to do this. Here is a hacky, quick-and-dirty approach to give you an idea. I'm sure you can do it in a much better way:

from typing import List, Optional, Tuple, Union

import numpy as np
import torch
from PIL import Image, ImageColor


@torch.no_grad()
def draw_segmentation_masks(
    image: torch.Tensor,
    masks: torch.Tensor,
    colors: Optional[List[Union[str, Tuple[int, int, int]]]] = None,
) -> torch.Tensor:
    if not isinstance(image, torch.Tensor):
        raise TypeError(f"Tensor expected, got {type(image)}")
    elif image.dtype != torch.uint8:
        raise ValueError(f"Tensor uint8 expected, got {image.dtype}")
    elif image.dim() != 3:
        raise ValueError("Pass individual images, not batches")

    # Pick the most likely class per pixel and draw it as a palette image.
    classifications = masks.argmax(0).byte()
    img_to_draw = Image.fromarray(classifications.cpu().numpy())

    if colors is None:
        # Generate a deterministic RGBA palette, one colour per class.
        num_classes = masks.size(0)
        palette = torch.tensor([2 ** 25 - 1, 2 ** 15 - 1, 2 ** 21 - 1, 0])
        colors_t = torch.as_tensor([i for i in range(num_classes)])[:, None] * palette
        color_arr = (colors_t % 255).numpy().astype("uint8")
        color_arr[1:, 3] = 255
    else:
        color_list = []
        for color in colors:
            if isinstance(color, str):
                fill_color = ImageColor.getrgb(color)
                color_list.append(fill_color)
            elif isinstance(color, tuple):
                color_list.append(color)

        color_arr = np.array(color_list).astype("uint8")

    img_to_draw.putpalette(color_arr)

    # Convert the palette image to RGBA and back to a CHW tensor.
    img_to_draw = torch.from_numpy(np.array(img_to_draw.convert('RGBA')))
    img_to_draw = img_to_draw.permute((2, 0, 1))

    # Alpha-blend the drawn masks with the original image (extended to RGBA).
    alpha = 0.6
    image_rgba = torch.cat([image, torch.full(image.shape[1:], 255, dtype=torch.uint8).unsqueeze(0)])
    return (image_rgba.float() * alpha + img_to_draw.float() * (1.0 - alpha)).to(dtype=torch.uint8)

Ping me when you have a demo you are comfortable with to discuss the last details of the API. :)

@oke-aditya (Contributor, author):

Sorry for the delay, @datumbox. Here are a few outputs for different values of alpha:

Alpha = 0.2: [image: draw_masks_util2]
Alpha = 0.3: [image: draw_masks_util3]
Alpha = 0.6: [image: draw_masks_util4]
Alpha = 0.7: [image: draw_masks_util5]

Also, I made alpha a parameter, as it is really useful for removing or keeping the background.

Another thought I had was to add an apply_mask util which can be used to project the mask.
This might be useful in other frequently used alpha-blending and alpha-masking cases.
So let me know!

def apply_mask(image, mask, alpha):
    # Blend a (4, H, W) RGBA mask tensor onto a (3, H, W) uint8 image.
    image_rgba = torch.cat([image, torch.full(image.shape[1:], 255, dtype=torch.uint8).unsqueeze(0)])
    return (image_rgba.float() * alpha + mask.float() * (1.0 - alpha)).to(dtype=torch.uint8)
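For example, a hypothetical call inside the drawing utility (assuming img_to_draw is the RGBA mask tensor built earlier):

# image: (3, H, W) uint8 original; img_to_draw: (4, H, W) RGBA overlay.
result = apply_mask(image, img_to_draw, alpha=0.6)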

@fmassa (Member) left a comment:

Hi,

The PR looks very nice, thanks a lot!

I have one suggestion which I think would make the function more generic and wouldn't involve too many changes. Let me know what you think.

Comment on lines 244 to 245
num_classes = masks.size()[0]
masks = masks.argmax(0)
Member:

Instead of being specific to semantic segmentation and not supporting instance or panoptic segmentation, I think we could make this slightly more generic while supporting all the use cases I mentioned. The idea would be to accept a mask as a [num_masks, H, W] boolean Tensor.
This way, the user can get the semantic segmentation masks to pass to this function as follows:

out.argmax(0) == torch.arange(out.shape[0])[:, None, None]
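For instance, a minimal sketch of producing such boolean masks from a torchvision segmentation model (the model choice and the random input are illustrative assumptions):

import torch
from torchvision.models.segmentation import fcn_resnet50

model = fcn_resnet50(pretrained=True).eval()
batch = torch.rand(1, 3, 224, 224)            # stand-in for a normalized input image
with torch.no_grad():
    out = model(batch)["out"][0]              # (num_classes, H, W) logits for one image

# One boolean (H, W) mask per class: True where that class wins the argmax.
bool_masks = out.argmax(0) == torch.arange(out.shape[0])[:, None, None]
print(bool_masks.shape, bool_masks.dtype)     # torch.Size([21, 224, 224]) torch.bool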

masks (Tensor): Tensor of shape (num_classes, H, W). Each value contains the probability of the predicted class.
alpha (float): Float number between 0 and 1 denoting the factor of transparency of the masks.
colors (List[Union[str, Tuple[int, int, int]]]): List containing the colors of the masks. The colors can
be represented as `str` or `Tuple[int, int, int]`.
Member:

If we add support for instance segmentation and panoptic segmentation, I think it would be a good idea to add an example using the output of a semantic segmentation model and an instance segmentation model (for example, from those in torchvision).

Contributor Author (@oke-aditya):

Sure, this is a TODO. I will add a GitHub gist and other minor documentation improvements for both utilities.

Can I do this in a follow-up PR, which will address all the issues mentioned in #3364?

@fmassa (Member) commented Feb 10, 2021:

Sure, adding documentation improvements in a follow-up PR is OK with me. What do you think about the other comment as well? It would be a breaking change in functionality if we support it, so it's better to do it once (especially since the branch cut is happening very soon; if we merge this now it can get integrated into the release, in which case breaking backwards compatibility is more annoying).

Contributor Author (@oke-aditya):

Definitely, I will address the other comment ASAP in this PR 😄 I understand how bad a backwards-incompatible change would be.

@oke-aditya (Contributor, author) commented Feb 10, 2021:

Slight problem while implementing: it seems that argmax is not supported for bool tensors.

I am pushing my latest changes; can someone please have a look?

Simple code to reproduce the bug:

masks = torch.tensor([
    [
        [False, False, False, False, False],
        [True, True, True, True, True],
        [False, False, False, False, False],
        [False, False, False, False, False],
        [False, False, False, False, False],
    ],
    [
        [True, True, True, True, True],
        [False, False, False, False, False],
        [True, True, True, True, True],
        [True, True, True, True, True],
        [False, False, False, False, False],
    ],
    [
        [False, False, False, False, False],
        [False, False, False, False, False],
        [False, False, False, False, False],
        [False, False, False, False, False],
        [True, True, True, True, True],
    ],
], dtype=torch.bool)

masks = masks.argmax(0)
print(masks)

RuntimeError: "argmax_cpu" not implemented for 'Bool'

Maybe there is a workaround? I guess this was the only change needed to support a bool Tensor instead of a float Tensor.

The error occurs when we try to get [H, W] from the [num_masks, H, W] tensor.

@fmassa (Member) commented Feb 11, 2021:

@oke-aditya Maybe I'm missing something, but if we pass a bool tensor, we don't need to compute the argmax anymore, because the independent masks have already been computed?

EDIT: Oh, I see, you don't perform a for loop over each one of the masks as of now. Computing the argmax could be done after casting the mask to float, for example, but note that in instance segmentation each pixel can be covered by multiple masks, so the argmax wouldn't be enough to handle those. But it can be a first approximation.

@oke-aditya (Contributor, author) commented Feb 11, 2021:

Computing the argmax could be done after casting the mask to float for example, but note that in instance segmentation each pixel can be covered by multiple masks, so the argmax wouldn't be enough to handle those. But it can be a first approximation

That's possible. But may I ask why we decided to pass a bool Tensor and not a float mask Tensor? Sorry if the question is slightly dumb.

Edit: My idea was to make the initial implementation compatible with single-channel masks, which would make it compatible with Mask R-CNN. I thought that in the same function we could handle the single-channel case differently.
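To illustrate the Mask R-CNN case, a rough sketch (the pretrained torchvision model and random input are stand-ins used purely for illustration):

import torch
from torchvision.models.detection import maskrcnn_resnet50_fpn

model = maskrcnn_resnet50_fpn(pretrained=True).eval()
img = torch.rand(3, 300, 400)                 # stand-in for a real image
with torch.no_grad():
    prediction = model([img])[0]

# Mask R-CNN returns soft masks of shape (num_instances, 1, H, W); squeezing the
# channel dimension gives the (num_masks, H, W) layout discussed in this thread.
masks = prediction["masks"].squeeze(1)        # float probabilities per instance
bool_masks = masks > 0.5                      # optional binarization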

@oke-aditya (Contributor, author) commented Feb 16, 2021:

I think, with the release cut coming quite soon, we could wait and add this in the next release (I mean after 0.9.0)?

@oke-aditya (Contributor, author):

Hey @fmassa and @datumbox, any thoughts on how to proceed further with this? I'm willing to incorporate any changes requested 😃

@fmassa (Member) left a comment:

Sorry for the delay in reviewing.

I've made a comment to unblock you, but I think in the future we might need to change the implementation to let it handle overlapping masks (which is the whole point of the function accepting a tensor with a per-object map).

The input mask doesn't need to be boolean, by the way; it can be a floating-point tensor representing probabilities for the given instance.

But to unblock things for now, let's just make the small change I proposed, and we can improve this in a follow-up PR.

raise ValueError("Pass an RGB image. Other Image formats are not supported")

num_masks = masks.size()[0]
masks = masks.argmax(0)
Member:

In order to unblock, you can do masks.to(torch.int64).argmax(0), or cast it to float if you want.
This won't handle overlapping instances very well though, and we will need to remove this in the future and probably replace it with a for loop so that overlapping masks are taken into account.

Plus, by using for loops and letting the mask be floating-point if the user wants, we can allow the user to pass heatmaps (instead of only binary maps), which would be very nice.
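A rough sketch of that for-loop idea (a hypothetical helper, not the code merged in this PR):

import torch

def overlay_masks(image, masks, colors, alpha=0.6):
    # image: uint8 (3, H, W); masks: float (num_masks, H, W) probabilities in [0, 1];
    # colors: one (r, g, b) tuple per mask.
    out = image.float()
    for mask, color in zip(masks, colors):
        color_t = torch.tensor(color, dtype=torch.float32)[:, None, None]
        weight = alpha * mask.clamp(0, 1).unsqueeze(0)     # per-pixel blend weight
        # Composite each mask in turn, so overlapping instances all remain visible.
        out = out * (1 - weight) + color_t * weight
    return out.to(torch.uint8)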

@oke-aditya oke-aditya requested a review from fmassa March 19, 2021 16:49
@oke-aditya (Contributor, author):

Extremely sorry for the delay (my health let me down 😢).

I think a boolean tensor leads to some limitations, and we end up re-casting it to int/float anyway.

I refactored to use floating-point masks; each point represents the probability of a class.
Floating-point masks make more sense, as we can either take argmax() and plot the best masks,
take topk to plot the top masks, or, as you said, cover the overlapping case.

Currently, this code accepts a floating-point tensor of shape (num_masks, H, W).

Let me know what we need to do in further PRs / this PR.
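As a hypothetical end-to-end usage of the utility as described at this point in the thread (the alpha value and the simplified input normalization are assumptions for brevity):

import torch
from torchvision.utils import draw_segmentation_masks
from torchvision.models.segmentation import fcn_resnet50

model = fcn_resnet50(pretrained=True).eval()
image = torch.randint(0, 256, (3, 224, 224), dtype=torch.uint8)   # stand-in uint8 image
with torch.no_grad():
    out = model(image.unsqueeze(0).float() / 255.0)["out"][0]

masks = out.softmax(0)                        # (num_masks, H, W) probabilities
result = draw_segmentation_masks(image, masks, alpha=0.6)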

@fmassa (Member) left a comment:

Thanks!

@fmassa fmassa merged commit 19ad0bb into pytorch:master Mar 22, 2021
@oke-aditya oke-aditya deleted the add_msks branch March 22, 2021 18:13
facebook-github-bot pushed a commit that referenced this pull request Apr 1, 2021
Summary:
* add draw segm masks

* rewrites with new api

* fix flaky colors

* fix resize bug

* resize for sanity

* cleanup

* project the image

* Minor refactor to adopt num classes

* add uint8 in docstring

* adds alpha and docstring

* move code a bit down

* Minor fix

* fix type check

* Fixing resize bug.

* Fix type of alpha.

* Remove unnecessary RGBA conversions.

* update docs to supported only rgb

* minor edits

* adds tests

* shifts masks up

* change tests and implementation for bool

* change mode to L

* convert to float

* fixes docs

Reviewed By: fmassa

Differential Revision: D27433933

fbshipit-source-id: 26e72b4f8471218631b26cc555422890b0f6b81d

Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
Co-authored-by: Vasilis Vryniotis <vvryniotis@fb.com>
Co-authored-by: Francisco Massa <fvsmassa@gmail.com>

Successfully merging this pull request may close: Utility to draw Semantic Segmentation Masks.
5 participants