About the number of input channels #4

leijue222 · 2020-03-31T02:37:16Z

In paper 3.1 :

First, we increase the number of input channels from 3 to 9 to allow for
the extra trimap. We encode the trimap using Gaussian blurs of the definite
foreground and background masks at three different scales (in a similar way
to the method of [19] in interactive segmentation). This encoding differs from
existing approaches in deep image matting, as they usually encode the trimap as
a single channel with value 1 if foreground, 0.5 for unknown and 0 for background.

I know the output channels is 7 (a=1, F=3, B=3)
But why the input channels is 9?
In you code,I saw the input are image and trimap,then input channels will be 4.
So,why the input channels is 9 in paper 3.1?

xymsh · 2020-03-31T03:18:53Z

I think the input channel is 3 (rgb) + 6 (blurred trimap) = 9.

In code dataloader.py, the author defined a function called read_trimap. This function first reads the single-channel trimap, then transforms the pure foreground and pure background area into one-hot version. At this time, 1 channel trimap -> 2 channel one-hot trimap (indicating fg and bg area).

After that, trimap_transform function in transforms.py is used to blur the fg and bg at 3 scale levels separately. Thus, 2 channel trimap -> 6 channel trimap (3 for fg, and 3 for bg).

In the end, we can get 6 channel trimap + 3 channel RGB.

leijue222 · 2020-03-31T03:26:29Z

I got it.
Thanks!

kartikwar · 2021-01-21T07:34:12Z

I think the input channel is 3 (rgb) + 6 (blurred trimap) = 9.

In code dataloader.py, the author defined a function called read_trimap. This function first reads the single-channel trimap, then transforms the pure foreground and pure background area into one-hot version. At this time, 1 channel trimap -> 2 channel one-hot trimap (indicating fg and bg area).

After that, trimap_transform function in transforms.py is used to blur the fg and bg at 3 scale levels separately. Thus, 2 channel trimap -> 6 channel trimap (3 for fg, and 3 for bg).

In the end, we can get 6 channel trimap + 3 channel RGB.

how is the one hot encoding done in the code ?

are you referring to these lines ?

# trimap[trimap_im == 1, 1] = 1
# trimap[trimap_im == 0, 0] = 1

kartikwar · 2021-01-21T07:35:11Z

if yes wouldn't this binarize the trimap? Ideally trimap should be continous b/w 0 and 1

MarcoForte closed this as completed Mar 31, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About the number of input channels #4

About the number of input channels #4

leijue222 commented Mar 31, 2020 •

edited

xymsh commented Mar 31, 2020

leijue222 commented Mar 31, 2020

kartikwar commented Jan 21, 2021

kartikwar commented Jan 21, 2021

About the number of input channels #4

About the number of input channels #4

Comments

leijue222 commented Mar 31, 2020 • edited

xymsh commented Mar 31, 2020

leijue222 commented Mar 31, 2020

kartikwar commented Jan 21, 2021

kartikwar commented Jan 21, 2021

leijue222 commented Mar 31, 2020 •

edited