Hey guys, fantastic work.

I have a question about the paper. You feed the output of ROIAlign into the matching network, but I'm having trouble understanding Figure 4. How is the input to the matching network for a single image an NxNx256 tensor? N is the number of garment classes, correct? The output of ROIAlign is either 7x7x256 or 14x14x256 (depending on whether you take the bbox stream or the mask stream). How are you getting NxN?
Thanks!
N is the spatial size of the feature map of an RoI, not the number of garment classes. Given an RoI, a fixed NxNxC feature map is extracted after ROIAlign to represent the features of that RoI, and this map is then fed to the matching network.
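For anyone else confused by the shapes, here is a minimal sketch (using torchvision's `roi_align`, not the authors' code) of how ROIAlign turns an arbitrarily sized RoI into a fixed NxNxC map; the `MatchingHead` below is purely illustrative, not the paper's exact architecture.

```python
# Sketch: ROIAlign pools every RoI to a fixed N x N grid, so each RoI becomes
# a C x N x N tensor regardless of its original size. Shapes are assumptions.
import torch
import torch.nn as nn
from torchvision.ops import roi_align

# Feature map from the backbone/FPN: batch of 1, 256 channels, stride 16.
features = torch.randn(1, 256, 50, 50)

# One RoI in (batch_index, x1, y1, x2, y2) format, in input-image coordinates.
rois = torch.tensor([[0, 32.0, 48.0, 256.0, 320.0]])

# Here N = 14, so the RoI becomes a 256 x 14 x 14 tensor.
roi_feat = roi_align(features, rois, output_size=(14, 14), spatial_scale=1.0 / 16)
print(roi_feat.shape)  # torch.Size([1, 256, 14, 14])

# Toy matching head (an assumption, not the repo's implementation):
# compare two pooled RoI feature maps and predict match / no-match.
class MatchingHead(nn.Module):
    def __init__(self, channels=256, n=14):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Flatten(),
            nn.Linear(channels * n * n, 256),
            nn.ReLU(),
            nn.Linear(256, 2),  # match / no-match logits
        )

    def forward(self, feat_a, feat_b):
        # Element-wise squared difference of the two N x N x C maps, then classify.
        return self.fc((feat_a - feat_b) ** 2)

head = MatchingHead()
logits = head(roi_feat, roi_feat.clone())
print(logits.shape)  # torch.Size([1, 2])
```

So the "N" in NxNx256 is just the pooled resolution (e.g. 7 or 14), and the 256 is the channel dimension C.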
Still curious about the RoI features fed into the matching net.
In the mask head (link), the procedure is:
backbone -> RoI Pooling -> 4x conv (feature extractor) -> 1x deconv + 1x conv (predictor)
So the RoI features fed into the matching net should be the features right after RoI Pooling. Am I correct?
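To make the tap point concrete, here is a rough PyTorch sketch of that mask-head pipeline; the layer sizes, the class count, and the comment marking where the pooled RoI features would be handed to the matching net are my assumptions, not the repo's actual code.

```python
# Sketch of a Mask R-CNN-style mask head:
#   backbone -> RoIAlign -> 4x conv -> 1x deconv + 1x1 conv.
# The question above is whether the matching net consumes the pooled features
# *before* the 4 convs; the comment in forward() marks that tensor.
import torch
import torch.nn as nn

class MaskHead(nn.Module):
    def __init__(self, in_channels=256, num_classes=13):  # 13 is illustrative
        super().__init__()
        # 4x conv feature extractor
        convs = []
        for _ in range(4):
            convs += [nn.Conv2d(in_channels, in_channels, 3, padding=1), nn.ReLU()]
        self.convs = nn.Sequential(*convs)
        # 1x deconv + 1x1 conv predictor
        self.deconv = nn.ConvTranspose2d(in_channels, in_channels, 2, stride=2)
        self.predictor = nn.Conv2d(in_channels, num_classes, 1)

    def forward(self, pooled_roi_feats):
        # pooled_roi_feats: [num_rois, C, 14, 14], straight out of RoIAlign.
        # If the matching net takes "features after RoI Pooling", this is the
        # tensor that would be shared with it.
        x = self.convs(pooled_roi_feats)
        return self.predictor(torch.relu(self.deconv(x)))

masks = MaskHead()(torch.randn(3, 256, 14, 14))
print(masks.shape)  # torch.Size([3, 13, 28, 28])
```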