Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

which features are fed into the matching network? #2

Closed
shaayaansayed opened this issue Apr 29, 2019 · 2 comments
Closed

which features are fed into the matching network? #2

shaayaansayed opened this issue Apr 29, 2019 · 2 comments

Comments

@shaayaansayed
Copy link

shaayaansayed commented Apr 29, 2019

Hey guys, fantastic work.

I have a question about the paper. You feed the output of ROIAlign into the matching network. I'm having trouble understanding figure 4. How is the input for the matching network of a single image an NxNx256 tensor? N is the number of garment classes, correct? The output of ROIAlign is either 7x7x256 or 14x14x256 (depending on if you take the bbox stream or mask stream). How are you getting NxN?

Thanks!

@geyuying
Copy link
Collaborator

geyuying commented May 3, 2019

N is the size of feature map of a ROI. Given a ROI, a fixed NxNxC feature map is extracted after ROIAlign to represent features of that ROI and is then fed to the matching network.

@xwjabc
Copy link

xwjabc commented Oct 9, 2019

Still curious about the RoI features fed into the matching net.
In the mask head (link), it has the procedure:
backbone -> RoI Pooling -> 4x conv (feature extractor) -> 1x deconv + 1 conv (predictor)
So the RoI features fed into the match net should be the features after RoI Pooling. Am I correct?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants