Loss and training of the Rigid Structure Reconstructor #67

Closed
yijie0710 opened this issue Jun 2, 2020 · 3 comments

Comments

@yijie0710

yijie0710 commented Jun 2, 2020

Hello, dear authors.
I have read the paper and am trying to reproduce it in PyTorch.
I was wondering whether the rigid structure reconstruction (RSR) stage could be "lazy" and avoid learning the depth.
In Eq. (1) of the paper, could the network simply output T*D as E?
Then the rigid flow would be zero and the depth of all pixels would be the same.
The network would reconstruct a target image identical to the source image and would never learn the real depth and pose?

BTW, in my experiments the smoothness loss converges quickly, but the rigid warp loss barely changes. The depths are quite noisy at first. Then, after thousands of iterations, all pixels end up with the same depth.

@yijie0710
Author

I mean, maybe there should be a regularization term on the depth to prevent the "same depth for all pixels" output?

@yzcjtr
Owner

yzcjtr commented Jun 4, 2020

  1. Since we enforce the warping loss in both stages, including the rigid reconstruction part, predicting an identical depth for every pixel cannot lead to convergence of the optimization.
  2. I think it depends on your network initialization. You can check whether most of the rigid flow points to coordinates outside the image plane. In that case, the warping loss doesn't give you a reasonable supervision signal. Another way to debug it is to try different scaling coefficients for the posenet output or the depthnet output (GeoNet/geonet_nets.py, lines 10 to 11 in 5b17602):

    DISP_SCALING_RESNET50 = 5
    DISP_SCALING_VGG = 10

     You can also visualize the warped images guided by the rigid flow.

@yijie0710
Author

Thank you so much for your reply. I will try that.

yzcjtr closed this as completed Oct 2, 2020