-
Notifications
You must be signed in to change notification settings - Fork 84
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Strange training loss and test result #3
Comments
Hi, thanks for trying out the repo! Interesting - yes that loss does appear to be very low right from the start. I just tried a fresh github clone and see an initial loss (i.e. batch 0) of 0.34110, which seems more sensible. Can you take a look at the tensorboard logs and switch to the images tab? Does it look like the images are being loaded correctly? And if so, how do the disparity predictions look? Feel free to post a screenshot. A loss that low straight away suggests that it might be loading blank images. If you downloaded the raw KITTI and have In the meantime I will push a small fix to raise an error if no image files are found (currently it's caught in a Thanks a lot |
You are right, I am using png files, which needs a flag even when I am in monodepth2. But I did not expect this to be error-less/warning-less. Thank you for your answer |
Thanks @Owen-Liuyuxuan for:
And thanks @JamieWatson683 for the quick fix for this. |
I ran
epoch 0 | batch 0 | examples/s: 2.8 | loss: 0.00810 | time elapsed: 00h00m09s | time left: 00h00m00s
epoch 0 | batch 250 | examples/s: 22.4 | loss: 0.00049 | time elapsed: 00h02m34s | time left: 11h19m40s
epoch 0 | batch 500 | examples/s: 20.6 | loss: 0.00024 | time elapsed: 00h05m00s | time left: 10h59m55s
epoch 0 | batch 750 | examples/s: 22.2 | loss: 0.00013 | time elapsed: 00h07m26s | time left: 10h50m49s
epoch 0 | batch 1000 | examples/s: 21.5 | loss: 0.00008 | time elapsed: 00h09m52s | time left: 10h44m55s
epoch 0 | batch 1250 | examples/s: 21.0 | loss: 0.00018 | time elapsed: 00h12m18s | time left: 10h41m15s
epoch 0 | batch 1500 | examples/s: 21.7 | loss: 0.00019 | time elapsed: 00h14m45s | time left: 10h37m37s
epoch 0 | batch 1750 | examples/s: 21.5 | loss: 0.00011 | time elapsed: 00h17m10s | time left: 10h33m45s
The loss is extremely small.
The result on the 12 epoch (it should be reasonable at this moment), but is not.
The text was updated successfully, but these errors were encountered: