Loss = NaN from iteration 0 #5986
Comments
Please do not post usage, installation, or modeling questions, or other requests for help to Issues. Use the caffe-users list instead. This helps developers maintain a clear, uncluttered, and efficient view of the state of Caffe. Please read the guidelines for contributing before submitting an issue or a pull request.
You might want to review your data, as it is the likely source of the NaNs; the caffemodel is another possible source. I suggest using the Python interface to inspect the contents of each blob.
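As a rough illustration of the blob inspection suggested above, here is a minimal sketch. The helper name `find_non_finite` is hypothetical and not part of Caffe; it only assumes that each blob's values can be handed over as a flat sequence of floats, keyed by blob name.

```python
import math
from typing import Dict, Iterable, List, Tuple


def find_non_finite(
    named_values: Dict[str, Iterable[float]]
) -> List[Tuple[str, int, int]]:
    """Return (name, nan_count, inf_count) for each entry containing NaN or Inf.

    Hypothetical helper: entries are reported in the mapping's own order,
    so with blobs listed in network order the first hit is the earliest one.
    """
    report = []
    for name, values in named_values.items():
        nan_count = 0
        inf_count = 0
        for v in values:
            if math.isnan(v):
                nan_count += 1
            elif math.isinf(v):
                inf_count += 1
        if nan_count or inf_count:
            report.append((name, nan_count, inf_count))
    return report


if __name__ == "__main__":
    # Stand-in values for illustration only, not real Caffe output.
    sample = {
        "data": [0.5, 1.0],
        "fc8": [float("nan"), 2.0],
        "loss": [float("inf")],
    }
    for name, n_nan, n_inf in find_non_finite(sample):
        print(f"{name}: {n_nan} NaN, {n_inf} Inf")
```

With pycaffe, after one call to `net.forward()`, the mapping could presumably be built as `{name: blob.data.ravel().tolist() for name, blob in net.blobs.items()}`. If the `data` blob itself is reported, the input data is the first thing to check; if only later blobs are, the weights or an intermediate layer are more likely.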
You might find this SO thread useful.
Thanks a lot! After I reduced the batch_size, the problem seems to have disappeared: at least from iteration 0 to iteration 5000 there is no NaN error, and my program keeps running.
Thanks a lot. I am new to Caffe, and I have recently been running into this error:
Check failed: error == cudaSuccess (3 vs. 0) initialization error.
Luckily, I found your question about this error on Stack Overflow: https://stackoverflow.com/questions/43756686/check-failed-error-cudasuccess-3-vs-0-initialization-error-check-fail
I think it's fate. LOL! Could you please give me some suggestions on this new problem? Thank you very much!
I am training bvlc_alexnet on my own data, and I did not change the AlexNet structure.
When I run training with solver.prototxt, the loss is NaN at every iteration, starting from iteration 0.
I tried reducing the learning rate to 0.000001, but it did not help.
I even set base_lr = 0, and the loss is still NaN from iteration 0.
This puzzles me, because of what Yangqing answered in issue #409 (comment).
Here is the solver.prototxt:
Here is the output:
Should I simplify the net structure, or is there a problem with my data?
Thanks very much!