New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hessian algorithm produces NaN values during the training procedure #119
Comments
Hi @rmlopes, I have two questions RE problem with Hessian algorithm
|
With the real data I get the error right away.
In the testing script even with 100000 samples I cannot reproduce the
Concerning (2), do you mean to ask if I get the error with some other implementation of the Hessian algorithm? (I am only trying NeuPy and Keras - to my surprise the behavior varies a lot even with same parameterization -, and Keras does not have the Hessian algos) |
Yes, there might be difference between different libraries due to different initialization methods - http://neupy.com/docs/cheatsheet.html#parameter-initialization-methods. One more question related to your dataset. What is the value range per each input feature? Is it a number beetween 0 and 1 or maybe it's between -3 and 3? |
I am using the |
@rmlopes , I've tried different ways to reproduce this problem, but non of them help. I was thinking that the issue can be related to the inverse matrix computation. I've changed the way that update calculates (in that way it suppose to work in Theano after graph optimization). Can you try to run your code with this update? pip install git+https://github.com/itdxer/neupy.git@release/v0.4.0 |
Seems to be fine now. In another note, even though my problem is not NLP, the structure of the data is more similar to NLP than images, so how would you implement a CNN like this using neupy? |
As far as I understood first convolutional layer produces different dimensions for different filters. Basically it means that you want to make 3 different convolutions in parallel and after a few more layers concatenate outputs together in one vector. If that's correct than there is no simple building layers to construct this structure. Of course, you can construct any layer you want (docs), but it might be difficult. |
It's great that this update fixed your problem. |
RE #118 (comment)
The text was updated successfully, but these errors were encountered: