-
Notifications
You must be signed in to change notification settings - Fork 683
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DeepIV Value Overflow #60
Comments
The Error output is:
|
It will work if I set:
|
But it will give |
It seems that scaling down the value will work:
|
So I guess there is a limit on the maximum of the value for the Deep IV module? |
Thanks for reporting this - I'll try to take a closer look tomorrow. |
I'm also getting NaNs, has this issues been resolved? |
@yl3832 When I investigated this previously, it appeared to be dependent on the the network architecture and initialization (e.g. the weights need to have lower variance as additional layers are added). Unfortunately I don't see any easy way to avoid this since we let the user specify the network and some choices may lead to issues like exploding gradients. |
@kbattocchi Thank you Keith, I did tried different architectures in my use-case and it sometimes works. Also, normalization always helps. |
The following is a slightly modification to the DeepIV notebook, but it no longer works. I guess it is because of the fact that I increased the max value of the X and T, which caused the overflow somewhere:
Could you help and investigate why?
The text was updated successfully, but these errors were encountered: