Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mse criterion returns negative values when running in parallel on the same gpu #136

Open
ItzikMalkiel opened this issue Sep 9, 2015 · 1 comment

Comments

@ItzikMalkiel
Copy link

I'm using nn.MSECriterion and I've noticed that the following code results with negative value in very rare cases:
local err = criterion:forward(output, targets)

It happened to me in very rare cases (3-4 times in the last year) - in all cases I trained two networks at the same time over the same gpu.
Note that it happened after thousands of epochs and it is very hard to reproduce.

If someone knows why it can happen or how to fix/avoid it - please let me know.

Thanks in advance,
Itzik.

@soumith
Copy link
Member

soumith commented Sep 14, 2015

any possible test case (even if the test case has to be run 10000 times to get one reproduction), will be quite helpful.

I've not noticed such an issue offhand. Without a test-case, since this is really hard to figure out why it happens, I have no idea what to do next.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants