
Set additional seeds? #33

Closed · egpbos opened this issue Nov 16, 2020 · 3 comments
Labels: bug (Something isn't working)

egpbos (Collaborator) commented Nov 16, 2020

We're getting different results with the same configuration settings, including the seed, so possibly we are not setting all seeds. We should check which ones are missing. I suspect torch.cuda.seed() may be one.
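
For reference, a minimal sketch of what seeding all the usual RNGs could look like, assuming the standard Python/NumPy/PyTorch stack (the `seed_everything` helper is hypothetical, not existing platalea code):

```python
import random

import numpy as np
import torch


def seed_everything(seed: int) -> None:
    # Hypothetical helper: seed every RNG the training code might touch.
    random.seed(seed)                 # Python's built-in RNG
    np.random.seed(seed)              # NumPy's global RNG
    torch.manual_seed(seed)           # torch CPU RNG (recent versions also seed CUDA)
    torch.cuda.manual_seed_all(seed)  # be explicit about all CUDA devices
```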

@egpbos egpbos added the bug Something isn't working label Jan 13, 2021
@cwmeijer cwmeijer assigned cwmeijer and unassigned cwmeijer Feb 16, 2021
cwmeijer (Contributor) commented
We have tests that check performance now (#63). These tests seem to pass consistently on my local machine as well as on GitHub Actions. A small number of them were failing on carmine, though. Carmine is the only one of the three machines I just mentioned that uses GPUs. We should indeed check out torch.cuda.seed().

egpbos (Collaborator, Author) commented Apr 27, 2021

With commit 1ffcb82, I made it possible to run the tox test suite on GPU by setting PLATALEA_DEVICE="cuda:0" (or another device number) in the shell before running tox. By default, environment variables are not forwarded into the tox environment, so you have to explicitly list the ones you want in tox.ini, as sketched below.
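
For anyone reproducing this, a minimal sketch of the relevant tox.ini fragment; the `passenv` line is the important part, while the deps and commands here are illustrative rather than platalea's actual configuration:

```ini
[testenv]
# tox isolates the environment, so explicitly forward the variable
# that the test suite reads to select the torch device.
passenv = PLATALEA_DEVICE
deps = pytest
commands = pytest {posargs}
```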

I tried running the test suite on carmine and everything passes.

However, since we're using approximate value checks in the tests, this obviously does not tell us whether the code is now deterministic or whether we're still missing some random seed.

@cwmeijer, you mention in #78 (comment) that the values get rounded differently on different machines. Did you look into where this could have come from? For instance, if it's just about different library versions, we could pin those and maybe then use exact value equality asserts.
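
To illustrate the difference between the two kinds of asserts (the reference value and the `compute_recall` stand-in below are hypothetical, not the actual test code):

```python
import pytest

EXPECTED_RECALL = 0.4123  # stored reference value (hypothetical)


def compute_recall():
    # Stand-in for the real evaluation; hypothetical.
    return 0.41228


def test_recall_approximate():
    # Tolerant check: passes across platforms, but can hide non-determinism.
    assert compute_recall() == pytest.approx(EXPECTED_RECALL, abs=1e-3)


def test_recall_exact():
    # Exact check: only viable once results are bit-for-bit reproducible
    # on pinned platforms and pinned dependency versions.
    assert compute_recall() == EXPECTED_RECALL
```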

egpbos (Collaborator, Author) commented Apr 28, 2021

In branch https://github.com/spokenlanguage/platalea/tree/exact_equal_experiment_tests, I switched the asserts to check results exactly, so I could look into the determinism of the test results.

On carmine, using the tox test suite, I could not reproduce any non-determinism, neither on GPU nor on CPU. The tests fail and then print the diffs of the results; if I run the tests multiple times, the diffs are exactly the same each time.

Note that I also tried installing the same dependency versions (at least the Python ones; I can't control system dependencies). As @cwmeijer saw before, this still does not make the results consistent across machines. One reason may be that my laptop is a Mac, so the underlying system libraries may differ from those on carmine's Linux setup and give inconsistent results. Indeed, PyTorch does not guarantee deterministic results across different platforms; see https://pytorch.org/docs/stable/notes/randomness.html.
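
For completeness, the linked randomness notes describe switches for forcing deterministic algorithms on a single platform. A minimal sketch, assuming a recent PyTorch; note this still does not buy cross-platform reproducibility, which is the problem here:

```python
import torch

torch.manual_seed(42)                      # seeds the CPU RNG and all CUDA devices
torch.use_deterministic_algorithms(True)   # raise an error on non-deterministic ops
torch.backends.cudnn.benchmark = False     # disable cuDNN autotuning, which can vary
torch.backends.cudnn.deterministic = True  # restrict cuDNN to deterministic kernels
```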

So it seems the different results we saw may indeed just have been platform or version differences. Given that we cannot seem to get them equal in order to perform further tests, and that I could not reproduce non-determinism in the first place, I vote we close this issue.

@bhigy bhigy closed this as completed Apr 28, 2021