
Bugfix - pretraining oom #348

Merged
merged 4 commits into develop from bugfix/pretraining-oom on Dec 27, 2021

Conversation

@eduardocarvp (Collaborator) commented Dec 18, 2021

What kind of change does this PR introduce?

This fixes the out-of-memory (OOM) error that occurs when using a GPU with large validation sets during pretraining.
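This is not the exact code of the PR, just a minimal sketch of the underlying idea, assuming a pretraining network whose forward pass returns the reconstruction, the embedded input, and the obfuscation mask (model, loss_fn, and the loader contents are illustrative names): compute the validation loss batch by batch so that only one batch lives on the GPU at a time.

import torch

def batched_validation_loss(model, loss_fn, valid_loader, device="cuda"):
    # Accumulate the unsupervised loss over mini-batches instead of pushing
    # the whole validation set through the GPU in a single forward pass.
    model.eval()
    total_loss, n_samples = 0.0, 0
    with torch.no_grad():
        for X in valid_loader:  # loader yields CPU tensors
            X = X.to(device)
            output, embedded_x, obf_vars = model(X)
            loss = loss_fn(output, embedded_x, obf_vars)
            total_loss += loss.item() * X.size(0)
            n_samples += X.size(0)
    return total_loss / n_samples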

Does this PR introduce a breaking change?

No.

What needs to be documented once your changes are merged?

Nothing.

Closing issues

Closes #341

PS:
I spent a lot of time trying to find out why the two losses were not giving the same answer. It turns out that, by default, numpy.std does not apply Bessel's correction, which consists of dividing by n-1 instead of n (i.e. multiplying the biased variance by n/(n-1)) to make the estimator unbiased. For that, we need to add ddof=1 on the line

batch_stds = np.std(embedded_x, axis=0, ddof=1) ** 2 + eps

and only then does the test pass.
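As a standalone illustration of the discrepancy (not the project's unit test): torch.std applies Bessel's correction by default while numpy.std does not, so the two variance estimates only match once ddof=1 is passed on the numpy side.

import numpy as np
import torch

eps = 1e-9
embedded_x = np.random.rand(128, 8).astype(np.float32)

torch_vars = torch.std(torch.from_numpy(embedded_x), dim=0) ** 2 + eps  # unbiased by default
np_vars_biased = np.std(embedded_x, axis=0) ** 2 + eps                  # ddof=0, smaller by (n-1)/n
np_vars_unbiased = np.std(embedded_x, axis=0, ddof=1) ** 2 + eps        # matches torch

assert np.allclose(torch_vars.numpy(), np_vars_unbiased, atol=1e-5)
assert not np.allclose(torch_vars.numpy(), np_vars_biased, atol=1e-5)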

Speaking of tests, the test was written, but since we do not currently have unit tests and pytest is not installed, adding it would change the poetry lock and upgrade quite a lot of packages. We should probably do this in another PR and add the unit tests to the gitlab-ci on that occasion.

@Optimox (Collaborator) commented Dec 20, 2021

Thanks @eduardocarvp,

It seems to be exactly what we needed.

I was thinking about the cases where the std is 0; this happens every time a feature has a single unique value.

Since all of this is just about normalization, we could maybe fill the zero values of the std with the absolute value of the unique value (or 1 when the unique value is 0), as in the sketch below.
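A rough sketch of that fallback (not merged code; safe_batch_stds is a hypothetical helper and embedded_x is assumed to be the dense embedded batch):

import numpy as np

def safe_batch_stds(embedded_x, eps=1e-9):
    # Replace zero stds (constant columns) with the absolute value of the
    # constant, and with 1 when that constant is itself 0.
    stds = np.std(embedded_x, axis=0, ddof=1)
    constant_cols = stds == 0
    fallback = np.abs(embedded_x[0, constant_cols])  # the unique value of each constant column
    fallback[fallback == 0] = 1.0
    stds[constant_cols] = fallback
    return stds ** 2 + eps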

This might be done in another PR, but I'd like to have it in the next release, as the current behaviour causes a lot of confusion during pretraining.

@eduardocarvp (Collaborator, Author) commented:

What about this, @Optimox?
The tests are working, and in the second test case we have a constant matrix. Do you think it's a good idea to update the package versions before the release? I'd like to add pytest to the dev dependencies; other than that, simply upgrading the versions might be a good idea if it does not break anything.

@Optimox (Collaborator) commented Dec 21, 2021

@eduardocarvp I think we should avoid requiring unnecessarily new versions, to keep compatibility with other packages. For instance, we do not need the latest PyTorch version; it would be a shame to force torch=1.10 while things work fine with version 1.5.
Otherwise, there is absolutely no problem with adding dev packages to the repo. But you should probably avoid a poetry update, which would bump all the dependencies to their latest versions.

@Optimox merged commit 95e0e9a into develop on Dec 27, 2021
@Optimox deleted the bugfix/pretraining-oom branch on December 27, 2021 at 18:05
Labels: bug