[BUG] LSTM deep learning estimators failing CI on windows #4033

fkiraly · 2022-12-31T19:50:29Z

Since recently, a failure of two deep learning estimators has been appearing on windows CI:

FAILED sktime/tests/test_all_estimators.py::TestAllEstimators::test_methods_have_no_side_effects[MLPClassifier-1-ClassifierFitPredictMultivariate-predict]
FAILED sktime/tests/test_all_estimators.py::TestAllEstimators::test_methods_have_no_side_effects[LSTMFCNClassifier-0-ClassifierFitPredictMultivariate-predict]

Note that the failure appears to be only on python 3.9, but this is due to the matrix design which spreads estimators across version/OS combinations.

In theory, it could be a 3.9 specific failure, but I think that is less likely than windows specifity (although it may be worth to test that by turning the matrixdesign flag off in the CI).

The text was updated successfully, but these errors were encountered:

fkiraly · 2022-12-31T19:53:34Z

This does not seem clearly related to any recent PR merge to main.

@achieveordie, @AurumnPegasus, @jnrusson1, @solen0id, do you have a clue what is happening here?

fkiraly · 2022-12-31T19:54:47Z

traceback is not very informative:

 ================================== FAILURES ===================================
_____________________ sktime/tests/test_all_estimators.py _____________________
[gw0] win32 -- Python 3.9.12 C:\Miniconda\envs\test\python.exe
worker 'gw0' crashed while running 'sktime/tests/test_all_estimators.py::TestAllEstimators::test_methods_have_no_side_effects[MLPClassifier-1-ClassifierFitPredictMultivariate-predict]'
_____________________ sktime/tests/test_all_estimators.py _____________________
[gw1] win32 -- Python 3.9.12 C:\Miniconda\envs\test\python.exe
worker 'gw1' crashed while running 'sktime/tests/test_all_estimators.py::TestAllEstimators::test_methods_have_no_side_effects[LSTMFCNClassifier-0-ClassifierFitPredictMultivariate-predict]'

Skips two recent test failures on `main` until diagnosed and fixed. Failures are described in #4033

achieveordie · 2023-01-01T04:06:32Z

LSTMFCN was added last week, right? I suspect the problem is arising from that estimator.

We have seen similar problems back when TapNet was ported, OutOfMemory/MemoryError error arising from one worker can crash the entire test suite. Hence we see MLPClassifier failing, just because it happened to be tested on the same system.

The reason I think LSTMFCN is causing the failures is that its get_test_params() currently only reduces the number of epochs, the size of the model remains the same.

Let me take this up and diagnose if there are any more problems in it.

fkiraly · 2023-01-01T23:14:42Z

Yes, it seems to be LSTMFCN - it also pops up in other failures recently.

…#4037) This skips all tests for `LSTMFCNClassifier` due to unfixed sporadic failures on `main`, see this issue: #4033 After fixing, we should add the tests back. Also removes an orphaned comment related to this old issue which has been superseded in the exclusion list: #1627

fkiraly added the bug Something isn't working label Dec 31, 2022

fkiraly mentioned this issue Dec 31, 2022

[MNT] skip #4033 related failures until fixed #4034

Merged

fkiraly added the module:classification classification module: time series classification label Dec 31, 2022

fkiraly added a commit that referenced this issue Dec 31, 2022

[MNT] skip #4033 related failures until fixed (#4034)

748890f

Skips two recent test failures on `main` until diagnosed and fixed. Failures are described in #4033

This was referenced Dec 31, 2022

[DOC][BUG] Add warning regarding issues with macOS ARM #4010

Merged

& matthewmiddlehurst [DOC] improved docstring for dtw_distance #4028

Merged

achieveordie self-assigned this Jan 1, 2023

achieveordie mentioned this issue Jan 1, 2023

[BUG] Diagnose and fix sporadic failures in the test suite due to MemoryError #4036

Merged

6 tasks

fkiraly changed the title ~~[BUG] two deep learning estimators failing CI on windows~~ [BUG] LSTM deep learning estimators failing CI on windows Jan 1, 2023

fkiraly mentioned this issue Jan 1, 2023

[MNT] skip LSTMFCNClassifier tests due to unfixed failure on main #4037

Merged

achieveordie mentioned this issue Jan 5, 2023

[ENH] Migrating ResNet Regressor from sktime_dl #3928

Closed

6 tasks

achieveordie closed this as completed in #4036 Jan 15, 2023

achieveordie mentioned this issue Jun 14, 2023

[BUG] Diagnose and fix recent memouts in CI #4702

Draft

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] LSTM deep learning estimators failing CI on windows #4033

[BUG] LSTM deep learning estimators failing CI on windows #4033

fkiraly commented Dec 31, 2022

fkiraly commented Dec 31, 2022

fkiraly commented Dec 31, 2022

achieveordie commented Jan 1, 2023

fkiraly commented Jan 1, 2023

[BUG] LSTM deep learning estimators failing CI on windows #4033

[BUG] LSTM deep learning estimators failing CI on windows #4033

Comments

fkiraly commented Dec 31, 2022

fkiraly commented Dec 31, 2022

fkiraly commented Dec 31, 2022

achieveordie commented Jan 1, 2023

fkiraly commented Jan 1, 2023