Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] LSTM deep learning estimators failing CI on windows #4033

Closed
fkiraly opened this issue Dec 31, 2022 · 4 comments · Fixed by #4036
Closed

[BUG] LSTM deep learning estimators failing CI on windows #4033

fkiraly opened this issue Dec 31, 2022 · 4 comments · Fixed by #4036
Assignees
Labels
bug Something isn't working module:classification classification module: time series classification

Comments

@fkiraly
Copy link
Collaborator

fkiraly commented Dec 31, 2022

Since recently, a failure of two deep learning estimators has been appearing on windows CI:

FAILED sktime/tests/test_all_estimators.py::TestAllEstimators::test_methods_have_no_side_effects[MLPClassifier-1-ClassifierFitPredictMultivariate-predict]
FAILED sktime/tests/test_all_estimators.py::TestAllEstimators::test_methods_have_no_side_effects[LSTMFCNClassifier-0-ClassifierFitPredictMultivariate-predict]

Note that the failure appears to be only on python 3.9, but this is due to the matrix design which spreads estimators across version/OS combinations.

In theory, it could be a 3.9 specific failure, but I think that is less likely than windows specifity (although it may be worth to test that by turning the matrixdesign flag off in the CI).

@fkiraly fkiraly added the bug Something isn't working label Dec 31, 2022
@fkiraly
Copy link
Collaborator Author

fkiraly commented Dec 31, 2022

This does not seem clearly related to any recent PR merge to main.

@achieveordie, @AurumnPegasus, @jnrusson1, @solen0id, do you have a clue what is happening here?

@fkiraly fkiraly added the module:classification classification module: time series classification label Dec 31, 2022
@fkiraly
Copy link
Collaborator Author

fkiraly commented Dec 31, 2022

traceback is not very informative:

 ================================== FAILURES ===================================
_____________________ sktime/tests/test_all_estimators.py _____________________
[gw0] win32 -- Python 3.9.12 C:\Miniconda\envs\test\python.exe
worker 'gw0' crashed while running 'sktime/tests/test_all_estimators.py::TestAllEstimators::test_methods_have_no_side_effects[MLPClassifier-1-ClassifierFitPredictMultivariate-predict]'
_____________________ sktime/tests/test_all_estimators.py _____________________
[gw1] win32 -- Python 3.9.12 C:\Miniconda\envs\test\python.exe
worker 'gw1' crashed while running 'sktime/tests/test_all_estimators.py::TestAllEstimators::test_methods_have_no_side_effects[LSTMFCNClassifier-0-ClassifierFitPredictMultivariate-predict]'

fkiraly added a commit that referenced this issue Dec 31, 2022
Skips two recent test failures on `main` until diagnosed and fixed.

Failures are described in #4033
@achieveordie
Copy link
Collaborator

LSTMFCN was added last week, right? I suspect the problem is arising from that estimator.

We have seen similar problems back when TapNet was ported, OutOfMemory/MemoryError error arising from one worker can crash the entire test suite. Hence we see MLPClassifier failing, just because it happened to be tested on the same system.

The reason I think LSTMFCN is causing the failures is that its get_test_params() currently only reduces the number of epochs, the size of the model remains the same.

Let me take this up and diagnose if there are any more problems in it.

@achieveordie achieveordie self-assigned this Jan 1, 2023
@fkiraly fkiraly changed the title [BUG] two deep learning estimators failing CI on windows [BUG] LSTM deep learning estimators failing CI on windows Jan 1, 2023
@fkiraly
Copy link
Collaborator Author

fkiraly commented Jan 1, 2023

Yes, it seems to be LSTMFCN - it also pops up in other failures recently.

fkiraly added a commit that referenced this issue Jan 2, 2023
…#4037)

This skips all tests for `LSTMFCNClassifier` due to unfixed sporadic failures on `main`, see this issue: #4033
After fixing, we should add the tests back.

Also removes an orphaned comment related to this old issue which has been superseded in the exclusion list: #1627
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working module:classification classification module: time series classification
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants