Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ML] Upgrade MKL to version 2024.0 #2619

Merged
merged 2 commits into from Feb 9, 2024
Merged

Conversation

droberts195
Copy link
Contributor

After upgrading PyTorch to version 2.1.2 we encountered crashes when using the optimised version of ELSER v2 on Linux x86_64. The problem appeared to be caused either by a bug in the MKL version 2020.4 or else an incompatibility between that version of MKL and PyTorch 2.1.2. An upgrade to MKL version 2024.0 appears to resolve it.

Fixes #2617

After upgrading PyTorch to version 2.1.2 we encountered
crashes when using the optimised version of ELSER v2 on
Linux x86_64. The problem appeared to be caused either by
a bug in the MKL version 2020.4 or else an incompatibility
between that version of MKL and PyTorch 2.1.2. An upgrade
to MKL version 2024.0 appears to resolve it.

Fixes elastic#2617
Copy link
Contributor

@edsavage edsavage left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@droberts195 droberts195 added the ci:run-qa-tests Run a subset of the QA tests label Feb 5, 2024
@droberts195
Copy link
Contributor Author

buildkite run_qa_tests

@droberts195
Copy link
Contributor Author

buildkite test this please

@droberts195
Copy link
Contributor Author

buildkite run_qa_tests on linux

@droberts195
Copy link
Contributor Author

buildkite test this please

@droberts195 droberts195 merged commit 4924e41 into elastic:main Feb 9, 2024
13 checks passed
@droberts195 droberts195 deleted the upgrade_mkl branch February 9, 2024 23:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Fatal error: 'si_signo 11, si_code: 1, ..." when _infer using elser model, after pytorch 2.1 upgrade
2 participants