Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ML] Upgrade to PyTorch 2.3.1 #2688

Merged
merged 12 commits into from
Jul 15, 2024
Merged

Conversation

edsavage
Copy link
Contributor

@edsavage edsavage commented Jul 8, 2024

Update Docker images and dependency files with PyTorch 2.3.1.

Testing on Linux x86_64 gives promising indications that this version of
PyTorch may resolve some memory allocation issues related to the
pytorch_inference process.

Opening this PR to better test it across the range of platforms and
architectures that we support.

Update Docker images and dependecy files with PyTorch 2.3.1.

Testing on Linux x86_64 gives promising indications that this version of
PyTorch may resolve some memory allocation issues related to the
`pytorch_inference` process.

Opening this PR to better test it across the range of platforms and
architectures that we support.
@edsavage edsavage added >build :ml dependencies Pull requests that update a dependency file ci:run-qa-tests Run a subset of the QA tests v8.16.0 labels Jul 8, 2024
@edsavage edsavage marked this pull request as draft July 8, 2024 05:07
@edsavage
Copy link
Contributor Author

edsavage commented Jul 8, 2024

Compilation errors for the macOS intel cross compilation build are due to a (now) too old compiler version

In file included from /buildkite/builds/bk-agent-prod-k8s-1720411844430980726/elastic/ml-cpp-pr-builds/bin/pytorch_inference/CResultWriter.cc:12:
In file included from /buildkite/builds/bk-agent-prod-k8s-1720411844430980726/elastic/ml-cpp-pr-builds/bin/pytorch_inference/CResultWriter.h:21:
In file included from /usr/local/sysroot-x86_64-apple-macosx10.14/usr/local/include/pytorch/torch/csrc/api/include/torch/types.h:3:
In file included from /usr/local/sysroot-x86_64-apple-macosx10.14/usr/local/include/pytorch/ATen/ATen.h:7:
In file included from /usr/local/sysroot-x86_64-apple-macosx10.14/usr/local/include/pytorch/ATen/Context.h:21:
In file included from /usr/local/sysroot-x86_64-apple-macosx10.14/usr/local/include/pytorch/c10/util/CallOnce.h:8:
/usr/local/sysroot-x86_64-apple-macosx10.14/usr/local/include/pytorch/c10/util/C++17.h:18:2: error: "You're trying to build PyTorch with a too old version of Clang. We need Clang 9 or later."

and for the macOS aarch64 build there's a problem with tar and some of the flags in the dependency archive.

@edsavage edsavage marked this pull request as ready for review July 10, 2024 22:17
Copy link
Contributor

@valeriy42 valeriy42 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Just some minor suggestions.


(cd ${REPO_ROOT} && ./gradlew --info -Dbuild.version_qualifier=${VERSION_QUALIFIER:-} -Dbuild.snapshot=$BUILD_SNAPSHOT -Dbuild.ml_debug=$ML_DEBUG $TASKS) || TEST_OUTCOME=$?
(cd ${REPO_ROOT} && ./gradlew --info -Dbuild.version_qualifier=${VERSION_QUALIFIER:-} -Dbuild.snapshot=$BUILD_SNAPSHOT -Dbuild.ml_debug=$ML_DEBUG $TASKS) || TEST_OUTCOME=$?
else
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this the branch for testing Windows? If so, maybe you can add a comment so we can find our way around this file more easily in the future.

docs/CHANGELOG.asciidoc Outdated Show resolved Hide resolved
edsavage and others added 2 commits July 15, 2024 14:21
Co-authored-by: Valeriy Khakhutskyy <1292899+valeriy42@users.noreply.github.com>
@edsavage edsavage merged commit b8e6746 into elastic:main Jul 15, 2024
14 checks passed
edsavage added a commit to edsavage/ml-cpp that referenced this pull request Aug 14, 2024
Update Docker images and dependency files with PyTorch 2.3.1.

Testing on Linux x86_64 gives promising indications that this version of
PyTorch may resolve some memory allocation issues related to the
`pytorch_inference` process.

Co-authored-by: Valeriy Khakhutskyy <1292899+valeriy42@users.noreply.github.com>
edsavage added a commit to edsavage/ml-cpp that referenced this pull request Aug 14, 2024
…#2705)

Release notes need updating following backport of PyTorch 2.3.1 changes
to 8.15.1
edsavage added a commit to edsavage/ml-cpp that referenced this pull request Aug 14, 2024
…#2705)

Release notes need updating following backport of PyTorch 2.3.1 changes
to 8.15.1
edsavage added a commit that referenced this pull request Aug 18, 2024
Update Docker images and dependency files with PyTorch 2.3.1.

Testing on Linux x86_64 gives promising indications that this version of
PyTorch may resolve some memory allocation issues related to the
`pytorch_inference` process.

Co-authored-by: Valeriy Khakhutskyy <1292899+valeriy42@users.noreply.github.com>

Backports #2688
edsavage added a commit that referenced this pull request Aug 18, 2024
Release notes need updating following backport of PyTorch 2.3.1 changes
to 8.15.1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
>build ci:run-qa-tests Run a subset of the QA tests dependencies Pull requests that update a dependency file :ml v8.16.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants