-
Notifications
You must be signed in to change notification settings - Fork 75
[rocm6.4_internal_testing] Update missing changes for CentOS9 #1813
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Jenkins build for 4c41f4963045b16576ce018e567fafe154bfa1a0 commit finished as FAILURE |
|
Reported issue is fixed by this PR, Refer the log, http://rocm-ci.amd.com/job/mainline-framework-pytorch-ci/2200/console @jithunnair-amd Please review and merge. |
|
http://rocm-ci.amd.com/job/mainline-framework-pytorch-ci/2201/console is triggered with the change. |
|
Jenkins build for dc77a5818e159281126e84f15cb24996238f181b commit finished as FAILURE |
|
Triggered http://rocm-ci.amd.com/job/mainline-framework-pytorch-ci/2203/console to verify the PR @jithunnair-amd for CS9 too, these changes are missing. FYI |
|
Jenkins build for bf1285d4f4dc7a74378df15284637cb640a04b0a commit finished as FAILURE |
|
changed the logic of CS9 dockerfile as per Ubuntu, https://github.com/ROCm/pytorch/blob/rocm6.4_internal_testing/.ci/docker/ubuntu-rocm/Dockerfile#L105 There is no more trition-rocm.txt and it has changed to trition.txt for Ubuntu but missing in CS9 dockerfile http://rocm-ci.amd.com/job/mainline-framework-pytorch-ci/2208/console , replayed with new changes to verify cc: @jithunnair-amd |
|
Jenkins build for 3eaa0acb6108cb74fba675964b7b35371e7d7c23 commit finished as FAILURE |
|
Thanks for the fixes, @pramenku |
Thanks for reviewing and merging @jithunnair-amd |
==================================================== [SOW MS3] Centos stream9 PyTorch image support (#1090) * changes to build Centos stream 9 images * Added scripts for centos and centos stream images * Added an extra line * Add ninja installation * Optimized code * Fixes * Add comment * Optimized code * Added AMDGPU mapping for ROCm 5.2 and invalid-url for rocm_baseurl Co-authored-by: Jithun Nair <37884920+jithunnair-amd@users.noreply.github.com> Updated to latest conda for CentOS stream 9 [CS9] Updates to CentOS stream 9 build (#1326) - Add missing common_utils.sh - Update the install vision part - Move to amdgpu rhel 9.3 builds - Update to pick python from conda path - Add a missing package - Add ROCM_PATH and magma - Updated repo radeon path (cherry picked from commit 51ce1cc) [rocm6.4_internal_testing] Update missing changes for CentOS9 (#1813) To fix, https://ontrack-internal.amd.com/browse/SWDEV-505385 and https://ontrack-internal.amd.com/browse/SWDEV-507301 (cherry picked from commit 956c145) delete .ci/docker/common/install_db.sh
==================================================== [SOW MS3] Centos stream9 PyTorch image support (#1090) * changes to build Centos stream 9 images * Added scripts for centos and centos stream images * Added an extra line * Add ninja installation * Optimized code * Fixes * Add comment * Optimized code * Added AMDGPU mapping for ROCm 5.2 and invalid-url for rocm_baseurl Co-authored-by: Jithun Nair <37884920+jithunnair-amd@users.noreply.github.com> Updated to latest conda for CentOS stream 9 [CS9] Updates to CentOS stream 9 build (#1326) - Add missing common_utils.sh - Update the install vision part - Move to amdgpu rhel 9.3 builds - Update to pick python from conda path - Add a missing package - Add ROCM_PATH and magma - Updated repo radeon path (cherry picked from commit 51ce1cc) [rocm6.4_internal_testing] Update missing changes for CentOS9 (#1813) To fix, https://ontrack-internal.amd.com/browse/SWDEV-505385 and https://ontrack-internal.amd.com/browse/SWDEV-507301 (cherry picked from commit 956c145) delete .ci/docker/common/install_db.sh (cherry picked from commit 8a7fd64) CONSOLIDATED COMMITS: Updates to build on Jammy and CentOS7 =========================================================== Updates to build on Jammy - Fortran package installation moved after gcc - Update libtinfo search code in cmake1 - Install libstdc++.so [UB22.04] Updates to support latest scipy Build required version of libpng for CentOS7 Updated condition for libstc++ for Jammy Set ROCM_PATH in env for centOS docker container Changes to support docker v23 Reversed the condition as required temporarily ignore certificate check for Miniconda (cherry picked from commit 9848db1) [release/2.1] Skip certificate check for CentOS7 since certificate expired (#1399) * Skip certificate check only for CentOS7 since certificate expired * Naming Remove the installation of rocm-llvm-dev package - Causing regression - SWDEV-463083 fix install_centos() function [rocm6.3_internal_testing] skip pytorch-nightly installstion (#1557) This PR skips pytorch-nightly installation in docker images Installation of pytorch-nightly is needed to prefetch mobilenet_v2 avd v3 models for some tests. Came from 85bd6bc Models are downloaded on first use to the folder /root/.cache/torch/hub But pytorch-nightly installation also overrides .ci/docker/requirements-ci.txt settings and upgrades some of python packages (sympy from 1.12.0 to 1.13.0) which causes several 'dynamic_shapes' tests to fail Skip prefetching models affects these tests without any errors (but **internet access required**): - python test/mobile/model_test/gen_test_model.py mobilenet_v2 - python test/quantization/eager/test_numeric_suite_eager.py -k test_mobilenet_v3 Issue ROCm/frameworks-internal#8772 Also, in case of some issues these models can be prefetched after pytorch building and before testing (cherry picked from commit b92b34d) Fixes #ISSUE_NUMBER (cherry picked from commit ec70f7e) [rocm6.4_internal_testing] Changes to support UB 24.04 build (#1817) Changes applied from #1816 Test PyTorch build: http://rocm-ci.amd.com/job/mainline-framework-pytorch-ub24.04-py312-internal/5/ (cherry picked from commit 74e1e9e) (cherry picked from commit e7cb7cc) Update Centos 9 build (cherry picked from commit 3d6ba22) [rocm6.5_internal_testing] remove centos.stream dockerfile and move contents into dockerfile (#2044) rocm6.5_internal_testing move contents of centos stream dockerfile into dockerfile Validation: http://rocm-ci.amd.com/job/mainline-framework-pytorch-ci/2448/ --------- Co-authored-by: Jithun Nair <jithun.nair@amd.com> (cherry picked from commit 7886773)
To fix, https://ontrack-internal.amd.com/browse/SWDEV-505385 and https://ontrack-internal.amd.com/browse/SWDEV-507301