Skip to content

Conversation

@pramenku
Copy link

@pramenku pramenku commented Jan 6, 2025

@rocm-repo-management-api
Copy link

rocm-repo-management-api bot commented Jan 6, 2025

Jenkins build for 4c41f4963045b16576ce018e567fafe154bfa1a0 commit finished as FAILURE
Links: Blue Ocean view / Build artifacts

@pramenku
Copy link
Author

pramenku commented Jan 6, 2025

Reported issue is fixed by this PR,

Refer the log, http://rocm-ci.amd.com/job/mainline-framework-pytorch-ci/2200/console

@jithunnair-amd Please review and merge.

@pramenku pramenku changed the title Enable crb for python3-wheel install Enable crb for python3-wheel install+ add missing trition-rocm.txt Jan 7, 2025
@pramenku
Copy link
Author

pramenku commented Jan 7, 2025

http://rocm-ci.amd.com/job/mainline-framework-pytorch-ci/2201/console is triggered with the change.

@rocm-repo-management-api
Copy link

rocm-repo-management-api bot commented Jan 7, 2025

Jenkins build for dc77a5818e159281126e84f15cb24996238f181b commit finished as FAILURE
Links: Blue Ocean view / Build artifacts

@pramenku pramenku changed the title Enable crb for python3-wheel install+ add missing trition-rocm.txt [rocm6.4_internal_testing] Update missing changes for CentOS9 Jan 7, 2025
@pramenku
Copy link
Author

pramenku commented Jan 7, 2025

Triggered http://rocm-ci.amd.com/job/mainline-framework-pytorch-ci/2203/console to verify the PR

@jithunnair-amd for CS9 too, these changes are missing. FYI

@rocm-repo-management-api
Copy link

rocm-repo-management-api bot commented Jan 7, 2025

Jenkins build for bf1285d4f4dc7a74378df15284637cb640a04b0a commit finished as FAILURE
Links: Blue Ocean view / Build artifacts

@pramenku
Copy link
Author

pramenku commented Jan 8, 2025

changed the logic of CS9 dockerfile as per Ubuntu, https://github.com/ROCm/pytorch/blob/rocm6.4_internal_testing/.ci/docker/ubuntu-rocm/Dockerfile#L105

There is no more trition-rocm.txt and it has changed to trition.txt for Ubuntu but missing in CS9 dockerfile

http://rocm-ci.amd.com/job/mainline-framework-pytorch-ci/2208/console , replayed with new changes to verify

cc: @jithunnair-amd

@rocm-repo-management-api
Copy link

rocm-repo-management-api bot commented Jan 8, 2025

Jenkins build for 3eaa0acb6108cb74fba675964b7b35371e7d7c23 commit finished as FAILURE
Links: Blue Ocean view / Build artifacts

@jithunnair-amd
Copy link
Collaborator

Thanks for the fixes, @pramenku

@jithunnair-amd jithunnair-amd merged commit 956c145 into rocm6.4_internal_testing Jan 8, 2025
1 check failed
@jithunnair-amd jithunnair-amd deleted the pramenku-patch-2 branch January 8, 2025 15:56
@pramenku
Copy link
Author

Thanks for the fixes, @pramenku

Thanks for reviewing and merging @jithunnair-amd

ethanwee1 added a commit that referenced this pull request Jan 28, 2025
dnikolaev-amd pushed a commit that referenced this pull request Apr 24, 2025
====================================================

[SOW MS3] Centos stream9 PyTorch image support (#1090)

* changes to build Centos stream 9 images

* Added scripts for centos and centos stream images

* Added an extra line

* Add ninja installation

* Optimized code

* Fixes

* Add comment

* Optimized code

* Added AMDGPU mapping for ROCm 5.2 and invalid-url for rocm_baseurl

Co-authored-by: Jithun Nair <37884920+jithunnair-amd@users.noreply.github.com>

Updated to latest conda for CentOS stream 9

[CS9] Updates to CentOS stream 9 build (#1326)

- Add missing common_utils.sh
- Update the install vision part
- Move to amdgpu rhel 9.3 builds
- Update to pick python from conda path
- Add a missing package
- Add ROCM_PATH and magma
- Updated repo radeon path

(cherry picked from commit 51ce1cc)

[rocm6.4_internal_testing] Update missing changes for CentOS9 (#1813)

To fix, https://ontrack-internal.amd.com/browse/SWDEV-505385 and
https://ontrack-internal.amd.com/browse/SWDEV-507301

(cherry picked from commit 956c145)

delete .ci/docker/common/install_db.sh
pragupta pushed a commit that referenced this pull request Oct 29, 2025
====================================================

[SOW MS3] Centos stream9 PyTorch image support (#1090)

* changes to build Centos stream 9 images

* Added scripts for centos and centos stream images

* Added an extra line

* Add ninja installation

* Optimized code

* Fixes

* Add comment

* Optimized code

* Added AMDGPU mapping for ROCm 5.2 and invalid-url for rocm_baseurl

Co-authored-by: Jithun Nair <37884920+jithunnair-amd@users.noreply.github.com>

Updated to latest conda for CentOS stream 9

[CS9] Updates to CentOS stream 9 build (#1326)

- Add missing common_utils.sh
- Update the install vision part
- Move to amdgpu rhel 9.3 builds
- Update to pick python from conda path
- Add a missing package
- Add ROCM_PATH and magma
- Updated repo radeon path

(cherry picked from commit 51ce1cc)

[rocm6.4_internal_testing] Update missing changes for CentOS9 (#1813)

To fix, https://ontrack-internal.amd.com/browse/SWDEV-505385 and
https://ontrack-internal.amd.com/browse/SWDEV-507301

(cherry picked from commit 956c145)

delete .ci/docker/common/install_db.sh

(cherry picked from commit 8a7fd64)

CONSOLIDATED COMMITS: Updates to build on Jammy and CentOS7

===========================================================

Updates to build on Jammy
- Fortran package installation moved after gcc
- Update libtinfo search code in cmake1
- Install libstdc++.so

[UB22.04] Updates to support latest scipy

Build required version of libpng for CentOS7

Updated condition for libstc++ for Jammy

Set ROCM_PATH in env for centOS docker container

Changes to support docker v23

Reversed the condition as required

temporarily ignore certificate check for Miniconda

(cherry picked from commit 9848db1)

[release/2.1] Skip certificate check for CentOS7 since certificate expired (#1399)

* Skip certificate check only for CentOS7 since certificate expired

* Naming

Remove the installation of rocm-llvm-dev package

- Causing regression - SWDEV-463083

fix install_centos() function

[rocm6.3_internal_testing] skip pytorch-nightly installstion (#1557)

This PR skips pytorch-nightly installation in docker images

Installation of pytorch-nightly is needed to prefetch mobilenet_v2 avd
v3 models for some tests.
Came from

85bd6bc
Models are downloaded on first use to the folder /root/.cache/torch/hub
But pytorch-nightly installation also overrides
.ci/docker/requirements-ci.txt settings and upgrades some of python
packages (sympy from 1.12.0 to 1.13.0) which causes several
'dynamic_shapes' tests to fail
Skip prefetching models affects these tests without any errors (but
**internet access required**):

- python test/mobile/model_test/gen_test_model.py mobilenet_v2
- python test/quantization/eager/test_numeric_suite_eager.py -k
test_mobilenet_v3

Issue ROCm/frameworks-internal#8772

Also, in case of some issues these models can be prefetched after
pytorch building and before testing

(cherry picked from commit b92b34d)

Fixes #ISSUE_NUMBER

(cherry picked from commit ec70f7e)

[rocm6.4_internal_testing] Changes to support UB 24.04 build (#1817)

Changes applied from #1816

Test PyTorch build:
http://rocm-ci.amd.com/job/mainline-framework-pytorch-ub24.04-py312-internal/5/

(cherry picked from commit 74e1e9e)
(cherry picked from commit e7cb7cc)

Update Centos 9 build

(cherry picked from commit 3d6ba22)

[rocm6.5_internal_testing] remove centos.stream dockerfile and move contents into dockerfile (#2044)

rocm6.5_internal_testing move contents of centos stream dockerfile into
dockerfile

Validation:
http://rocm-ci.amd.com/job/mainline-framework-pytorch-ci/2448/

---------

Co-authored-by: Jithun Nair <jithun.nair@amd.com>
(cherry picked from commit 7886773)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants