
Conversation


@xinyazhang commented on Jul 8, 2025

Backport of #2311

jeffdaily and others added 2 commits July 8, 2025 10:41
…:warp_size() (#2293)

Fixes SWDEV-540240, SWDEV-540309, SWDEV-539989

```
...
```

Commit 80cca70 created a static global variable that used `at::cuda::warp_size()` to initialize its value, which requires GPUs to be visible in order to query device properties. However, GPUs are not present on CPU-only build systems.

Convert the static variable into a static function, so the device query no longer runs during static initialization. A sketch of the pattern follows.
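
A minimal C++ sketch of the before/after pattern described above, assuming `at::cuda::warp_size()` is declared in `ATen/cuda/CUDAContext.h`; the names used here (`cached_warp_size`, `kWarpSize`) are illustrative and not taken from the actual patch:

```cpp
// Illustrative sketch only; identifiers are hypothetical and do not match
// the real PyTorch source. Assumes at::cuda::warp_size() is available via
// ATen/cuda/CUDAContext.h.
#include <ATen/cuda/CUDAContext.h>

// Before: a namespace-scope static is initialized when the library loads,
// so device properties are queried before any GPU is guaranteed to be
// visible (e.g. on a CPU-only build machine):
//
//   static const int kWarpSize = at::cuda::warp_size();

// After: wrapping the query in a function defers it to the first call,
// by which point the caller is actually running on a machine with a GPU.
static int cached_warp_size() {
  static const int kWarpSize = at::cuda::warp_size();  // lazily initialized on first use
  return kWarpSize;
}
```

Function-local statics are initialized on first use (and thread-safely since C++11), so the device query happens at most once and only when the code path is actually exercised.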

http://rocm-ci.amd.com/job/pyt_whl_docker_mainline/1461/artifact/build_artifacts.txt/*view*/

Ran microbenchmark to confirm basic functionality:
```
root@ubb4-rack-22:/var/lib/jenkins/pytorch-micro-benchmarking# python3 micro_benchmarking_pytorch.py --network resnet50
INFO: running forward and backward for warmup.
INFO: running the benchmark..
OK: finished running benchmark..
--------------------SUMMARY--------------------------
Microbenchmark for network : resnet50
Num devices: 1
Dtype: FP32
Mini batch size [img] : 64
Time per mini-batch : 0.10158218145370483
Throughput [img/sec] : 630.0317544289736
```
@xinyazhang changed the title from "Xinyazhang/rocm7.0torch2.4 enable mi350 testing" to "[release/2.5] Fix the Build on ROCM 7.0" on Jul 8, 2025
@xinyazhang marked this pull request as ready for review on July 8, 2025 15:44
@xinyazhang changed the title from "[release/2.5] Fix the Build on ROCM 7.0" to "[release/2.4] Fix the Build on ROCM 7.0" on Jul 8, 2025
@jithunnair-amd changed the title from "[release/2.4] Fix the Build on ROCM 7.0" to "[release/2.4] Fix the PyTorch build on ROCM 7.0" on Jul 8, 2025
@pruthvistony merged commit a0e5785 into release/2.4 on Jul 8, 2025
0 of 2 checks passed
@pruthvistony deleted the xinyazhang/rocm7.0torch2.4-enable_mi350_testing branch on July 8, 2025 21:47