
Conversation

@xinyazhang

Fixes SWDEV-540240, SWDEV-540309, SWDEV-539989

...

80cca70
created a static global variable whose initializer called `at::cuda::warp_size()`. That call queries device properties and therefore requires a visible GPU, but GPUs are not present on CPU-only build systems, so the static initializer fails there.

Convert the static variable into a static function, so the warp size is queried lazily on first call instead of during static initialization.

http://rocm-ci.amd.com/job/pyt_whl_docker_mainline/1461/artifact/build_artifacts.txt/*view*/

Ran microbenchmark to confirm basic functionality:

```
root@ubb4-rack-22:/var/lib/jenkins/pytorch-micro-benchmarking# python3 micro_benchmarking_pytorch.py --network resnet50
INFO: running forward and backward for warmup.
INFO: running the benchmark..
OK: finished running benchmark..
--------------------SUMMARY--------------------------
Microbenchmark for network : resnet50
Num devices: 1
Dtype: FP32
Mini batch size [img] : 64
Time per mini-batch : 0.10158218145370483
Throughput [img/sec] : 630.0317544289736
```

@xinyazhang xinyazhang changed the title [rocm7.0_internal_testing] Prevent static initialization of at::cuda::warp_size() (#2293) [release/2.4] Prevent static initialization of at::cuda::warp_size() (#2293) Jul 2, 2025
@xinyazhang xinyazhang changed the title [release/2.4] Prevent static initialization of at::cuda::warp_size() (#2293) [release/2.4] Prevent static initialization of at::cuda::warp_size() (Backport #2293) Jul 2, 2025
@rocm-repo-management-api

rocm-repo-management-api bot commented Jul 2, 2025

Jenkins build for fd2a0432ae459fdabb6d3e5651ff4b918ab947fa commit finished as FAILURE
Links: Blue Ocean view / Build artifacts

@xinyazhang xinyazhang marked this pull request as ready for review July 2, 2025 16:13
@xinyazhang xinyazhang marked this pull request as draft July 2, 2025 19:44
@xinyazhang

Superseded by #2318

@xinyazhang xinyazhang closed this Jul 7, 2025
