Skip to content

[ROCm][release/2.12] Skip profiler check for foreach tests on ROCm (ROCM-21749)#3262

Merged
pragupta merged 1 commit into
release/2.12from
fix/rocm-21749-skip-profiler-foreach-2.12
May 28, 2026
Merged

[ROCm][release/2.12] Skip profiler check for foreach tests on ROCm (ROCM-21749)#3262
pragupta merged 1 commit into
release/2.12from
fix/rocm-21749-skip-profiler-foreach-2.12

Conversation

@srinivamd
Copy link
Copy Markdown

Summary

Fixes ROCM-21749: test_big_num_tensors__foreach_norm fails on all ROCm GPUs.

Root Cause

ForeachFuncWrapper.__call__ profiles a _foreach_norm call and checks for multi_tensor_apply_kernel in profiler output to verify the multi-tensor fast-path was taken. On CUDA, Kineto captures demangled kernel names like multi_tensor_apply_kernel[...]. On ROCm, ROCTracer/rocprofiler captures different symbols (hipLaunchKernel, vectorized_elementwise_kernel, etc.), so mta_called is always False and the assertion fails.

Fix

Add or TEST_WITH_ROCM to skip_profiler_check so the profiler-based assertion is bypassed on ROCm, consistent with the existing skip for CUDA 12.6/12.8 flakiness.

Test Plan

  • test_big_num_tensors__foreach_norm_* should now pass on ROCm GPUs (MI210, MI300X, MI350X)
  • No change in behavior on CUDA

Jira: https://amd-hub.atlassian.net/browse/ROCM-21749

…749)

ROCm's profiler (ROCTracer/rocprofiler) uses different kernel naming
than CUDA's Kineto profiler. The multi_tensor_apply_kernel symbol
never appears in ROCm profiler output, causing test_big_num_tensors
to fail with spurious AssertionError on all ROCm GPUs.

Add TEST_WITH_ROCM to the skip_profiler_check guard so the profiler-
based fast-path assertion is bypassed on ROCm, matching the existing
skip for CUDA 12.6/12.8 flakiness.
@rocm-repo-management-api
Copy link
Copy Markdown

rocm-repo-management-api Bot commented May 28, 2026

Jenkins build for 305c71767a71be234bdd2014d09c8b8fe029118f commit finished as FAILURE
Links: Pipeline Overview / Build artifacts / Test Results

@pragupta pragupta merged commit f8efdb3 into release/2.12 May 28, 2026
4 of 7 checks passed
@pragupta pragupta deleted the fix/rocm-21749-skip-profiler-foreach-2.12 branch May 28, 2026 15:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants