[OpenMP][libomptarget][DeviceRTL] Removing ASSERT macro leads to test failures #64421

doru1004 · 2023-08-04T15:09:19Z

When replacing the ASSERT macro in Debug.h i.e.:

#define ASSERT(expr, msg)                                                      \
  {                                                                            \
    if (config::isDebugMode(config::DebugKind::Assertion) && !(expr))          \
      __assert_fail(#expr, msg, __FILE__, __LINE__, __PRETTY_FUNCTION__);      \
    else                                                                       \
      __assert_assume(expr);                                                   \
  }

with an empty assert:

#define ASSERT(expr, msg)

There are 4 OpenMP tests which fail:

  libomptarget :: amdgcn-amd-amdhsa :: jit/empty_kernel_lvl2.c
  libomptarget :: amdgcn-amd-amdhsa :: offloading/cuda_no_devices.c
  libomptarget :: amdgcn-amd-amdhsa :: offloading/std_complex_arithmetic.cpp
  libomptarget :: amdgcn-amd-amdhsa :: offloading/test_libc.cpp

Further investigation has revealed that:

Removing the ASSERT in omp_get_level() makes the jit/empty_kernel_lvl2.c test fail.

Removing the ASSERT in void *SharedMemorySmartStackTy::push(uint64_t Bytes) { leads to 3 tests failing:

  libomptarget :: amdgcn-amd-amdhsa :: offloading/cuda_no_devices.c
  libomptarget :: amdgcn-amd-amdhsa :: offloading/std_complex_arithmetic.cpp
  libomptarget :: amdgcn-amd-amdhsa :: offloading/test_libc.cpp

In addition to the tests above the OpenMP opt pass produces different results based on whether the ASSERTs are present in the DeviceRTL or not.

#include <omp.h>
#include <stdio.h>

int main(void) {
  int threads = 0;

#pragma omp target map(from:threads)
  {
    threads = omp_get_num_threads();
  }

  if (threads <= 0) {
    printf("Runtime error, threads=%d\n", threads);
  }
  printf("Success!\n");

  // INFO: 1 blocks and 256 threads in Generic-SPMD mode
  return 0;
}

If run with LIBOMPTARGET_INFO=16 enabled, the launch details will change as follows based on whether the ASSERTs are present in the code or not:
If the ASSERTs are present then the launch info is:

"PluginInterface" device 0 info: Launching kernel __omp_offloading_fd00_2448d7a_main_l6 with 1 blocks and 256 threads in Generic mode
AMDGPU device 0 info: #Args: 1 Teams x Thrds:    1x 256 (MaxFlatWorkGroupSize: 1024) LDS Usage: 0B #SGPRs/VGPRs: 4/0 #SGPR/VGPR Spills: 0/0 Tripcount: 0

If the ASSERTs are NOT present then the launch info is:

"PluginInterface" device 0 info: Launching kernel __omp_offloading_fd00_2448d7a_main_l6 with 1 blocks and 256 threads in Generic-SPMD mode
AMDGPU device 0 info: #Args: 1 Teams x Thrds:    1x 256 (MaxFlatWorkGroupSize: 1024) LDS Usage: 0B #SGPRs/VGPRs: 4/0 #SGPR/VGPR Spills: 0/0 Tripcount: 0

Note the Generic VS. Generic-SPMD mode.

The presence of the ASSERTs should not influence the outcome of the optimizer and the OpenMP lit tests should not fail when the ASSERTs are removed.

The text was updated successfully, but these errors were encountered:

llvmbot · 2023-08-04T15:10:33Z

@llvm/issue-subscribers-openmp

doru1004 · 2023-08-04T16:38:30Z

Note: the failures above are segmentation faults.

jdoerfert · 2023-08-04T17:46:42Z

Note: the failures above are segmentation faults.

We segfault because the shared stack is "too small" and the fallback, which is malloc, is not implemented on AMD. So the reason this crashes is the missing malloc, not the optimizer going bad or assume/assert being wrong. The reason it works otherwise is coincidental and not backed up by anything. We effectively ignore the fact that we are not having enough shared stack for the main thread to allocate 16 bytes, and still just do it.

get level is failing because we optimize better for that case. This is the actual (performance) bug we should address; optimize better if the ASSERT is present.

doru1004 · 2023-08-04T18:47:20Z

Thanks for looking into this Johannes! The explanation is very helpful in understanding what's happening.

Fixes: llvm#64421

github-actions bot added the new issue label Aug 4, 2023

doru1004 added openmp and removed new issue labels Aug 4, 2023

jdoerfert closed this as completed in 4fcd5f9 Aug 18, 2023

razmser pushed a commit to SuduIDE/llvm-project that referenced this issue Oct 2, 2023

[OpenMPOpt] Mark more runtime functions as SPMD compatible

47b33b7

Fixes: llvm#64421

razmser pushed a commit to SuduIDE/llvm-project that referenced this issue Oct 2, 2023

[OpenMPOpt] Mark more runtime functions as SPMD compatible

d8ef11c

Fixes: llvm#64421

razmser pushed a commit to SuduIDE/llvm-project that referenced this issue Oct 2, 2023

[OpenMPOpt] Mark more runtime functions as SPMD compatible

f81cf48

Fixes: llvm#64421

razmser pushed a commit to SuduIDE/llvm-project that referenced this issue Oct 3, 2023

[OpenMPOpt] Mark more runtime functions as SPMD compatible

cd53c7b

Fixes: llvm#64421

razmser pushed a commit to SuduIDE/llvm-project that referenced this issue Oct 3, 2023

[OpenMPOpt] Mark more runtime functions as SPMD compatible

5c47830

Fixes: llvm#64421

razmser pushed a commit to SuduIDE/llvm-project that referenced this issue Oct 6, 2023

[OpenMPOpt] Mark more runtime functions as SPMD compatible

0f7b366

Fixes: llvm#64421

razmser pushed a commit to SuduIDE/llvm-project that referenced this issue Oct 11, 2023

[OpenMPOpt] Mark more runtime functions as SPMD compatible

0579eff

Fixes: llvm#64421

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OpenMP][libomptarget][DeviceRTL] Removing ASSERT macro leads to test failures #64421

[OpenMP][libomptarget][DeviceRTL] Removing ASSERT macro leads to test failures #64421

doru1004 commented Aug 4, 2023

llvmbot commented Aug 4, 2023

doru1004 commented Aug 4, 2023

jdoerfert commented Aug 4, 2023

doru1004 commented Aug 4, 2023

[OpenMP][libomptarget][DeviceRTL] Removing ASSERT macro leads to test failures #64421

[OpenMP][libomptarget][DeviceRTL] Removing ASSERT macro leads to test failures #64421

Comments

doru1004 commented Aug 4, 2023

llvmbot commented Aug 4, 2023

doru1004 commented Aug 4, 2023

jdoerfert commented Aug 4, 2023

doru1004 commented Aug 4, 2023