-
Notifications
You must be signed in to change notification settings - Fork 795
Open
Labels
bugSomething isn't workingSomething isn't workingconfirmedhipIssues related to execution on HIP backend.Issues related to execution on HIP backend.
Description
Describe the bug
Discovered in pre-commit run for #14209 which adds a few new E2E tests:
-- Testing: 2105 tests, 16 workers --
TIMEOUT: SYCL :: HostInteropTask/host-task-failure.cpp (2105 of 2105)
******************** TEST 'SYCL :: HostInteropTask/host-task-failure.cpp' FAILED ********************
Exit Code: -9
Timeout: Reached timeout of 600 seconds
Command Output (stdout):
--
# RUN: at line 1
/__w/llvm/llvm/toolchain/bin//clang++ -Xsycl-target-backend=amdgcn-amd-amdhsa --offload-arch=gfx10[31](https://github.com/intel/llvm/actions/runs/9973751804/job/27562247448?pr=14209#step:22:32) -fsycl -fsycl-targets=amdgcn-amd-amdhsa /__w/llvm/llvm/llvm/sycl/test-e2e/HostInteropTask/host-task-failure.cpp -o /__w/llvm/llvm/build-e2e/HostInteropTask/Output/host-task-failure.cpp.tmp.out
# executed command: /__w/llvm/llvm/toolchain/bin//clang++ -Xsycl-target-backend=amdgcn-amd-amdhsa --offload-arch=gfx1031 -fsycl -fsycl-targets=amdgcn-amd-amdhsa /__w/llvm/llvm/llvm/sycl/test-e2e/HostInteropTask/host-task-failure.cpp -o /__w/llvm/llvm/build-e2e/HostInteropTask/Output/host-task-failure.cpp.tmp.out
# note: command had no output on stdout or stderr
# RUN: at line 2
env ONEAPI_DEVICE_SELECTOR=hip:gpu /__w/llvm/llvm/build-e2e/HostInteropTask/Output/host-task-failure.cpp.tmp.out
# executed command: env ONEAPI_DEVICE_SELECTOR=hip:gpu /__w/llvm/llvm/build-e2e/HostInteropTask/Output/host-task-failure.cpp.tmp.out
# note: command had no output on stdout or stderr
# error: command failed with exit status: -9
# error: command reached timeout: True
--
To reproduce
Not sure how exactly to reproduce, i.e. it is clear how to run the test, but it is not clear what are the conditions of it hanging.
Environment
- OS: Linux
- Target device and vendor: AMD HIP
- DPC++ version: most recent
clang++from the repo - Dependencies version:
Platform [#1]:
Version : HIP 60140.9
Name : AMD HIP BACKEND
Vendor : AMD Corporation
Devices : 1
Device [#0]:
Type : gpu
Version : gfx1031
Name : AMD Radeon RX 6700 XT
Vendor : AMD Corporation
Driver : HIP 60[14](https://github.com/intel/llvm/actions/runs/9973751804/job/27562247448?pr=14209#step:17:15)0.9
Additional context
The test itself looks weird: it may be aimed to check for synchronization between host task and regular kernels, but it doesn't verify any results, meaning that it may pass even if both kernel and host_task are executed in parallel.
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't workingconfirmedhipIssues related to execution on HIP backend.Issues related to execution on HIP backend.