HostInteropTask/host-task-failure.cpp timed out on an unrelated change #14613

@AlexeySachkov

Describe the bug

Discovered in a pre-commit run for #14209, which adds a few new E2E tests:

-- Testing: 2105 tests, 16 workers --
TIMEOUT: SYCL :: HostInteropTask/host-task-failure.cpp (2105 of 2105)
******************** TEST 'SYCL :: HostInteropTask/host-task-failure.cpp' FAILED ********************
Exit Code: -9
Timeout: Reached timeout of 600 seconds

Command Output (stdout):
--
# RUN: at line 1
/__w/llvm/llvm/toolchain/bin//clang++  -Xsycl-target-backend=amdgcn-amd-amdhsa --offload-arch=gfx1031 -fsycl -fsycl-targets=amdgcn-amd-amdhsa  /__w/llvm/llvm/llvm/sycl/test-e2e/HostInteropTask/host-task-failure.cpp -o /__w/llvm/llvm/build-e2e/HostInteropTask/Output/host-task-failure.cpp.tmp.out
# executed command: /__w/llvm/llvm/toolchain/bin//clang++ -Xsycl-target-backend=amdgcn-amd-amdhsa --offload-arch=gfx1031 -fsycl -fsycl-targets=amdgcn-amd-amdhsa /__w/llvm/llvm/llvm/sycl/test-e2e/HostInteropTask/host-task-failure.cpp -o /__w/llvm/llvm/build-e2e/HostInteropTask/Output/host-task-failure.cpp.tmp.out
# note: command had no output on stdout or stderr
# RUN: at line 2
env ONEAPI_DEVICE_SELECTOR=hip:gpu  /__w/llvm/llvm/build-e2e/HostInteropTask/Output/host-task-failure.cpp.tmp.out
# executed command: env ONEAPI_DEVICE_SELECTOR=hip:gpu /__w/llvm/llvm/build-e2e/HostInteropTask/Output/host-task-failure.cpp.tmp.out
# note: command had no output on stdout or stderr
# error: command failed with exit status: -9
# error: command reached timeout: True

--

To reproduce

Not sure how exactly to reproduce: it is clear how to run the test, but it is not clear under what conditions it hangs.
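
Since the hang appears to be intermittent, one way to look for it locally is to run the compiled test many times under a short per-run time limit. Below is a minimal sketch, assuming a POSIX shell and the coreutils `timeout` utility are available; the helper name `repeat_until_hang` is hypothetical and not part of the test suite.

```shell
# Hypothetical helper (not part of the SYCL test suite): run a command
# repeatedly under a per-run time limit and stop at the first run that
# exceeds it.
# Usage: repeat_until_hang <attempts> <limit-seconds> <command> [args...]
repeat_until_hang() {
  attempts=$1
  limit=$2
  shift 2
  i=1
  while [ "$i" -le "$attempts" ]; do
    # coreutils `timeout` exits with status 124 when the limit is reached
    timeout "$limit" "$@"
    rc=$?
    if [ "$rc" -eq 124 ]; then
      echo "run $i: timed out after ${limit}s"
      return 1
    fi
    i=$((i + 1))
  done
  echo "no timeout in $attempts runs"
  return 0
}
```

For this issue it could be invoked as `repeat_until_hang 100 60 env ONEAPI_DEVICE_SELECTOR=hip:gpu <path to host-task-failure.cpp.tmp.out>`. The 100 attempts and the 60-second limit are arbitrary assumptions (the CI run used a 600-second timeout), and non-timeout failures of the test are deliberately ignored by this sketch.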

Environment

  • OS: Linux
  • Target device and vendor: AMD HIP
  • DPC++ version: most recent clang++ from the repo
  • Dependencies version:
Platform [#1]:
    Version  : HIP 60140.9
    Name     : AMD HIP BACKEND
    Vendor   : AMD Corporation
    Devices  : 1
        Device [#0]:
        Type              : gpu
        Version           : gfx1031
        Name              : AMD Radeon RX 6700 XT
        Vendor            : AMD Corporation
        Driver            : HIP 60140.9

Additional context

The test itself looks weird: it may be intended to check synchronization between a host task and regular kernels, but it doesn't verify any results, so it may pass even if the kernel and the host_task are executed in parallel.


Labels: bug (Something isn't working), confirmed, hip (Issues related to execution on HIP backend)
