Skip to content

Build issues on ROCm with random_ltd extension #4753

@Hobbes-Le-Chat

Description

@Hobbes-Le-Chat

I saw this reported before, I am trying to install deepspeed suing the git repo

DS_BUILD_UTILS=1 DS_BUILD_CPU_ADAGRAD=1 DS_BUILD_RANDOM_LTD=1 DS_BUILD_CPU_ADAM=1 DS_BUILD_FUSED_ADAM=1 DS_BUILD_FUSED_LAMB=1 DS_BUILD_CCL_COMM=1 python setup.py build_ext -j80 bdist_wheel

I am hitting this error:

In file included from csrc/random_ltd/token_sort.hip:9:
In file included from csrc/includes/custom_hip_layers.h:17:
In file included from csrc/includes/context_hip.h:17:
csrc/includes/gemm_test_hip.h:312:13: warning: ignoring return value of function declared with 'nodiscard' attribute [-Wunused-result]
            hipDeviceSynchronize();
            ^~~~~~~~~~~~~~~~~~~~
csrc/includes/gemm_test_hip.h:306:13: warning: ignoring return value of function declared with 'nodiscard' attribute [-Wunused-result]
            hipDeviceSynchronize();
            ^~~~~~~~~~~~~~~~~~~~
csrc/includes/gemm_test_hip.h:222:24: note: in instantiation of function template specialization 'StridedGemmTest<__half>::Run<(lambda at csrc/includes/gemm_test_hip.h:222:35)>' requested here
        int algo_bw1 = Run(loops, [=](int algo) {
                       ^
csrc/includes/context_hip.h:160:53: note: in instantiation of member function 'StridedGemmTest<__half>::TestAlgo' requested here
            _gemm_algos.push_back(test_attn_scores->TestAlgo(100));
                                                    ^
In file included from csrc/random_ltd/token_sort.hip:9:
In file included from csrc/includes/custom_hip_layers.h:17:
In file included from csrc/includes/context_hip.h:17:
csrc/includes/gemm_test_hip.h:312:13: warning: ignoring return value of function declared with 'nodiscard' attribute [-Wunused-result]
            hipDeviceSynchronize();
            ^~~~~~~~~~~~~~~~~~~~
csrc/includes/gemm_test_hip.h:306:13: warning: ignoring return value of function declared with 'nodiscard' attribute [-Wunused-result]
            hipDeviceSynchronize();
            ^~~~~~~~~~~~~~~~~~~~
csrc/includes/gemm_test_hip.h:256:24: note: in instantiation of function template specialization 'StridedGemmTest<__half>::Run<(lambda at csrc/includes/gemm_test_hip.h:256:35)>' requested here
        int algo_bw2 = Run(loops, [=](int algo) {
                       ^
csrc/includes/context_hip.h:160:53: note: in instantiation of member function 'StridedGemmTest<__half>::TestAlgo' requested here
            _gemm_algos.push_back(test_attn_scores->TestAlgo(100));
                                                    ^
In file included from csrc/random_ltd/token_sort.hip:9:
In file included from csrc/includes/custom_hip_layers.h:17:
In file included from csrc/includes/context_hip.h:17:
csrc/includes/gemm_test_hip.h:312:13: warning: ignoring return value of function declared with 'nodiscard' attribute [-Wunused-result]
            hipDeviceSynchronize();
            ^~~~~~~~~~~~~~~~~~~~
17 warnings and 2 errors generated when compiling for gfx1030.
error: command '/opt/rocm-5.7.0/bin/hipcc' failed with exit code 1

Some info about my torch:

$ pip show torch 
Name: torch
Version: 2.2.0.dev20231128+rocm5.7
Summary: Tensors and Dynamic neural networks in Python with strong GPU acceleration
Home-page: https://pytorch.org/
Author: PyTorch Team
Author-email: packages@pytorch.org
License: BSD-3
Location: /env/rocm_57_p39_V2/lib/python3.9/site-packages
Requires: filelock, fsspec, jinja2, networkx, pytorch-triton-rocm, sympy, typing-extensions
Required-by: torchaudio, torchvision

Metadata

Metadata

Assignees

Labels

rocmAMD/ROCm/HIP issues

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions