New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
guarantee(false && "hipErrorNoBinaryForGpu: Coudn't find binary for current devices!") #21
Comments
With ROCm-Developer-Tools/HIP#2166 (comment) as a starting point, I traced what was causing the error:
|
Rebuilding So, like others who encountered this problem before me, I stumbled my way to a solution, but the root cause remains undetermined. I suspect it may have something to do with the build order or chroot isolation. |
Hi and thank you for all your work in putting this together.
I'm running into an error when trying to run a simple Python program. After having built and installed
tensorflow-rocm
and dependencies, I'm trying the following simple Python script:It produces the following output:
I don't know which binary it is referring to that it cannot find. Grepping strace output for ENOENT does not produce any illuminating results.
rocminfo
outputclinfo
outputAny advice on how to narrow down this problem would be appreciated.
Also, I don't know if this is related or not, but I'm slightly confused about the subject of kernel modules. Information such as here seems to say that I should expect to have
amdkfd
loaded. For me, it is not loaded, which may or may not be related to the problem above. However, despite this,/dev/kfd
exists androcminfo
says "ROCk module is loaded", and I see that this repository's README.md doesn't mention kernel modules. Regardless if I try to installrock-dkms-bin
, the DKMS build fails due to not being able to findkcl/backport/kcl_reservation_backport.h
et al.The text was updated successfully, but these errors were encountered: