-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CPU Node [0] has no GPU connected #23
Comments
Can you check if you have a valid kfd device: 'ls -la /dev/kfd' The permissions on it should be 600. Also, check dmesg to see if your device is getting registered: 'dmesg | grep kfd' Send the output of that. |
The output of
and
|
This is showing me that the PCI interface you have the card plugged in doesn't support PCIe 3.0 atomics. The interface must support PCIe 3.0 (preferably x16 lanes) and the atomics extension. |
@jedwards-AMD My hardware consists of: I want to run it on virtual machine that does not support atomicop. Is it possible? |
No, it is not possible to run with out PCIe Atomics support, we need this for Signaling and some other functions with GFX8 and GFX9 GPU's. QEMU-KVM need to be augmented to support PCIe Atomics. |
@gstoner Thanks! |
It not sad, since this is functionality supported by Intel since the release of Ivybridge Xeon E5 v2, and in all Pentium, Core I3, Core I5, Core i7 since Haswell. It also supported in Ryzen, and EPYC processors. Also Cavium Thunder X and X2 ARM processors, and IBM Power9. Here is more information https://rocm.github.io/ROCmPCIeFeatures.html |
@gstoner |
Hi @insujang, ROCm supports docker, please check out the following repo for details: |
PCIe atomics are disabled by default at boot-up. The GPU driver enables it when it gets loaded. However, that doesn't work in the guest virtual machine. This has to be enabled in the hypervisor. I think a script that manually pokes PCI config space should do the trick. We've done this before, I need to find someone who knows the details ... |
Thank you for the suggestion, @sunway513. But I need a virtualized PCIe controller that I can modify for study. Anyway thank you :D |
@fxkamd Thank you for the information! I should find that script. Do you mean So I modified it return 0 without any condition, ROC runtime hang when I run the sample. |
I am continuously running into the same error when installing the ROCm stack from the Ubuntu repositories after a fresh system install (tried multiple times, starting with a fresh Ubuntu 16.10 install every time). The setup is using an AMD R9 Nano and runs Ubuntu 16.10.
After installing while following the instructions from here, (Both with and without rocm-opencl), running the included vector_copy sample results in the following output:
Installing and running
clinfo
results in:This is the
dmesg
output, where there are some ACPI errors that occur (line 699+) and further down some EDAC errors (1242+). Those errors do not appear when booting into the non-rocm kernel. Possibly related?The text was updated successfully, but these errors were encountered: