-
Notifications
You must be signed in to change notification settings - Fork 142
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
./insmod.sh fails #41
Comments
Ehsan, insmod.sh requires that the user issuing the command have sudo privileges. |
I definitely have a root permission. Let me copy-paste what I get when running "make": And the tail of the `make PREFIX=/easybuild/work/gdrcopy/install CUDA=/software/CUDA/9.1.85 all install cc -O2 -fPIC -I /software/CUDA/9.1.85/include -I gdrdrv/ -I /software/CUDA/9.1.85/include -D GDRAPI_ARCH=X86 -c -o gdrapi.o gdrapi.c I am building against CUDA/9.1.85. |
I made some progress with the previous errors, and now, I get a new error: |
Hard to tell. |
Alright ... I'm coming back to this ticket, because I need gdrcopy for a CUDA-aware OpenMPI. I am attaching the redirected stderr/stdout from building gdrcopy in here, together with the very simple build script I am using. Furthermore, I need to know what is expected to be inside NVIDIA_SRC_DIR? |
I would like to attract your attention to this ticket. In fact, my installation of CUDA-aware MPI is pending on compiling gdrcopy. Could you please take a look at my error logs, and also the questions I raised above? |
Ehsan,
|
Thanks Davide for your message; it brought some activity back to this ticket. |
That kernel module build error is discussed on the net, e.g. on RH/CentOS forums/bugzilla. |
Thanis Davide for the hint. For some reason, when I use GCC/6.4.0 module on our compute nodes (with |
BTW gdrdrv is a kernel module, which takes advantage of the Linux kernel build system, i.e. it does not have its own build system. |
dear , how dou you fix the problem "insmod: ERROR: could not insert module gdrdrv/gdrdrv.ko: Unknown symbol in module" ? i |
Hi @zhuanwancaishi , There are multiple possibilities:
|
Dear,
We have several GPU nodes (Skylake processors with 4x P100 cards per each node), and I would like to test if the RDMA is available on these nodes or not.
When I try to build the gdrcopy, I get the following error message:
mknod: ‘/dev/gdrdrv’: Operation not permitted
Here is the specification of the host:
$> uname -a Linux r23g34 3.10.0-693.21.1.el7.x86_64 #1 SMP Wed Mar 7 19:03:37 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux
In fact, there is not such a file at
/dev/gdrdrv
on our current system. Do you have an idea what is wrong here?Thanks
Ehsan
The text was updated successfully, but these errors were encountered: