[OpenCL] "Cannot allocate memory" issues on PPC64 #2703

pierrepaleo · 2019-08-01T15:51:45Z

Any OpenCL test run on our Power9 machine results in the following error (with my environment):

PYOPENCL_CTX="0:1" ./run_tests.py silx.opencl.test.test_addition.suite
[...]
OSError: [Errno 12] Cannot allocate memory

The reason is linked to scikit-cuda:

On one hand, scikit-cuda creates a CUBLAS context to get the version number when imported.
On the other hand, silx creates an OpenCL context on all present devices to pick the best one.

For some reason doing (1) then (2) succeeds, but doing (2) then (1) fails on Power9.

The following fails:

from silx.opencl.convolution import Convolution                                                                                                                                       
from silx.math.fft.cufft import CUFFT

The following succeeds:

from silx.math.fft.cufft import CUFFT                                                                                                                                                  
from silx.opencl.convolution import Convolution

A workaround is to modify the order of imports.

The text was updated successfully, but these errors were encountered:

pierrepaleo · 2019-08-02T08:57:21Z

The OpenCL contexts on all visible devices seem to be created when calling pyopencl.get_platforms().
This occurs on both our Power9 and DGX1 servers. It might be due to the nvidia-persistenced daemon.

For now I see no obvious bugfix apart from being careful in the imports order.

pierrepaleo added the bug label Aug 1, 2019

pierrepaleo added this to In plan in OpenCL via automation Aug 1, 2019

pierrepaleo mentioned this issue Oct 10, 2019

[Opencl] Move non-opencl utilities #2782

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OpenCL] "Cannot allocate memory" issues on PPC64 #2703

[OpenCL] "Cannot allocate memory" issues on PPC64 #2703

pierrepaleo commented Aug 1, 2019

pierrepaleo commented Aug 2, 2019

[OpenCL] "Cannot allocate memory" issues on PPC64 #2703

[OpenCL] "Cannot allocate memory" issues on PPC64 #2703

Comments

pierrepaleo commented Aug 1, 2019

pierrepaleo commented Aug 2, 2019