I have a Radeon RX580, running Ubuntu 18.04, with amdgpu driver (18.20-579836) from AMD. Simple OpenCL works fine on the GPU, both in Blender Cycles, LuxMark, and darktable-cltest. (clinfo output)
ComputeCpp version is CE 0.8.0. (computecpp_info output)
After building TensorFlow (on the dev/amd_gpu branch, at tensorflow/tensorflow@00b0040 plus tensorflow/tensorflow#17508 cherry-picked on top of it), by following the instructions here, the models/tutorials/image/mnist sample outputs this error:
2018-05-17 01:15:34.090983: I ./tensorflow/core/common_runtime/sycl/sycl_device.h:70] Found following OpenCL devices:
2018-05-17 01:15:34.091027: I ./tensorflow/core/common_runtime/sycl/sycl_device.h:72] id: 0, type: GPU, name: Ellesmere, vendor: Advanced Micro Devices, Inc., profile: FULL_PROFILE
terminate called after throwing an instance of 'cl::sycl::detail::exception_implementation<(cl::sycl::detail::exception_types)7, cl::sycl::detail::exception_implementation<(cl::sycl::detail::exception_types)6, cl::sycl::exception> >'
Aborted (core dumped)
All I could figure out is that the actual exception is:
Error: [ComputeCpp:RT0100] Failed to build program
And it is thrown from somewhere inside PhiloxRandomOp::Compute, at least in this specific case. Trying to run the default generator from the textgenrnn project also results in the same error. I think I've also seen a similar exception coming from CastOpBase::Compute in a different sample.
I couldn't find out any more details about the error so far.
I think #77 might be related, but not quite the same.
I have a Radeon RX580, running Ubuntu 18.04, with amdgpu driver (18.20-579836) from AMD. Simple OpenCL works fine on the GPU, both in Blender Cycles, LuxMark, and darktable-cltest. (clinfo output)
ComputeCpp version is CE 0.8.0. (computecpp_info output)
After building TensorFlow (on the dev/amd_gpu branch, at tensorflow/tensorflow@00b0040 plus tensorflow/tensorflow#17508 cherry-picked on top of it), by following the instructions here, the
models/tutorials/image/mnistsample outputs this error:All I could figure out is that the actual exception is:
And it is thrown from somewhere inside
PhiloxRandomOp::Compute, at least in this specific case. Trying to run the default generator from thetextgenrnnproject also results in the same error. I think I've also seen a similar exception coming fromCastOpBase::Computein a different sample.I couldn't find out any more details about the error so far.
I think #77 might be related, but not quite the same.