Allow for CUDA backend #110

Open
cdeterman opened this issue Jan 24, 2018 · 19 comments

@cdeterman (Owner) commented Jan 24, 2018

It has occurred to me, during my toying with gpuRcuda, that I should be able to find a way to simply let the user indicate that they wish to use the CUDA backend. After all, ViennaCL is intended to provide an interface to both OpenCL and CUDA. This was not pursued previously because Rcpp and the nvcc compiler were not playing nicely together, but that has changed very recently with some changes to Rcpp.

If I am able to succeed in this, then gpuRcuda will no longer be relevant. Instead, I would work to interface with relevant CUDA extensions such as cuBLAS (via gpuRcublas) directly from gpuR.

@cdeterman cdeterman self-assigned this Jan 24, 2018
@cdeterman cdeterman added this to the 2.1.0 milestone Jan 24, 2018
cdeterman added a commit that referenced this issue Feb 7, 2018
…ls (for now), using partial templates when using CUDA backend (enable_if), and custom CUDA templates. First big commit for #110
cdeterman added a commit that referenced this issue Feb 7, 2018
cdeterman added a commit that referenced this issue Feb 7, 2018
@cdeterman (Owner, Author)

I believe I nearly have this working for Linux builds (and theoretically macOS). However, I don't believe this will be possible for Windows, given that NVIDIA's nvcc compiler requires Visual Studio, which is not supported by R and causes all sorts of other problems. As such, until NVIDIA allows the MinGW toolchain to be used on Windows, the CUDA backend for gpuR will likely be limited to Linux systems.

@cdeterman (Owner, Author)

@pengzhao @rhaunschild you two are users who have a noted interest in CUDA. Could you try to install the cuda branch of this repository to confirm that it compiles cleanly for you? From R you should just need the following commands:

Sys.setenv(BACKEND="CUDA")
devtools::install_github("cdeterman/gpuR", ref = "cuda")

If it works, I would encourage you to clone the repository branch and run the unit tests by opening an R session in the git directory and running:

devtools::test()
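
A minimal sanity check after the install might look like the following (a sketch only; it assumes the CUDA build exposes the same R interface as the OpenCL build, i.e. gpuInfo(), vclMatrix() and %*%):

library(gpuR)
gpuInfo()                            # should report the NVIDIA device

set.seed(42)
A <- matrix(rnorm(64), 8, 8)
B <- matrix(rnorm(64), 8, 8)

vA <- vclMatrix(A, type = "float")   # data held on the GPU
vB <- vclMatrix(B, type = "float")
vC <- vA %*% vB                      # should dispatch to the CUDA kernels

max(abs(as.matrix(vC) - A %*% B))    # expect only a small float rounding error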

@dselivanov commented Feb 8, 2018

Congrats @cdeterman! I've tried to install it but got this error:

In file included from /home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/meta/result_of.hpp:41:0,
from /home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/scalar.hpp:29,
from /home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/tools/entry_proxy.hpp:27,
from /home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/detail/matrix_def.hpp:26,
from /home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/matrix.hpp:26,
from ../inst/include/gpuR/dynVCLMat.hpp:26,
from ../inst/include/gpuR/getVCLptr.hpp:5,
from gpuMatrix_igemm.cpp:4:
/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/Core:58:34: fatal error: math_functions.hpp: No such file or directory
compilation terminated.

I have the latest RcppEigen (0.3.3.4.0) installed.

Full traceback:

* installing *source* package ‘gpuR’ ...
checking for g++... g++
checking whether the C++ compiler works... yes
checking for C++ compiler default output file name... a.out
checking for suffix of executables...
checking whether we are cross compiling... no
checking for suffix of object files... o
checking whether we are using the GNU C++ compiler... yes
checking whether g++ accepts -g... yes
checking how to run the C++ preprocessor... g++ -E
Checking for C++ Compiler
checking whether we are using the GNU C++ compiler... (cached) yes
checking whether g++ accepts -g... (cached) yes
configure: "BACKEND = CUDA"
checking "Checking environment variable CUDA_HOME"... "CUDA_HOME not set; using highest version found /usr/local/cuda-9.1"
checking for /usr/local/cuda-9.1/bin/nvcc... yes
"NVCC found"
checking "whether this is the 64 bit linux version of CUDA"...
checking for /usr/local/cuda-9.1/lib64/libcudart.so... yes
"yes -- using /usr/local/cuda-9.1/lib64 for CUDA libs"
checking for Rscript... yes
checking "building the nvcc command line"...
configure: "Acquiring R compiler flags"
configure: Building Makevars
configure: creating ./config.status
config.status: creating src/Makevars
** libs
/usr/local/cuda-9.1/bin/nvcc -gencode arch=compute_30,code=sm_30 -std=c++11 -DGPU -x cu -c -Xcompiler "-fPIC" -Xcudafe "--diag_suppress=boolean_controlling_expr_is_constant --diag_suppress=code_is_unreachable" --expt-relaxed-constexpr -I. -I../inst/include -DBACKEND_CUDA -I/usr/share/R/include -I/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/Rcpp/include -I"/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include" -I/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include -I/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/BH/include gpuMatrix_igemm.cpp -o gpuMatrix_igemm.o
In file included from /home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/meta/result_of.hpp:41:0,
                 from /home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/scalar.hpp:29,
                 from /home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/tools/entry_proxy.hpp:27,
                 from /home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/detail/matrix_def.hpp:26,
                 from /home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/matrix.hpp:26,
                 from ../inst/include/gpuR/dynVCLMat.hpp:26,
                 from ../inst/include/gpuR/getVCLptr.hpp:5,
                 from gpuMatrix_igemm.cpp:4:
/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/Core:58:34: fatal error: math_functions.hpp: No such file or directory
compilation terminated.
Makevars:40: recipe for target 'gpuMatrix_igemm.o' failed
make: *** [gpuMatrix_igemm.o] Error 1
ERROR: compilation failed for package ‘gpuR’
* removing ‘/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/gpuR’
Warning message:
In i.p(...) :
  installation of package ‘/tmp/RtmpjNapzd/remotes4fd42f6b7122/cdeterman-gpuR-5b29276’ had non-zero exit status

EDIT: here is an SO post found after some quick googling.

@cdeterman (Owner, Author)

Thanks @dselivanov. What version of RcppEigen do you have installed?

@dselivanov

The latest, RcppEigen_0.3.3.4.0.
After symlinking (as suggested on SO) with

sudo ln -s /usr/local/cuda/include/crt/math_functions.hpp /usr/local/cuda/include/math_functions.hpp

I got another batch of errors:

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(96): error: identifier "x" is undefined

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(138): error: identifier "x" is undefined

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(138): error: class "Eigen::half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(223): error: class "Eigen::half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(223): error: class "Eigen::half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(276): error: class "__half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(372): error: class "Eigen::half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(378): error: class "Eigen::half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(387): error: class "Eigen::half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(387): error: class "Eigen::half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(571): error: class "Eigen::half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(571): error: class "Eigen::half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(603): error: class "Eigen::half" has no member "x"

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/Half.h(614): warning: function "__shfl_xor(float, int, int)"
/usr/local/cuda-9.1/bin/../targets/x86_64-linux/include/sm_30_intrinsics.hpp(295): here was declared deprecated ("__shfl_xor() is deprecated in favor of __shfl_xor_sync() and may be removed in a future release (Use -Wno-deprecated-declarations to suppress this warning).")

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include/Eigen/src/Core/arch/CUDA/PacketMathHalf.h(102): error: more than one conversion function from "const __half" to a built-in type applies:
function "__half::operator short() const"
function "__half::operator unsigned short() const"
function "__half::operator int() const"
function "__half::operator unsigned int() const"
function "__half::operator long long() const"
function "__half::operator unsigned long long() const"
function "__half::operator __nv_bool() const"

14 errors detected in the compilation of "/tmp/tmpxft_00005a44_00000000-6_gpuMatrix_igemm.cpp1.ii".

@cdeterman (Owner, Author)

Hmm... odd, it continues to work with CUDA 8.0 on my Ubuntu 14.04 Docker image. I will try with CUDA 9.1 (as I see that is your version) on Ubuntu 16.04 and see what happens.

@dselivanov

It seems it is indeed related to CUDA 9.1: tensorflow/tensorflow#15389

@znmeb commented Feb 8, 2018

I recently acquired a laptop with an NVIDIA 1050 Ti. It's currently running Windows 10 Pro with Hyper-V, Docker for Windows, and Windows Subsystem for Linux. I have CUDA 9.0 without Visual Studio. I haven't tried anything involving nvcc yet, though; TensorFlow 1.5.0 runs fine on it.

I am doing a lot of work with Docker at the moment but have no immediate plans to dual-boot the machine. I think Hyper-V can see the GPU and export it to a guest VM; I'll try this in a day or so.

I will probably end up with both Visual Studio 2015 and 2017; the Microsoft R Client uses 2017 and the NVIDIA FORTRAN compiler uses 2015.

@cdeterman (Owner, Author)

@dselivanov I can confirm the same problem on my Docker image. I have found a way to resolve the initial problem (so you don't need the symlink), but I am looking into why the half errors are happening, which again appear to be an issue between Eigen and CUDA >= 9.

@cdeterman (Owner, Author)

@dselivanov I have created forks of the BH and RcppEigen packages, updated with the most recent changes from their upstream sources (i.e. boostorg and Eigen) to support CUDA >= 9. Please try installing them directly from my GitHub with

devtools::install_github('cdeterman/BH')
devtools::install_github('cdeterman/RcppEigen')

and then try to install gpuR again. I have successfully compiled it in my Docker image (Ubuntu 16.04, CUDA 9.1).

@dselivanov

I got another error:

/usr/local/cuda-9.1/bin/nvcc -gencode arch=compute_30,code=sm_30 -std=c++11 -DGPU -x cu -c -Xcompiler "-fPIC" -Xcudafe "--diag_suppress=boolean_controlling_expr_is_constant --diag_suppress=code_is_unreachable" --expt-relaxed-constexpr -I. -I../inst/include -DBACKEND_CUDA -I/usr/share/R/include -I/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/Rcpp/include -I"/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RcppEigen/include" -I/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include -I/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/BH/include solve.cpp -o solve.o
/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/traits/size.hpp(164): error: class "Eigen::Map<Eigen::Matrix<float, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>" has no member "size1"
detected during:
instantiation of "viennacl::vcl_size_t viennacl::traits::size1(const MatrixType &) [with MatrixType=Eigen::Map<Eigen::Matrix<float, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>]"
/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/matrix.hpp(1178): here
instantiation of "void viennacl::copy(const viennacl::matrix<NumericT, F, AlignmentV> &, CPUMatrixT &) [with CPUMatrixT=Eigen::Map<Eigen::Matrix<float, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>, NumericT=float, F=viennacl::row_major, AlignmentV=1U]"
../inst/include/gpuR/dynEigenMat.hpp(428): here
instantiation of "void dynEigenMat<T, std::enable_if<std::is_floating_point::value, void>::type>::to_host(viennacl::matrix<T, viennacl::row_major, 1U> &) [with T=float]"
solve.cpp(64): here
instantiation of "void cpp_gpuMatrix_solve(SEXP, SEXP, __nv_bool, __nv_bool, int) [with T=float]"
solve.cpp(103): here

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/meta/result_of.hpp(142): error: class "Eigen::Map<Eigen::Matrix<float, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>" has no member "size_type"
detected during:
instantiation of class "viennacl::result_of::size_type [with T=Eigen::Map<Eigen::Matrix<float, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>]"
/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/matrix.hpp(1186): here
instantiation of "void viennacl::copy(const viennacl::matrix<NumericT, F, AlignmentV> &, CPUMatrixT &) [with CPUMatrixT=Eigen::Map<Eigen::Matrix<float, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>, NumericT=float, F=viennacl::row_major, AlignmentV=1U]"
../inst/include/gpuR/dynEigenMat.hpp(428): here
instantiation of "void dynEigenMat<T, std::enable_if<std::is_floating_point::value, void>::type>::to_host(viennacl::matrix<T, viennacl::row_major, 1U> &) [with T=float]"
solve.cpp(64): here
instantiation of "void cpp_gpuMatrix_solve(SEXP, SEXP, __nv_bool, __nv_bool, int) [with T=float]"
solve.cpp(103): here

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/traits/size.hpp(202): error: class "Eigen::Map<Eigen::Matrix<float, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>" has no member "size2"
detected during:
instantiation of "viennacl::result_of::size_type::type viennacl::traits::size2(const MatrixType &) [with MatrixType=Eigen::Map<Eigen::Matrix<float, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>]"
/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/matrix.hpp(1186): here
instantiation of "void viennacl::copy(const viennacl::matrix<NumericT, F, AlignmentV> &, CPUMatrixT &) [with CPUMatrixT=Eigen::Map<Eigen::Matrix<float, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>, NumericT=float, F=viennacl::row_major, AlignmentV=1U]"
../inst/include/gpuR/dynEigenMat.hpp(428): here
instantiation of "void dynEigenMat<T, std::enable_if<std::is_floating_point::value, void>::type>::to_host(viennacl::matrix<T, viennacl::row_major, 1U> &) [with T=float]"
solve.cpp(64): here
instantiation of "void cpp_gpuMatrix_solve(SEXP, SEXP, __nv_bool, __nv_bool, int) [with T=float]"
solve.cpp(103): here

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/traits/size.hpp(164): error: class "Eigen::Map<Eigen::Matrix<double, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>" has no member "size1"
detected during:
instantiation of "viennacl::vcl_size_t viennacl::traits::size1(const MatrixType &) [with MatrixType=Eigen::Map<Eigen::Matrix<double, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>]"
/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/matrix.hpp(1178): here
instantiation of "void viennacl::copy(const viennacl::matrix<NumericT, F, AlignmentV> &, CPUMatrixT &) [with CPUMatrixT=Eigen::Map<Eigen::Matrix<double, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>, NumericT=double, F=viennacl::row_major, AlignmentV=1U]"
../inst/include/gpuR/dynEigenMat.hpp(428): here
instantiation of "void dynEigenMat<T, std::enable_if<std::is_floating_point::value, void>::type>::to_host(viennacl::matrix<T, viennacl::row_major, 1U> &) [with T=double]"
solve.cpp(64): here
instantiation of "void cpp_gpuMatrix_solve(SEXP, SEXP, __nv_bool, __nv_bool, int) [with T=double]"
solve.cpp(106): here

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/meta/result_of.hpp(142): error: class "Eigen::Map<Eigen::Matrix<double, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>" has no member "size_type"
detected during:
instantiation of class "viennacl::result_of::size_type [with T=Eigen::Map<Eigen::Matrix<double, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>]"
/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/matrix.hpp(1186): here
instantiation of "void viennacl::copy(const viennacl::matrix<NumericT, F, AlignmentV> &, CPUMatrixT &) [with CPUMatrixT=Eigen::Map<Eigen::Matrix<double, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>, NumericT=double, F=viennacl::row_major, AlignmentV=1U]"
../inst/include/gpuR/dynEigenMat.hpp(428): here
instantiation of "void dynEigenMat<T, std::enable_if<std::is_floating_point::value, void>::type>::to_host(viennacl::matrix<T, viennacl::row_major, 1U> &) [with T=double]"
solve.cpp(64): here
instantiation of "void cpp_gpuMatrix_solve(SEXP, SEXP, __nv_bool, __nv_bool, int) [with T=double]"
solve.cpp(106): here

/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/traits/size.hpp(202): error: class "Eigen::Map<Eigen::Matrix<double, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>" has no member "size2"
detected during:
instantiation of "viennacl::result_of::size_type::type viennacl::traits::size2(const MatrixType &) [with MatrixType=Eigen::Map<Eigen::Matrix<double, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>]"
/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/RViennaCL/include/viennacl/matrix.hpp(1186): here
instantiation of "void viennacl::copy(const viennacl::matrix<NumericT, F, AlignmentV> &, CPUMatrixT &) [with CPUMatrixT=Eigen::Map<Eigen::Matrix<double, -1, -1, 0, -1, -1>, 0, Eigen::OuterStride<-1>>, NumericT=double, F=viennacl::row_major, AlignmentV=1U]"
../inst/include/gpuR/dynEigenMat.hpp(428): here
instantiation of "void dynEigenMat<T, std::enable_if<std::is_floating_point::value, void>::type>::to_host(viennacl::matrix<T, viennacl::row_major, 1U> &) [with T=double]"
solve.cpp(64): here
instantiation of "void cpp_gpuMatrix_solve(SEXP, SEXP, __nv_bool, __nv_bool, int) [with T=double]"
solve.cpp(106): here

6 errors detected in the compilation of "/tmp/tmpxft_000077eb_00000000-6_solve.cpp1.ii".
Makevars:40: recipe for target 'solve.o' failed
make: *** [solve.o] Error 1
ERROR: compilation failed for package ‘gpuR’

* removing ‘/home/dselivanov/R/x86_64-pc-linux-gnu-library/3.4/gpuR’
Warning message:
In i.p(...) :
  installation of package ‘/tmp/RtmpPlThUs/remotes71dc19c963d7/cdeterman-gpuR-5b29276’ had non-zero exit status

@cdeterman (Owner, Author)

@dselivanov Bah... I forgot that you also need my GitHub version of RViennaCL. I am waiting on some pull requests to ViennaCL before I release the updates.

devtools::install_github('cdeterman/RViennaCL')

Then try once more.
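
For reference, the full sequence discussed above, consolidated (a sketch; it assumes the patched forks under cdeterman/BH, cdeterman/RcppEigen, and cdeterman/RViennaCL remain the ones carrying the CUDA >= 9 fixes):

# install the patched dependencies first, then the cuda branch of gpuR
devtools::install_github("cdeterman/BH")
devtools::install_github("cdeterman/RcppEigen")
devtools::install_github("cdeterman/RViennaCL")

Sys.setenv(BACKEND = "CUDA")
devtools::install_github("cdeterman/gpuR", ref = "cuda")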

@dselivanov

Successfully installed (and the package loads normally)!

library(gpuR)
#  - context device index: 0
#    - GeForce GTX 680
#checked all devices
#completed initialization
#gpuR 2.0.2
#Attaching package: ‘gpuR’
#The following objects are masked from ‘package:base’:

#    colnames, pmax, pmin, svd

However, devtools::test() prints:

Error in dyn.load(dllfile) :
unable to load shared object '/home/dselivanov/projects/gpuR/src/gpuR.so':
/home/dselivanov/projects/gpuR/src/gpuR.so: undefined symbol: cudaMemcpyAsync

cdeterman added a commit that referenced this issue Feb 9, 2018
…tory for header if greater than or equal to 9.1 to address a point raised in #110.
@mjmg commented Aug 29, 2018

Is there an easier way to switch between OpenCL and CUDA backends?

Right now it seems that you have to uninstall/reinstall/rebuild the package to switch between

Sys.setenv(BACKEND="CUDA")
devtools::install_github("cdeterman/gpuR", ref = "cuda")

and the regular install.packages:

install.packages("gpuR")

It would be nice if there were a single build for both platforms, linked against both the OpenCL and CUDA libraries, with the context switched via some flag.
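
Purely as an illustration of the request (hypothetical; no such function exists in gpuR today), a runtime switch might look something like:

setBackend("cuda")     # hypothetical flag routing new objects to CUDA
A <- vclMatrix(rnorm(100), nrow = 10, ncol = 10, type = "float")

setBackend("opencl")   # hypothetical flag routing new objects to OpenCL
B <- vclMatrix(rnorm(100), nrow = 10, ncol = 10, type = "float")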

@cdeterman (Owner, Author)

@mjmg I would ideally prefer to have the ability to switch between the two backends, but I don't think it is possible with ViennaCL. @karlrupp, is it possible to use both OpenCL and CUDA concurrently with ViennaCL? If so, perhaps something could be done here.

@karlrupp

Yes, it is absolutely possible to switch between the CUDA and OpenCL backends in ViennaCL at runtime. That's one of the strengths of ViennaCL over other libraries.

@cdeterman (Owner, Author)

@karlrupp are there any examples of this? For example, how could I create two matrices, one in OpenCL and one in CUDA?

@jonpeake commented Nov 15, 2018

@cdeterman I tried installing the CUDA-backed gpuR but can't get any of the CUDA functions to work. For example, if I try creating a vclMatrix, I receive the error:

Error in vectorToMatVCL(data, nrow, ncol, 8L, context_index - 1) : 
  /home/jonpeake/R/x86_64-pc-linux-gnu-library/3.5/RViennaCL/include/viennacl/linalg/cuda/matrix_operations.hpp(334): : getLastCudaError() CUDA error 48: no kernel image is available for execution on the device @ matrix_row_assign_kernel

The same thing happens if I try to multiply two gpuMatrix objects:

Error in cpp_gpuMatrix_elem_prod(A@address, is(A, "vclMatrix"), B@address,  : 
  /home/jonpeake/R/x86_64-pc-linux-gnu-library/3.5/RViennaCL/include/viennacl/linalg/cuda/matrix_operations.hpp(334): : getLastCudaError() CUDA error 48: no kernel image is available for execution on the device @ matrix_row_assign_kernel

My GPU info is below, running CUDA 10.0 on Ubuntu 18.04:

> gpuInfo()
$deviceName
[1] "GeForce GTX 960"

$deviceVendor
[1] "NVIDIA"

$majorVersion
[1] 5

$minorVersion
[1] 2

$numberOfMultiProcs
[1] 8

$sharedMemPerBlock
[1] 49152

$regsPerBlock
[1] 65536

$warpSize
[1] 32

$deviceMemory
[1] 4236902400

$deviceConstMemory
[1] 65536

$clockFreq
[1] 1253000

$double_support
[1] TRUE

Any idea what's going on?

@jonpeake

@cdeterman Never mind! I figured out that RStudio doesn't automatically import system environment variables when opened from the desktop; I needed to launch it from the command line for them to be picked up.
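
A sketch of how to make the CUDA-related variables visible to a desktop-launched RStudio session (paths are hypothetical; adjust to your install). Variables that the configure script reads, such as CUDA_HOME, can be set for the current session or persisted in ~/.Renviron, which R reads at startup regardless of how it was launched:

# Option 1: set for the current session, before installing/loading gpuR
Sys.setenv(CUDA_HOME = "/usr/local/cuda")

# Option 2: persist in ~/.Renviron so every new R session sees it
cat("CUDA_HOME=/usr/local/cuda\n",
    file = file.path(Sys.getenv("HOME"), ".Renviron"),
    append = TRUE)

Variables that affect the dynamic linker itself (e.g. LD_LIBRARY_PATH) may still require launching RStudio from a shell, as described above.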
