v1.2.2

@tpapaz tpapaz released this Oct 7, 2013 · 2 commits to master since this release

  • Removed the unneeded "cufft" library.
  • Added the "train_set" file, a large dataset for observing the speedup of GPU execution over CPU execution.

v1.2

@tpapaz tpapaz released this Oct 4, 2013 · 4 commits to master since this release

Using LibSVM 3.17 & CUDA SDK 5.5

  • Exchanged the header file “cublas.h” for “cublas_v2.h”.
  • Exchanged the type cublasStatus for cublasStatus_t.
  • Exchanged the functions cublasAlloc() and cublasFree() for cudaMalloc() and cudaFree(), respectively.
  • Declared the cublasHandle_t CUBLAS library handle.
  • Initialized the handle with cublasCreate() and released it with cublasDestroy() once finished.
  • Added the handle as the first parameter to all the CUBLAS library function calls.
  • Changed the scalar parameters to be passed by reference instead of by value. In C/C++, prefixing the argument with “&” is usually enough, because the v2 API reads scalar parameters from host memory by default.
  • Changed the parameter characters 'N' and 'n' (non-transpose operation) to CUBLAS_OP_N.
  • Checked the function prototypes in the header files “cublas.h” and “cublas_v2.h” for code correctness.
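
The migration steps listed above can be sketched in a single host-side function. This is a minimal illustration, not code from this project: the helper name gemv_v2 and its std::vector interface are hypothetical, and cublasSgemv is used only as a representative CUBLAS call. It assumes a CUDA-capable device and linking against cuBLAS (for example, nvcc example.cu -lcublas).

```cuda
#include <cstddef>
#include <vector>
#include <cuda_runtime.h>
#include <cublas_v2.h>   // v2 header, replaces the legacy "cublas.h"

// Hypothetical helper (not part of this project): computes y = A * x on the GPU.
// A is an m x n matrix stored column-major, as cuBLAS expects.
// Returns true on success, false if any CUDA or cuBLAS call fails.
bool gemv_v2(const std::vector<float>& A, const std::vector<float>& x,
             std::vector<float>& y, int m, int n)
{
    if (m <= 0 || n <= 0) return false;
    if (A.size() != static_cast<std::size_t>(m) * n) return false;
    if (x.size() != static_cast<std::size_t>(n)) return false;
    y.assign(m, 0.0f);

    // The v2 API needs an explicit library handle.
    cublasHandle_t handle;
    cublasStatus_t status = cublasCreate(&handle);   // cublasStatus_t, not cublasStatus
    if (status != CUBLAS_STATUS_SUCCESS) return false;

    float *dA = nullptr, *dx = nullptr, *dy = nullptr;

    // cudaMalloc() replaces cublasAlloc(); plain cudaMemcpy() moves the data.
    bool ok =
        cudaMalloc((void**)&dA, A.size() * sizeof(float)) == cudaSuccess &&
        cudaMalloc((void**)&dx, x.size() * sizeof(float)) == cudaSuccess &&
        cudaMalloc((void**)&dy, y.size() * sizeof(float)) == cudaSuccess &&
        cudaMemcpy(dA, A.data(), A.size() * sizeof(float),
                   cudaMemcpyHostToDevice) == cudaSuccess &&
        cudaMemcpy(dx, x.data(), x.size() * sizeof(float),
                   cudaMemcpyHostToDevice) == cudaSuccess;

    if (ok) {
        // Scalars are passed by reference (host pointers by default).
        const float alpha = 1.0f;
        const float beta  = 0.0f;

        // The handle is the first argument; CUBLAS_OP_N replaces 'N'/'n'.
        status = cublasSgemv(handle, CUBLAS_OP_N, m, n,
                             &alpha, dA, m, dx, 1,
                             &beta, dy, 1);

        ok = status == CUBLAS_STATUS_SUCCESS &&
             cudaMemcpy(y.data(), dy, y.size() * sizeof(float),
                        cudaMemcpyDeviceToHost) == cudaSuccess;
    }

    // cudaFree() replaces cublasFree(); freeing a null pointer is a no-op.
    cudaFree(dA);
    cudaFree(dx);
    cudaFree(dy);

    // Release the handle once finished.
    cublasDestroy(handle);
    return ok;
}
```

As a usage example, calling gemv_v2 with the column-major 2 x 2 matrix {1, 2, 3, 4} and x = {1, 1} should fill y with {4, 6}. Creating and destroying the handle inside the function keeps the sketch short; code that issues many CUBLAS calls would normally create the handle once and reuse it.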