chandangreddy/Linear-Algebra-Kernels
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Required software: * PPCG and dependent packages * Autotuner * OpenCL or Cuda SDK 5.5 * AMD CLBLAS 2.2.0 https://github.com/clMathLibraries/clBLAS * CuBLAS 5.5 OpenCl Blas kernels OpneCL_Dir = {Benchmark}/Opencl * modify the OpenCL_SDK and BLAS_SDK varibles according to your machine installation in {OpenCL_Dir}/common.mk * to build, execute make blas command in each of the benchmark directories * run the resulting executable CUDA Blas Kernels CUDA_DIR = {Benchmark}/CUDA * make sure nvcc compiler is installed * to build, execute make blas command in each of the benchmark directories * run the resulting executable PPCG generated kernels Polybench_DIR = {Benchmark}/polybench-c-3.2 * Setup PPCG_DIR, OpenCL_DIR, AUTOTUNE_DIR, PENCIL_UTIL and POLYBENCH variables in {Polybench_DIR}/common.mk * Add ${PENCIL_UTIL}/runtime/src/.libs directory to the LD_LIBRARY_PATH environment variable * to build and run, execute make opencl or make cuda in each of the benchmark directories * run make autotune_cuda or autotune_opencl to launch autotuning. Autotuning parameters can be changed in expolre-params.py in each benchmark directory. Note that some of the parameters could be invalid on a particular machine. * Setup the TARGET variable in {Polybench_DIR}/commom.mk accordingly * Best configuration found after autotuning will be printed in {benchmark}_autotune.log * Update these configurations in the Makefile inside individual directory and do make opencl/make cuda, to execute the benchmark with the best configuration
About
Contains the linear algebra kernels in C and BLAS equivalents for both OpenCL and CUDA
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published