A Simple Utility for Benchmarking CUBLAS

Compiling and Running

$ /usr/local/cuda-X.0/bin/nvcc --std=c++11 -arch=compute_60 -code=sm_60 main.cu \
  -lcublas -o matmul
$ LD_LIBRARY_PATH=/usr/local/cuda-X.0/lib64 ./matmul M N K TA TB

Where M, N, K are integers defining the size of the matrices to be multiplied, and TA and TB are integers 0 or 1 indicating whether the A and B matrices should be transposed.

If you have multiple nvidia GPUs in your machine, set the CUDA_VISIBLE_DEVICES env var to the appropriate number before running the benchmark.

Running the Benchmark Suite

$ cat benchmark_args | xargs -n1 -I{} sh -c './matmul {}'

Creating Pretty Output for Comparing CUDA 8 vs CUDA 9

TODO. :)

Disclaimer

This is not an official Google project.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
benchmark_args		benchmark_args
main.cu		main.cu

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Simple Utility for Benchmarking CUBLAS

Compiling and Running

Running the Benchmark Suite

Creating Pretty Output for Comparing CUDA 8 vs CUDA 9

Disclaimer

About

Releases

Packages

Languages

License

jlebar/cublas-benchmark

Folders and files

Latest commit

History

Repository files navigation

A Simple Utility for Benchmarking CUBLAS

Compiling and Running

Running the Benchmark Suite

Creating Pretty Output for Comparing CUDA 8 vs CUDA 9

Disclaimer

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages