A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
hpc
profiler
gpu
opencl
cuda
nvidia
gpu-acceleration
gpu-computing
sycl
nvidia-cuda
nvidia-gpu
ptx
gpu-programming
roofline-model
ptx-utils
-
Updated
Dec 31, 2023 - C++