CUBLAS abort #20
Comments
It is definitely a bug, and possibly a difficult one to fix. Thanks for reporting.
Hi Nvbit team,
I have an NVIDIA GeForce 1080Ti on my machine. Have you been able to find a solution? Thanks.
Version 1.5.5 should have fixed this issue. Closing!
I'm trying to run the opcode_hist tool on 0_Simple/matrixMulCUBLAS from the NVIDIA samples on a Tesla K40c. This sample uses sgemm from CUBLAS to perform matrix multiplication. However, I'm having trouble with it. When I try to execute the following line:
eval LD_PRELOAD=.../nvbit_release/tools/opcode_hist/opcode_hist.so ./matrixMulCUBLAS
The output is as follows:
I'm using nvcc 10.1. Without the instrumentation, the code runs as expected.