-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[libcusmm] libcusmm_unittest_transpose test failure #75
Comments
Are you using CUBLAS flag during the compilation? Such a big kernel should not be there, unless we apply densification and that case we should not transpose... |
no, I used |
Ah OK, I see the problem now. The file https://github.com/cp2k/dbcsr/blob/develop/src/acc/libsmm_acc/libcusmm/parameters_K20X.json reports "huge" kernels (I think for testing purpose):
@shoshijak
Personally, I will go for the first solution... |
The If really |
On our K20X system, the
libcusmm_unittest_transpose
test fails with the following message:@shoshijak is this CUDA arch or kernel dependent?
The text was updated successfully, but these errors were encountered: