Add CUDA_POST_KERNEL_CHECK to math_functions.cu #3521

Open
xkszltl wants to merge 1 commit into BVLC:master from xkszltl:master

xkszltl commented Jan 6, 2016

I get a CUDA error (error code 11, invalid argument) in the BatchNorm layer when I increase the batch size to >= 145, which pushes the block count in caffe_gpu_powx to 65536 or more, past the grid-size limit. The error is only caught in the next PReLU layer, because the functions in math_functions.cu have no CUDA_POST_KERNEL_CHECK.
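The arithmetic behind that limit can be sketched on the host side. This is a minimal illustration, not code from the pull request: it assumes one thread per element, 512 threads per block (believed to be Caffe's default, but treated as an assumption here), and a maximum grid x-dimension of 65535 blocks, which applies below compute capability 3.0. The names `kThreadsPerBlock`, `kMaxGridDimX`, `BlocksFor`, and `LaunchFits` are hypothetical.

```cpp
#include <cstdint>

// Assumed values for illustration; neither is stated in the pull request.
constexpr std::int64_t kThreadsPerBlock = 512;  // assumed threads per block
constexpr std::int64_t kMaxGridDimX = 65535;    // grid x limit below compute capability 3.0

// Ceiling division: blocks needed to cover n elements, one thread per element.
constexpr std::int64_t BlocksFor(std::int64_t n) {
  return (n + kThreadsPerBlock - 1) / kThreadsPerBlock;
}

// True if a 1-D launch over n elements stays within the assumed grid limit.
constexpr bool LaunchFits(std::int64_t n) {
  return BlocksFor(n) <= kMaxGridDimX;
}
```

Under these assumptions the largest element count that still launches is 65535 * 512 = 33,553,920. One more element needs 65536 blocks, and the launch is rejected with an invalid-argument error.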

xkszltl commented Jan 6, 2016

The 65536-block issue can be fixed by compiling for compute_35,sm_35, which allows up to 0x7FFFFFFF blocks.
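As a rough sketch of why the build target matters: the maximum grid x-dimension is 65535 below compute capability 3.0 and 0x7FFFFFFF (2^31 - 1) from 3.0 onward. The helper names `MaxGridDimX` and `GridFits` below are hypothetical, and the example covers only the x-dimension of a 1-D launch.

```cpp
#include <cstdint>

// Maximum grid size in the x dimension for a given compute capability
// major version (hypothetical helper, for illustration only).
constexpr std::int64_t MaxGridDimX(int cc_major) {
  return cc_major >= 3 ? 0x7FFFFFFF : 65535;
}

// True if a 1-D grid of num_blocks blocks is within the limit for that target.
constexpr bool GridFits(std::int64_t num_blocks, int cc_major) {
  return num_blocks <= MaxGridDimX(cc_major);
}
```

A grid of 65536 blocks fails `GridFits(65536, 2)` but passes `GridFits(65536, 3)`, which matches the observation that building for compute_35,sm_35 makes the error go away. A post-kernel check is still useful, since it reports any launch failure at the call that caused it.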
