Add CUDA_POST_KERNEL_CHECK to math_functions.cu by xkszltl · Pull Request #3521 · BVLC/caffe

xkszltl · 2016-01-06T09:44:51Z

I get a CUDA error (error code 11, invalid argument) in the BatchNorm layer when I increase the batch size to >= 145, because the kernel launch in caffe_gpu_powx then exceeds the 65535-block grid limit. The error is only caught in the next PReLU layer, since there is no CUDA_POST_KERNEL_CHECK in these math functions.

xkszltl · 2016-01-06T10:28:35Z

The 65535-block limit can be avoided by compiling with compute_35,sm_35, which allows up to 0x7FFFFFFF blocks in the grid's x dimension.

Add CUDA_POST_KERNEL_CHECK to math_functions.cu

81bbce5

xkszltl wants to merge 1 commit into BVLC:master from xkszltl:master