newby get cudnn_conv_layer.cpp:53] Check failed: status == CUDNN_STATUS_SUCCESS (4 vs. 0) CUDNN_STATUS_INTERNAL_ERROR #5701

Closed
bhomass opened this issue Jun 16, 2017 · 3 comments

bhomass commented Jun 16, 2017

I just set up Caffe on my Mac and am running the example code for the first time:
./examples/mnist/train_lenet.sh

I get this error:
I0616 16:54:54.592954 3070141376 layer_factory.hpp:77] Creating layer mnist
I0616 16:54:54.594542 3070141376 db_lmdb.cpp:35] Opened lmdb examples/mnist/mnist_train_lmdb
I0616 16:54:54.597126 3070141376 net.cpp:84] Creating Layer mnist
I0616 16:54:54.597185 3070141376 net.cpp:380] mnist -> data
I0616 16:54:54.597214 3070141376 net.cpp:380] mnist -> label
I0616 16:54:54.597785 3070141376 data_layer.cpp:45] output data size: 64,1,28,28
I0616 16:54:54.603828 3070141376 net.cpp:122] Setting up mnist
I0616 16:54:54.603852 3070141376 net.cpp:129] Top shape: 64 1 28 28 (50176)
I0616 16:54:54.603862 3070141376 net.cpp:129] Top shape: 64 (64)
I0616 16:54:54.603868 3070141376 net.cpp:137] Memory required for data: 200960
I0616 16:54:54.603878 3070141376 layer_factory.hpp:77] Creating layer conv1
I0616 16:54:54.603895 3070141376 net.cpp:84] Creating Layer conv1
I0616 16:54:54.603901 3070141376 net.cpp:406] conv1 <- data
I0616 16:54:54.603910 3070141376 net.cpp:380] conv1 -> conv1
F0616 16:54:55.034394 3070141376 cudnn_conv_layer.cpp:53] Check failed: status == CUDNN_STATUS_SUCCESS (4 vs. 0) CUDNN_STATUS_INTERNAL_ERROR
*** Check failure stack trace: ***
@ 0x112de40ce google::LogMessage::Fail()
@ 0x112de33ad google::LogMessage::SendToLog()
@ 0x112de3ab3 google::LogMessage::Flush()
@ 0x112de6d03 google::LogMessageFatal::~LogMessageFatal()
@ 0x112de43b7 google::LogMessageFatal::~LogMessageFatal()
@ 0x10fb4ee3f caffe::CuDNNConvolutionLayer<>::LayerSetUp()
@ 0x10fbc5267 caffe::Net<>::Init()
@ 0x10fbc3fce caffe::Net<>::Net()
@ 0x10fbdc2ca caffe::Solver<>::InitTrainNet()
@ 0x10fbdb6a1 caffe::Solver<>::Init()
@ 0x10fbdb3a5 caffe::Solver<>::Solver()
@ 0x10fbea496 caffe::SGDSolver<>::SGDSolver()
@ 0x10fbef8f2 caffe::Creator_SGDSolver<>()
@ 0x10fa8b5dd train()
@ 0x10fa8ddc1 main
@ 0x7fffae2c1255 start
@ 0x3 (unknown)
./examples/mnist/train_lenet.sh: line 4: 69494 Abort trap: 6 ./build/tools/caffe train --solver=examples/mnist/lenet_solver.prototxt $@

There are a number of posts about similar errors, but I am brand new at this and can't figure out what solution they offered. Please walk me through what to do.

bhomass commented Jun 18, 2017

I fixed it by editing Makefile.config to use Anaconda Python.

bhomass closed this as completed Jun 18, 2017

@MLenthousiast

Hi @bhomass, could you expand a bit on your answer please? I get the same kind of error, but I do not fully understand your solution. What is in your Makefile.config? I cannot find that file.

bhomass commented Jul 26, 2017

It is in your Caffe installation directory. Uncomment the Anaconda-related lines in Makefile.config.
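
For anyone who cannot find the file: in a standard Makefile-based Caffe build, Makefile.config is created by copying Makefile.config.example in the Caffe root directory. As a rough sketch of what "uncommenting the Anaconda-related lines" can look like, the fragment below follows the layout of the stock Makefile.config.example as best I recall it. The Anaconda location ($(HOME)/anaconda) and the python2.7 paths are assumptions; adjust them to your own installation, and comment out the default system PYTHON_INCLUDE and PYTHON_LIB lines so they do not conflict.

```make
# Assumption: Anaconda is installed at $(HOME)/anaconda and provides Python 2.7.
# Verify the location on your machine; it is sometimes installed elsewhere.
ANACONDA_HOME := $(HOME)/anaconda
PYTHON_INCLUDE := $(ANACONDA_HOME)/include \
		$(ANACONDA_HOME)/include/python2.7 \
		$(ANACONDA_HOME)/lib/python2.7/site-packages/numpy/core/include

# Point the linker at Anaconda's libpython instead of the system one.
PYTHON_LIB := $(ANACONDA_HOME)/lib
```

After changing Makefile.config, Caffe has to be rebuilt (for example with make clean followed by make all) for the new settings to take effect.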
