-
Notifications
You must be signed in to change notification settings - Fork 18.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
syncedmem.cpp:57] Check failed: error == cudaSuccess (11 vs. 0) #1600
Comments
Have you solved this problem? Or what's the reason of this situation? I used the caffe version 0.99. And run successfully in other training task, just failed finetuning the trained model with different classes output. math_functions.cpp:90] Check failed: error == cudaSuccess (11 vs. 0) invalid argument |
When use the GT9500 Gpu , I got the problem |
I have not yet solved this problem. Unfortunately, I am using a Jetson TK1, so I can't change GPU. |
The latest release Please ask about hardware on the caffe-users group. |
@jwessnit Have you solved this problem? |
hi,
would anyone know what the problem might be here? cudaMalloc is failing on iteration 0 when I am trying to train my own network. make runtest worked with the exception of 12 unit tests (see below).
I have seen Cuda error 11 a couple of times in other submitted issues but none caused by syncedmem.cpp.
I1219 14:44:31.277175 4727 net.cpp:208] This network produces output accuracy
I1219 14:44:31.277303 4727 net.cpp:208] This network produces output loss
I1219 14:44:31.277470 4727 net.cpp:467] Collecting Learning Rate and Weight Decay.
I1219 14:44:31.277608 4727 net.cpp:219] Network initialization done.
I1219 14:44:31.277739 4727 net.cpp:220] Memory required for data: 5493236
I1219 14:44:31.278095 4727 solver.cpp:41] Solver scaffolding done.
I1219 14:44:31.278239 4727 solver.cpp:160] Solving testnet
I1219 14:44:31.278445 4727 solver.cpp:247] Iteration 0, Testing net (#0)
F1219 14:44:31.733762 4727 syncedmem.cpp:57] Check failed: error == cudaSuccess (11 vs. 0) invalid argument
*** Check failure stack trace: ***
@ 0xb2619060 (unknown)
@ 0xb2618f5c (unknown)
@ 0xb2618b78 (unknown)
@ 0xb261af98 (unknown)
Aborted
make runtest errors:
[----------] Global test environment tear-down
[==========] 838 tests from 169 test cases ran. (911468 ms total)
[ PASSED ] 826 tests.
[ FAILED ] 12 tests, listed below:
[ FAILED ] NetTest/0.TestParamPropagateDown, where TypeParam = caffe::FloatCPU
[ FAILED ] NetTest/1.TestParamPropagateDown, where TypeParam = caffe::DoubleCPU
[ FAILED ] NetTest/2.TestParamPropagateDown, where TypeParam = caffe::FloatGPU
[ FAILED ] NetTest/3.TestParamPropagateDown, where TypeParam = caffe::DoubleGPU
[ FAILED ] MathFunctionsTest/0.TestSgnbitCPU, where TypeParam = float
[ FAILED ] MathFunctionsTest/0.TestSignCPU, where TypeParam = float
[ FAILED ] MathFunctionsTest/1.TestSignCPU, where TypeParam = double
[ FAILED ] MathFunctionsTest/1.TestSgnbitCPU, where TypeParam = double
[ FAILED ] HingeLossLayerTest/0.TestGradientL1, where TypeParam = caffe::FloatCPU
[ FAILED ] HingeLossLayerTest/1.TestGradientL1, where TypeParam = caffe::DoubleCPU
[ FAILED ] HingeLossLayerTest/2.TestGradientL1, where TypeParam = caffe::FloatGPU
[ FAILED ] HingeLossLayerTest/3.TestGradientL1, where TypeParam = caffe::DoubleGPU
12 FAILED TESTS
YOU HAVE 2 DISABLED TESTS
The text was updated successfully, but these errors were encountered: