Tests not completing successfully #8

davenso · 2016-11-19T15:25:50Z

System:
Processor: Intel(R) Xeon Phi(TM) CPU 7210 @ 1.30GHz
OS: CentOS Linux release 7.2.1511 (Core)

> [davido@knl-data build]$ make test
> Running tests...
> Test project /home/davido/mkl/mkl-dnn-master/build
>       Start  1: simple-net-c
>  1/23 Test  #1: simple-net-c ..........................***Exception: Other  0.11 sec
>       Start  2: simple-net-cpp
>  2/23 Test  #2: simple-net-cpp ........................***Exception: Other  6.68 sec
>       Start  3: api-c
>  3/23 Test  #3: api-c .................................***Exception: Other  0.25 sec
>       Start  4: test_c_symbols-c
>  4/23 Test  #4: test_c_symbols-c ......................   Passed    0.01 sec
>       Start  5: test_sum
>  5/23 Test  #5: test_sum ..............................   Passed    1.98 sec
>       Start  6: test_reorder
>  6/23 Test  #6: test_reorder ..........................   Passed    0.59 sec
>       Start  7: test_concat
>  7/23 Test  #7: test_concat ...........................   Passed    0.57 sec
>       Start  8: test_relu_forward
>  8/23 Test  #8: test_relu_forward .....................***Failed    0.26 sec
>       Start  9: test_relu_backward
>  9/23 Test  #9: test_relu_backward ....................***Failed    0.25 sec
>       Start 10: test_lrn_forward
> 10/23 Test #10: test_lrn_forward ......................***Failed    1.60 sec
>       Start 11: test_lrn_backward
> 11/23 Test #11: test_lrn_backward .....................***Failed    1.98 sec
>       Start 12: test_pooling_forward
> 12/23 Test #12: test_pooling_forward ..................***Failed    5.49 sec
>       Start 13: test_pooling_backward
> 13/23 Test #13: test_pooling_backward .................   Passed    2.19 sec
>       Start 14: test_batch_normalization_forward
> 14/23 Test #14: test_batch_normalization_forward ......***Failed   10.12 sec
>       Start 15: test_batch_normalization_backward
> 15/23 Test #15: test_batch_normalization_backward .....***Failed   15.45 sec
>       Start 16: test_inner_product_forward
> 16/23 Test #16: test_inner_product_forward ............   Passed    2.47 sec
>       Start 17: test_inner_product_backward_data
> 17/23 Test #17: test_inner_product_backward_data ......   Passed    0.87 sec
>       Start 18: test_inner_product_backward_weights
> 18/23 Test #18: test_inner_product_backward_weights ...   Passed    1.41 sec
>       Start 19: test_convolution_format_any
> 19/23 Test #19: test_convolution_format_any ...........   Passed    0.02 sec
>       Start 20: test_convolution_forward
> 20/23 Test #20: test_convolution_forward ..............***Failed   56.88 sec
>       Start 21: test_convolution_relu_forward
> 21/23 Test #21: test_convolution_relu_forward .........***Failed   40.96 sec
>       Start 22: test_convolution_backward_data
> 22/23 Test #22: test_convolution_backward_data ........   Passed   87.59 sec
>       Start 23: test_convolution_backward_weights
> 23/23 Test #23: test_convolution_backward_weights .....   Passed   90.65 sec
> 
> 48% tests passed, 12 tests failed out of 23
> 
> Total Test time (real) = 328.49 sec
> 
> The following tests FAILED:
>           1 - simple-net-c (OTHER_FAULT)
>           2 - simple-net-cpp (OTHER_FAULT)
>           3 - api-c (OTHER_FAULT)
>           8 - test_relu_forward (Failed)
>           9 - test_relu_backward (Failed)
>          10 - test_lrn_forward (Failed)
>          11 - test_lrn_backward (Failed)
>          12 - test_pooling_forward (Failed)
>          14 - test_batch_normalization_forward (Failed)
>          15 - test_batch_normalization_backward (Failed)
>          20 - test_convolution_forward (Failed)
>          21 - test_convolution_relu_forward (Failed)
> Errors while running CTest
> make: *** [test] Error 8

Dave

The text was updated successfully, but these errors were encountered:

rsdubtso · 2016-11-20T14:00:32Z

Hi Dave,

I am not able to reproduce this with the latest master on my Xeon Phi 7250 system:

$ lsb_release -a
LSB Version:    :core-4.1-amd64:core-4.1-noarch
Distributor ID: RedHatEnterpriseServer
Description:    Red Hat Enterprise Linux Server release 7.2 (Maipo)
Release:        7.2
Codename:       Maipo

$ gcc --version
gcc (GCC) 4.8.5 20150623 (Red Hat 4.8.5-4)
Copyright (C) 2015 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

Can you please provide additional details like compiler version and cmake command line? Does your system have selinux enabled?

It would also be great of you could run the simple-net test under gdb and post 'bt' output after an exception.

Thanks,
Roma

tensor-tang · 2016-11-23T08:32:39Z

hi @rsdubtso
I also have the same issue, with Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz

Running tests...
Test project /home/tangjian/mkl-dnn-master/build
      Start  1: simple-net-c
 1/23 Test  #1: simple-net-c ..........................***Exception: Other  0.75 sec
      Start  2: simple-net-cpp
 2/23 Test  #2: simple-net-cpp ........................***Exception: Other  0.86 sec
      Start  3: api-c
 3/23 Test  #3: api-c .................................***Exception: Other  0.07 sec
      Start  4: test_c_symbols-c
 4/23 Test  #4: test_c_symbols-c ......................   Passed    0.01 sec
      Start  5: test_sum
 5/23 Test  #5: test_sum ..............................   Passed    0.27 sec
      Start  6: test_reorder
 6/23 Test  #6: test_reorder ..........................   Passed    0.25 sec
      Start  7: test_concat
 7/23 Test  #7: test_concat ...........................   Passed    0.22 sec
      Start  8: test_relu_forward
 8/23 Test  #8: test_relu_forward .....................***Failed    0.13 sec
      Start  9: test_relu_backward
 9/23 Test  #9: test_relu_backward ....................***Failed    0.17 sec
      Start 10: test_lrn_forward
10/23 Test #10: test_lrn_forward ......................***Failed    0.27 sec
      Start 11: test_lrn_backward
11/23 Test #11: test_lrn_backward .....................***Failed    0.24 sec
      Start 12: test_pooling_forward
12/23 Test #12: test_pooling_forward ..................***Failed    0.26 sec
      Start 13: test_pooling_backward
13/23 Test #13: test_pooling_backward .................   Passed    0.22 sec
      Start 14: test_batch_normalization_forward
14/23 Test #14: test_batch_normalization_forward ......***Failed    0.77 sec
      Start 15: test_batch_normalization_backward
15/23 Test #15: test_batch_normalization_backward .....***Failed    0.94 sec
      Start 16: test_inner_product_forward
16/23 Test #16: test_inner_product_forward ............   Passed    0.23 sec
      Start 17: test_inner_product_backward_data
17/23 Test #17: test_inner_product_backward_data ......   Passed    0.22 sec
      Start 18: test_inner_product_backward_weights
18/23 Test #18: test_inner_product_backward_weights ...   Passed    0.22 sec
      Start 19: test_convolution_format_any
19/23 Test #19: test_convolution_format_any ...........   Passed    0.01 sec
      Start 20: test_convolution_forward
20/23 Test #20: test_convolution_forward ..............***Failed    9.25 sec
      Start 21: test_convolution_relu_forward
21/23 Test #21: test_convolution_relu_forward .........***Failed    9.50 sec
      Start 22: test_convolution_backward_data
22/23 Test #22: test_convolution_backward_data ........   Passed   17.64 sec
      Start 23: test_convolution_backward_weights
23/23 Test #23: test_convolution_backward_weights .....   Passed   17.31 sec

48% tests passed, 12 tests failed out of 23

Total Test time (real) =  59.83 sec

The following tests FAILED:
          1 - simple-net-c (OTHER_FAULT)
          2 - simple-net-cpp (OTHER_FAULT)
          3 - api-c (OTHER_FAULT)
          8 - test_relu_forward (Failed)
          9 - test_relu_backward (Failed)
         10 - test_lrn_forward (Failed)
         11 - test_lrn_backward (Failed)
         12 - test_pooling_forward (Failed)
         14 - test_batch_normalization_forward (Failed)
         15 - test_batch_normalization_backward (Failed)
         20 - test_convolution_forward (Failed)
         21 - test_convolution_relu_forward (Failed)
Errors while running CTest
make: *** [test] Error 8

Could u help?

$ gcc --version
gcc (GCC) 4.8.5 20150623 (Red Hat 4.8.5-4)
Copyright (C) 2015 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

jczaja · 2016-11-23T13:37:47Z

Not sure if this relevant, but I can see some of code using MKLDNN failing when selinux is set into enrofcing mode, while it works fine when permissive/disabled mode is set. So please attach output from:
"sestatus" command.

tensor-tang · 2016-11-23T13:43:10Z

Yes, it's enabled.
What wonderful it works now when permissive is set. I have tried so many ways which does not work.
Thanks very much! @jczaja

taliesinb · 2016-11-23T17:55:05Z

@tensor-tang you might want to format your tool output as markdown code, because github actually interprets all those #n entries as being references to issue pages. So an issue I'm watching got pinged by your tool output.

rsdubtso added the bug A confirmed library bug label Nov 20, 2016

rsdubtso self-assigned this Nov 20, 2016

mkl-dnn closed this as completed in 0360439 Nov 25, 2016

Darwinian2 mentioned this issue Aug 9, 2017

Test Failures #98

Closed

xyzsam mentioned this issue Dec 13, 2017

Build MKL-DNN without linking with OpenMP #166

Closed

greenpdx mentioned this issue Dec 30, 2017

Build error, missing -lpthread for tests #172

Closed

mrinmayk mentioned this issue Jan 9, 2018

Unable to run basic test suite of 32 tests #178

Closed

moting9 mentioned this issue Apr 16, 2018

Arithmetic exception in Xbyak::util::Cpu::Cpu() when libmkldnn running in virtual machine KVM #215

Closed

ghost mentioned this issue Sep 21, 2018

Something has changed in the last commits ? #326

Closed

angus1121 mentioned this issue Jan 9, 2020

convolution core dump #624

Closed

kakulo mentioned this issue Jun 11, 2021

Cannot run test cases on GPUs with OpenCL #1086

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tests not completing successfully #8

Tests not completing successfully #8

davenso commented Nov 19, 2016 •

edited by vpirogov

rsdubtso commented Nov 20, 2016

tensor-tang commented Nov 23, 2016 •

edited by vpirogov

jczaja commented Nov 23, 2016

tensor-tang commented Nov 23, 2016 •

edited

taliesinb commented Nov 23, 2016 •

edited

Tests not completing successfully #8

Tests not completing successfully #8

Comments

davenso commented Nov 19, 2016 • edited by vpirogov

rsdubtso commented Nov 20, 2016

tensor-tang commented Nov 23, 2016 • edited by vpirogov

jczaja commented Nov 23, 2016

tensor-tang commented Nov 23, 2016 • edited

taliesinb commented Nov 23, 2016 • edited

davenso commented Nov 19, 2016 •

edited by vpirogov

tensor-tang commented Nov 23, 2016 •

edited by vpirogov

tensor-tang commented Nov 23, 2016 •

edited

taliesinb commented Nov 23, 2016 •

edited