New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
build with mkl-dnn by default #13303
Conversation
many other failures across CI. Maybe iDeep is not ready for default enabling yet? |
@soumith For #1, should be caused by the ideep case, copy_op_test. Will fix it soon. For other compiling issues, according to previous study, should be caused by building environment when building mkl-dnn, for example, the version of binutils. Could you tell who can help me on such issue? |
@gujinghui for these cases, you can reproduce the build environment using shell scripts in https://github.com/pytorch/pytorch/tree/master/.jenkins/pytorch I have just pasted the errors from the logs of the continuous build (you can press the "Details" button in the entries below that say "ci/circleci: caffe2_onnx_py2_gcc5_ubuntu16_04_test" etc. |
052742d
to
932b9d2
Compare
@soumith The cases with below failure have been modified here.
The reason is zero dim is still not supported for now both in ideep and mkl-dnn. But the coming upgrade will introduce zero dim feature soon. Besides, we still have rest 3 kinds of failures in this PR:
I'm trying to repro this locally. But still suspect this should be building environment issue.
|
FYI: Building PyTorch (using CMake's make[2]: *** No rule to make target 'third_party/pytorch/src/pytorch-build/lib/libmkldnn.so', needed by '../lib/libtest.so.5.0.0'. Stop.
make[1]: *** [CMakeFiles/Makefile2:109: src/api/CMakeFiles/test.dir/all] Error 2
make: *** [Makefile:95: all] Error 2 It seems that pytorch/cmake/Modules/FindMKLDNN.cmake Line 106 in 8444ed9
Maybe MKL-DNN needs to be installed in |
4397947
to
3487da8
Compare
what's your build command? do you have full build log? |
Build log: https://gist.github.com/kleisauke/00df397ffea14b353fbdf125a01fe150#file-build-log From the build log I see -- Set runtime path of "/usr/local/lib/libmkldnn.so.0.14.0" to "/usr/local/lib" However it looks like the wrong library is appended to |
8c22da5
to
7795490
Compare
Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>
Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>
7795490
to
bbd22a4
Compare
rebased with latest code base. Finally, we passed all test cases. 🎉 🎉 🎉 @kleisauke This PR should be able to fix the issue in your code. |
@gujinghui I can confirm that this PR resolves the issue that I had, thanks! |
Thanks, @gujinghui!. @orionr, do we have a CI setup to test this? |
@yinghai doesn't look like we have OSS CI with MKLDNN enabled - likely want to test that. However, we do have internal Sandcastle testing of it. If you want to add it, feel free to tweak .jenkins/pytorch/build.sh with something like
|
I wouldn't want MKLDNN tested in all CI configurations, because it's quite important to make sure the non MKLDNN path is working. @orionr where do we add to the config to only add it to some configs, maybe 2 or 3 configs... |
Same place, but you can also do a test similar to https://github.com/pytorch/pytorch/blob/master/.jenkins/pytorch/build.sh#L96 |
@gujinghui let me push a change to the CI so MKLDNN gets tested. |
Changes pushed. Let's see how CI does. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@orionr has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: build with mkl-dnn by default Pull Request resolved: pytorch/pytorch#13303 Reviewed By: yinghai Differential Revision: D12979633 Pulled By: orionr fbshipit-source-id: 00d23fa27c0d13e82f7e5acb3ebd00ed7ba1d5dc
ppc64le does not support mkl-dnn. I suspect this is why the ppc64le is now breaking. -- USE_MKLDNN : ON <<<<<<<<<<<<<<<<< c++: error: unrecognized command line option '-march=native' Makefile:140: recipe for target 'all' failed |
Please try to disable MKLDNN by settting NO_MKLDNN=1: |
build with mkl-dnn by default