Support MKL2017 DNN API #5

yiheng · 2016-09-12T16:04:32Z

Intel MKL release 2017 version and it contains a DNN API, which provide DNN operation optimized for IA architecture. We will add new layers which leverage these new APIs to get a better performance on CPU.

bhack · 2017-01-17T11:08:30Z

Can Intel finally clarify relation roadmap against this and #363? At "marketing" level is quite confusional.

i8run · 2017-01-17T13:34:50Z

#363 is the same as this issue. @bhack

bhack · 2017-01-17T13:36:48Z

But is mkl-dnn https://github.com/01org/mkl-dnn?

jason-dai · 2017-01-17T14:25:02Z

Here mkl-dnn actually refers to the DNN primitives in Intel MKL 2017 (https://software.intel.com/en-us/articles/introducing-dnn-primitives-in-intelr-mkl)

bhack · 2017-01-17T14:26:02Z

There will be a disambiguation between the two?

bhack · 2017-01-17T14:27:11Z

Cause mkl dnn api and mkl-dnn api have a very similar marketing name.

bhack · 2017-01-17T16:44:36Z

Ok probably it is still not clear the roadmap also if one is opensource and the other one is closed:

Intel MKL-DNN includes functionality similar to Intel(R) Math Kernel Library (Intel(R) MKL) 2017, but is not API compatible. We are investigating how to unify the APIs in future Intel MKL releases.

jason-dai · 2017-01-18T03:51:10Z

We plan to support the DNN primitives in Intel MKL 2017 at this moment; no plan for the https://github.com/01org/mkl-dnn support yet.

refine inference process

fix: rever linear

This feature enables mkl-dnn support, which can speed up deep learning model. We wrapper the native c api in the java, which are in BigDL-core projects. And in BigDL, we integrated the convolution, batchnorm, maxpooling, avgpooling, relu, lrn, softmax, caddtable and concattable. Currently, it supports create the model which only contains dnn layer or container. Because the data layout is optimized in mkl-dnn. The mkl-dnn model will use `DnnTensor` which contains the native buffer as a default tensor. So there're some notations, 1. User should copy the data from jvm heap at the first layer and copy back to jvm heap at the last layer. 2. User should compile the model, which contains the phase (training/inference) and input tensor size. It will infer and allocate the other information. * fix: linear performance issue and serialization of java object in MklDnnTensor * memory leak refactor * memory leak and bn performance issues 1. Memory Leak The internal buffer with MklDnnTensor should not be re-assigned without releasing. So we should check it first. At first iteration or after the changing of input size, we create a new MklDnnTensor as a buffer. 2. Bn perf The JIT BatchNormalization only supports avx2 or avx512, which has much batter performance than ref version. The input and gradOutput format should be the same to get the best performance. * test: add some test cases for BatchNorm. The computation of float value is not the same as C/C++/Native with JVM. And batch norm will make it much greater such as 10^-8 -> 10^-4 -> 10^-1 * fix: rebase with upstream master: 1. Concat and ConcatTable should inherit from DynamicContainer. 2. updateParameters has been depricated. 3. zeroGradParameters should be final. But from now on, the Linear should use it. 4. Some other syntax or semantic errors. * perf: single node and single model performance * perf: single model * feat: add fusion for mkl-dnn * test: add test utils to compare dnn output * test: add some tests compared with caffe * add unit tests for dnn tensor * add unit test for reorder memory * test: fix the test regression errors * checkin reorder manager * add backward for sequential * fix some bugs * update core ref * add unit tests * refactor: move the static class DataType, AlgKind and so on to standalone class (#4) * refactor: delete MklDnn.MemoryFormat * refactor: move the static class DataType, AlgKind and so on to standalone class * fix: core refactor errors * refactor: spec errors (#5) * Mkl dnn dev (#6) * checkin reorder manager * add container and refine reorder manager * fix merge issue * add join table forward * refine inteface (#7) * add LRN and ReLU * add pooling * refactor: conv + linear + bn * add JoinTable backward * refactor: conv + linear + bn * add cAddTable concattable * fix: reorder failed on some of convs * refactor: softmax * refactor: fusion support * refactor: resnet_50 * refactor: move tests to this branch * refactor: delete unusefull files and enable the special old tests. refactor: delete unsed methods in MklDnnOps fix: scalastyle check * fix: rebase with upstream * fix: ignore the prototxt tests * fix: do not change the core commit ref * fix: move set num of threads for mkldnn to ResNet50Perf * fix: serialization disabled for mkldnn module

* feat: mkl-dnn initialize * fix: structure of building * fix: public final static * fix: delete the dependencies of environments * fix: skip tests * add update dnn wrappers * fix: dynamic load iomp5 * feat linear supports and some fix * add more wrapper * add lrn api * fix: add bn and softmax * fix: some fixes * fix: mkl-dnn build * feat: add get format api * fix: add getSize * feat: aligned memory * add conv fuse relu api * fix: add aligned storage * add concat api * fix: mkl envs for lib mkldnn * fix: add mkl add method with 2 ptrs * fix: update to Release * fix: batch norm infer mode * fix: update 0.5.0 -> 0.6.0 * add free (intel-analytics#5) * feat: affinity for java thread * fix: update core branch * fix: delete the memset constant value for debug, and add affinity * feat: add mkl-dnn fusion * fix: memory format enum consistent with dnn * feat: add auto format * refactor: delete the MemoryFormat in MklDnn * Memory should load MKLDnn (intel-analytics#6) * refactor: move enums to seprate classes (intel-analytics#7) * feat: add GetShape and GetFormat api * fix: delete printf * fix a bug * add sum * refactor: change name * refactor: change submodule infos * fix: set block time by default. A property to control to disable it

yiheng added the new feature label Sep 12, 2016

yiheng assigned i8run Sep 12, 2016

yiheng added this to the spark-dl_0.1 milestone Sep 14, 2016

yiheng added the story label Sep 14, 2016

yiheng removed the story label Dec 21, 2016

jason-dai removed this from the spark-dl_0.1 milestone Dec 26, 2016

yiheng mentioned this issue Dec 27, 2016

jdk7 throw exception in MKL class but the error msg is eat #194

Closed

jason-dai added this to the 0.1 release milestone Jan 9, 2017

jason-dai added high priority module and removed new feature labels Jan 9, 2017

jason-dai removed this from the 0.1 release milestone Mar 14, 2017

jason-dai added medium priority and removed high priority labels Mar 21, 2017

wzhongyuan pushed a commit to wzhongyuan/BigDL that referenced this issue Aug 28, 2017

Merge pull request intel-analytics#5 from hhbyyh/refactor

6d86cbd

refine inference process

i8run referenced this issue in i8run/BigDL Feb 7, 2018

Merge pull request #5 from i8run/open-source-mkl-dnn-for-cherry

5dcf617

fix: rever linear

yiheng closed this as completed Jun 29, 2018

jianweimama mentioned this issue Jun 7, 2024

IPEX-LLM(llama.cpp) met core dump when run Qwen-7B-Q4_K_M.gguf on Intel ARC770 #11260

Closed

lucshi mentioned this issue Jun 28, 2024

MTL GPU driver not shown and GPU demo crashed on Linux #11460

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support MKL2017 DNN API #5

Support MKL2017 DNN API #5

yiheng commented Sep 12, 2016

bhack commented Jan 17, 2017

i8run commented Jan 17, 2017

bhack commented Jan 17, 2017

jason-dai commented Jan 17, 2017

bhack commented Jan 17, 2017

bhack commented Jan 17, 2017

bhack commented Jan 17, 2017

jason-dai commented Jan 18, 2017

Support MKL2017 DNN API #5

Support MKL2017 DNN API #5

Comments

yiheng commented Sep 12, 2016

bhack commented Jan 17, 2017

i8run commented Jan 17, 2017

bhack commented Jan 17, 2017

jason-dai commented Jan 17, 2017

bhack commented Jan 17, 2017

bhack commented Jan 17, 2017

bhack commented Jan 17, 2017

jason-dai commented Jan 18, 2017