
Squeeze More Performance from Intel MKL #7

Closed

RoyiAvital opened this issue May 14, 2019 · 8 comments

Comments

@RoyiAvital

RoyiAvital commented May 14, 2019

Hello,
I don't see how MKL is built here (I see that a DLL is opened; could it be that no compilation is done and the library is only called through a DLL?).

But it would be great if it were built and compiled into Julia in a way that exploits more features of Intel MKL to improve performance:

[Image: slide listing Intel MKL performance features, including the Packed API and Compact API]

I think the Packed API and Compact API are trickier, but it would be great if they were exposed.
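
For context, here is a minimal C sketch of the Packed API (the cblas_dgemm_pack* functions are MKL's documented interface; the helper name, shapes, and batching scheme are made up for illustration). The idea is to pack the reused matrix once and amortize the packing cost over many products:

```c
#include <mkl.h>

/* Multiply a fixed matrix A (m x k) by many B's (k x n):
   pack A once, then reuse the packed form for every product. */
void many_products(MKL_INT m, MKL_INT n, MKL_INT k,
                   const double *a, const double *b_batch,
                   double *c_batch, int num_b) {
    /* Query the buffer size MKL needs for the packed copy of A. */
    size_t bytes = cblas_dgemm_pack_get_size(CblasAMatrix, m, n, k);
    double *a_packed = (double *)mkl_malloc(bytes, 64);

    /* Pack A once; alpha is folded into the packed data. */
    cblas_dgemm_pack(CblasColMajor, CblasAMatrix, CblasNoTrans,
                     m, n, k, 1.0, a, m, a_packed);

    /* Each compute call skips the internal repacking of A. */
    for (int i = 0; i < num_b; ++i)
        cblas_dgemm_compute(CblasColMajor, CblasPacked, CblasNoTrans,
                            m, n, k, a_packed, m,
                            b_batch + (size_t)i * k * n, k,
                            0.0, c_batch + (size_t)i * m * n, m);

    mkl_free(a_packed);
}
```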

MKL JIT Feature

MKL also has a new JIT feature which I think could be a great addition: Intel® Math Kernel Library Improved Small Matrix Performance Using Just-in-Time (JIT) Code Generation for Matrix Multiplication (GEMM).

Remark
It seems you use Intel MKL 2019.0, while Intel MKL 2019.4 is out (the OpenMP runtime is even from the 2018 release).

Update (14/10/2019)
I listed what I wanted in Benchmark MATLAB & Julia for Matrix Operations - Message 145.

@andreasnoack
Member

This is not currently on our to-do list. It's a relatively low priority, since for smaller problems you'd generally want to use StaticArrays, which would probably be competitive with what MKL can offer for small matrices. Please let us know if you have evidence otherwise.

@RoyiAvital
Author

@andreasnoack,
Even if StaticArrays is competitive, why not enable MKL_DIRECT_CALL?
It is only a small compilation-flag change.
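
For reference, a minimal C sketch of what that flag change looks like on the calling side (the compile line assumes the usual MKL link setup; the sizes and data are made up):

```c
/* Compile with the macro defined, e.g.:
     icc -DMKL_DIRECT_CALL gemm_direct.c -mkl
   Defining MKL_DIRECT_CALL before including mkl.h lets MKL turn
   small-size cblas_dgemm calls into direct, low-overhead kernel
   invocations instead of going through the full library entry point. */
#include <mkl.h>

int main(void) {
    double a[4] = {1, 2, 3, 4};
    double b[4] = {5, 6, 7, 8};
    double c[4] = {0, 0, 0, 0};

    /* A 2x2 GEMM: exactly the size class the direct-call path targets. */
    cblas_dgemm(CblasColMajor, CblasNoTrans, CblasNoTrans,
                2, 2, 2, 1.0, a, 2, b, 2, 0.0, c, 2);
    return 0;
}
```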

The other features go far beyond what you can do with StaticArrays.
They exploit the structure of a small problem (say, multiplying by the same matrix over and over) to get the gains of a large-size problem.

@RoyiAvital
Author

RoyiAvital commented Sep 13, 2019

Another way to achieve comparable (better?) performance to StaticArrays would be using the MKL JIT feature: Intel® Math Kernel Library Improved Small Matrix Performance Using Just-in-Time (JIT) Code Generation for Matrix Multiplication (GEMM).

The performance looks pretty impressive:

[Image: benchmark chart of MKL JIT GEMM performance on small matrices]

Note that one can use it in two manners (a sketch of the second follows this list):

  1. Enable JIT and let the MKL engine make the decisions.
  2. Ask MKL to export a pointer to a JITted variation of a function and call it directly.
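
A minimal C sketch of the second manner, using the mkl_jit_* API that shipped with MKL 2019 (the fixed 3x3 shape and the data are made up for illustration):

```c
#include <stdio.h>
#include <mkl.h>

int main(void) {
    void *jitter;

    /* Generate a kernel specialized for one fixed problem:
       C = 1.0 * A * B + 0.0 * C, 3x3 doubles, column-major. */
    mkl_jit_status_t status = mkl_jit_create_dgemm(
        &jitter, MKL_COL_MAJOR, MKL_NOTRANS, MKL_NOTRANS,
        3, 3, 3, 1.0, 3, 3, 0.0, 3);
    if (status == MKL_JIT_ERROR) {
        fprintf(stderr, "JIT kernel creation failed\n");
        return 1;
    }
    /* status may also be MKL_NO_JIT, in which case the returned
       kernel falls back to the standard GEMM path. */

    /* Fetch the pointer to the JITted kernel once... */
    dgemm_jit_kernel_t dgemm_3x3 = mkl_jit_get_dgemm_ptr(jitter);

    double a[9] = {1, 2, 3, 4, 5, 6, 7, 8, 9};
    double b[9] = {9, 8, 7, 6, 5, 4, 3, 2, 1};
    double c[9] = {0};

    /* ...and call it directly, e.g. in a hot loop, with no dispatch
       or argument-checking overhead on each call. */
    dgemm_3x3(jitter, a, b, c);

    printf("c[0] = %f\n", c[0]);
    mkl_jit_destroy(jitter);
    return 0;
}
```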

@RoyiAvital
Author

By the way, there is a simple trick to squeeze better performance from MKL on AMD CPUs: replacing the CPU dispatch function, as described by Agner Fog.

An example of this is given in:

https://github.com/fo40225/Anaconda-Windows-AMD
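
For illustration only, here is the override commonly attributed to Agner Fog's guidance (this is not taken from the linked repo; mkl_serv_intel_cpu_true is an undocumented internal MKL symbol reported in community threads, not an official API, so treat this sketch as an assumption that may break across MKL versions):

```c
/* fakeintel.c -- build as a shared object and preload it so the
   dynamic linker resolves this symbol before MKL's own copy:
     gcc -shared -fPIC -o libfakeintel.so fakeintel.c
     LD_PRELOAD=./libfakeintel.so ./my_mkl_program

   ASSUMPTION: mkl_serv_intel_cpu_true is an undocumented MKL
   internal. Returning 1 makes MKL's dispatcher believe it runs on
   an Intel CPU, so it selects the fast AVX2/AVX-512 code paths on
   AMD instead of the slow generic fallback. Unsupported by Intel. */
int mkl_serv_intel_cpu_true(void) {
    return 1;
}
```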

@ViralBShah
Contributor

I believe much of the discussion here is out of scope for the current MKL.jl package, and might perhaps find a bit more traction on Discourse.

@RoyiAvital
Author

@ViralBShah, what do you mean?
These features are about how MKL is integrated into a project.

@ViralBShah
Contributor

This package is mainly about providing MKL as a replacement for OpenBLAS. For further functionality, other packages can be created that leverage the presence of MKL. For now, I'm going to close this issue.

@RoyiAvital
Author

@ViralBShah, I think you are missing the point.
All the features above must be turned on when MKL is integrated in place of OpenBLAS; they can't be enabled anywhere else.

For instance, the direct-call flag can't be set at a later stage, only at integration time.
