New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Squeeze More Performance from Intel MKL #7
Comments
This is not currently on our to do. It's a relatively low priority since for smaller problems you'd generally like to use |
@andreasnoack , The other features do something far beyond what you do with |
Another way to achieve (Better?) compareable performance to Performance look pretty impressive: Pay attention one could use it in 2 manners:
|
By the way, there is a simple trick to squeeze better performance from MKL on AMD CPU's. An example for that is given in: |
I believe much of the discussion here is out of scope for the current MKL.jl package, and might perhaps find a bit more traction on discourse. |
@ViralBShah , What do you mean? |
This package is mainly about providing MKL as a replacement for OpenBLAS. For further functionality, other packages can be created that leverage the presence of MKL. For now, I'm going to close this issue. |
@ViralBShah , I think you are missing the point. For instance, the flag for direct call can't be done in later place only on the integration. |
Hello,
I don't see the way MKL is built here (I see DLL Open is called, could it be no compilation is done only calling a DLL?).
But it would be great if it is built and compiled into Julia utilizing more features of Intel MKL to improve performance:
I think the Packed API and Compact API are more tricky but if they will be exposed it will be great.
MKL JIT Feature
MKL also has a new JIT feature which I think could be great addition - Intel® Math Kernel Library Improved Small Matrix Performance Using Just-in-Time (JIT) Code Generation for Matrix Multiplication (GEMM).
Remark
It seem you use Intel MKL 2019.0. while Intel MKL 2019.4 is out (OpenMP is even from 2018 release).
Update (14/10/2019)
I listed what I wanted in Benchmark MATLAB & Julia for Matrix Operations - Message 145.
The text was updated successfully, but these errors were encountered: