-
Notifications
You must be signed in to change notification settings - Fork 16
pyMIC dgemm performance? #6
Comments
Hi, May I ask what setting you used and what coprocessor is in the system? Can you please try with the following settings: PYMIC_KMP_AFFINITY=granularity=fine,balanced,verbose ./dgemm.py That enables core pinning of the OpenMP threads of MKL’s dgemm on the device. If it worked, you will see some additional output of the OpenMP runtime, indicating which OpenMP thread of the dgemm runs on what core. Let me know if this helped to push the performance. Cheers, Dr.-Ing. Michael Klemm From: hhuuggoo [mailto:notifications@github.com] I'm getting around 100 GFLOPS using the dgemm.py example for 4096x4096 matrices vs ~300 GFLOPS reported in http://www.dlr.de/sc/Portaldata/15/Resources/dokumente/pyhpc2014/submissions/pyhpc2014_submission_8.pdf. any ideas on why? Thanks — |
I'm using a Phi 31S1P. I haven't configured any settings other than the env vars you prescribed - do you know what settings were used in that paper, and which env vars to set? thanks |
Hi, An alternative method to measure performance of Xeon Phi cards is to use the micperf packages which is included in MPSS. As described in this thread https://software.intel.com/en-us/forums/topic/498633 to install micperf run from the directory where you untarred the MPSS tar ball:
Then make sure compilervars.sh (Composer 2013 you can get the distributable version from https://software.intel.com/en-us/articles/redistributable-libraries-for-the-intel-c-and-fortran-composer-xe-2013-sp1-for-linux) was sourced and to run DGEMM: $ micprun -k dgemm -c optimal # -c optimal tells micperf to use the optimal parameters for DGEMM if you only interested in the environment variables micperf sets before executing DGEMM, you can look at the mipcerf dgemm.py source code (/usr/src/micperf/micp-/micp/kernels/dgemm.py) :
Hope this helps, Luis |
I'm getting around 100 GFLOPS using the dgemm.py example for 4096x4096 matrices vs ~300 GFLOPS reported in http://www.dlr.de/sc/Portaldata/15/Resources/dokumente/pyhpc2014/submissions/pyhpc2014_submission_8.pdf.
any ideas on why?
Thanks
The text was updated successfully, but these errors were encountered: