[Important] MKLDNN version of the code must have all space of memory #11481

mozga-intel · 2018-06-14T10:50:59Z

Problem with the MKLDNN performance after this change : commit

Description and solution
The mkldnn performance of the newest Paddle is a worse than the code before the changes: nearly 50%, The mkldnn version of code need to have all space of the memory to work correctly. During the tests with htop the loss of power of the threads is significant and we can not keep the fluency between the threads.

The code shows the solution to this problem. Summary: this is the fall back to the previous version of the code + MKLDNN flag.

luotao1 · 2018-06-14T11:02:53Z

@mozga-intel I couldn't open

mozga-intel · 2018-06-14T11:39:48Z

@luotao1. I am sorry. The link is up-to-date

tensor-tang · 2018-06-14T17:06:17Z

Thanks @mozga-intel.

But I am afraid you can not change like this, because initial_cpu_memory_in_mb should also be used when WITH_MKLDNN.

The performance drop is only caused by the changes of memory size. You can try to get the necessary memory size for MKLDNN, like 1G not 500M. Then change the default size with ifdef PADDLE_WITH_MKLDNN.

Because online service also need the memory as a target, so we should keep as small memory as possible, The original default flag would take about 3% of system memory, which would be too large sometimes, like 6G,

That is the reason we add this flag.

tensor-tang

Please try to keep this flag available no matter with or without MKLDNN.

The recommend way: change the default size when use MKLDNN. At least, we can know MKLDNN need how much memory to get the best performance.

tensor-tang · 2018-06-15T17:39:29Z

@mozga-intel
I can refine it later, please wait a moment.

mozga-intel · 2018-06-15T21:03:28Z

@tensor-tang Thank you for your quickly answer.

I stay in touch with you.

MKLDNN version of the code must have all space of memory

3d9956b

mozga-intel added the Intel label Jun 14, 2018

mozga-intel requested review from luotao1 and tensor-tang June 14, 2018 10:50

tensor-tang requested changes Jun 14, 2018

View reviewed changes

tensor-tang mentioned this pull request Jun 16, 2018

refine the initial cpu memory flag for mkldnn #11525

Merged

tensor-tang closed this in #11525 Jun 19, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Important] MKLDNN version of the code must have all space of memory #11481

[Important] MKLDNN version of the code must have all space of memory #11481

mozga-intel commented Jun 14, 2018 •

edited

Loading

luotao1 commented Jun 14, 2018 •

edited by mozga-intel

Loading

mozga-intel commented Jun 14, 2018

tensor-tang commented Jun 14, 2018 •

edited

Loading

tensor-tang left a comment •

edited

Loading

tensor-tang commented Jun 15, 2018

mozga-intel commented Jun 15, 2018

[Important] MKLDNN version of the code must have all space of memory #11481

[Important] MKLDNN version of the code must have all space of memory #11481

Conversation

mozga-intel commented Jun 14, 2018 • edited Loading

luotao1 commented Jun 14, 2018 • edited by mozga-intel Loading

mozga-intel commented Jun 14, 2018

tensor-tang commented Jun 14, 2018 • edited Loading

tensor-tang left a comment • edited Loading

Choose a reason for hiding this comment

tensor-tang commented Jun 15, 2018

mozga-intel commented Jun 15, 2018

mozga-intel commented Jun 14, 2018 •

edited

Loading

luotao1 commented Jun 14, 2018 •

edited by mozga-intel

Loading

tensor-tang commented Jun 14, 2018 •

edited

Loading

tensor-tang left a comment •

edited

Loading