
Malformed graph of ernie when run with benchmark application #21492

Closed
Sand3r- opened this issue Dec 2, 2019 · 5 comments

@Sand3r-
Contributor

Sand3r- commented Dec 2, 2019

Current behaviour

The error was discovered thanks to level-3 logging enabled via the GLOG_v environment variable. GLOG reported the following:

Some operators use the same variables for reading/writing. For example, when the fp32 model is run, one can observe that both the scale op and the transpose2 op take transpose_4.tmp_0 as their input, although according to the original graph they should not.
An excerpt of the GLOG output documenting this:

operator.cc:172 CPUPlace Op(scale), inputs:{X[transpose_4.tmp_0:float[1, 12, 128, 64]({})]}, outputs:{Out[scale_12.tmp_0:float[1, 12, 128, 64]({})]}.
(...several ops later...)
operator.cc:172 CPUPlace Op(transpose2), inputs:{X[transpose_4.tmp_0:float[1, 12, 128, 64]({})]}, outputs:{Out[fc_66.tmp_0:float[1, 128, 12, 64]({})], XShape[transpose_47.tmp_1:[0, 1, 12, 128, 64]({})]}.

As far as I understand, this is a bug, since variable names should be unique within the same scope.

To illustrate the problem, please see the following figure, which depicts a different model (ernie_quant) suffering from the same problem:
[screenshot attached in the original issue]

This is a blocking issue for the INT8 Ernie quantization task, since our quantization system associates scales with variable names. If a variable name repeats in several places, we end up applying the same scale where we did not intend to.
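To make the quantization failure mode concrete, here is a minimal sketch (not Paddle code; the variable names come from the log above, and the scale values are made up for illustration). Because scales are keyed by variable name, two distinct tensors that end up sharing the name transpose_4.tmp_0 after graph rewriting can no longer receive distinct scales:

```python
# Scales collected on the ORIGINAL graph, where names are unique.
# Values are made up for illustration; transpose_47.tmp_0 stands in for
# the variable the later transpose2 op was supposed to read.
scales_original = {
    "transpose_4.tmp_0": 0.12,
    "transpose_47.tmp_0": 0.87,
}

# After the pass, both ops read a variable named "transpose_4.tmp_0",
# as seen in the GLOG excerpt above.
ops_after_pass = [
    ("scale", "transpose_4.tmp_0"),
    ("transpose2", "transpose_4.tmp_0"),  # should have read transpose_47.tmp_0
]

# A name-keyed lookup now returns the same scale for both ops;
# the 0.87 scale is silently never used.
looked_up = [scales_original[var] for _, var in ops_after_pass]
print(looked_up)  # → [0.12, 0.12]
```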

Reproduction

  • based on commit 8da0cd5
  • CPU: including MKLDNN version v.20
  • OS platform: Ubuntu 16.04
  • CMake flags: -DCMAKE_BUILD_TYPE=RelWithDebInfo -DWITH_GPU=OFF -DON_INFER=ON -DWITH_MKLDNN=ON -DWITH_TESTING=ON -DWITH_PROFILER=ON -DWITH_STYLE_CHECK=OFF -DWITH_INFERENCE_API_TEST=ON

To reproduce:
  1. Build Paddle.
  2. Build the benchmark inference application for ernie: https://github.com/PaddlePaddle/benchmark/tree/master/Inference/c%2B%2B/ernie
  3. Run any 4-input ernie model.

@luotao1 Could you please assign someone to help solve this issue?

@bingyanghuang
Contributor

On the mixed-up variables in the graph: when running ernie via the benchmark repo, variables get connected to the wrong ops for both the fp32 and the int8 model, which may affect the accuracy of ernie's output. As shown below, the left image is the correct graph, and the right image is the graph saved before the passes were applied:
[screenshot attached in the original comment]
The mixed-up variables are shown in the following figure:
[screenshot attached in the original comment]

@wojtuss

wojtuss commented Dec 11, 2019

The graph is malformed when the memory_optimize_pass is enabled (e.g. via the EnableMemoryOptim() method). With the pass disabled, the graph of the model looks fine.
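For context on how a memory optimization pass can produce exactly the symptom reported above, here is a deliberately naive toy simulation of buffer reuse (this is an illustration of the mechanism only, not PaddlePaddle's actual memory_optimize_pass; it assumes every variable dies right after its single reader):

```python
# Each op is (op_type, input_var, output_var). Names loosely follow the log.
ops = [
    ("transpose2", "fc_4.tmp_0", "transpose_4.tmp_0"),
    ("scale", "transpose_4.tmp_0", "scale_12.tmp_0"),
    ("transpose2", "scale_12.tmp_0", "transpose_47.tmp_0"),
]

def reuse_pass(ops):
    """Toy buffer-reuse pass: once a variable's (assumed only) reader has
    run, its name goes into a free pool and is handed to the next op's
    output. Distinct logical tensors then share one variable name."""
    free = []          # names whose tensors are considered dead
    renamed = {}       # output renames applied so far
    new_ops = []
    for op_type, inp, out in ops:
        inp = renamed.get(inp, inp)       # follow earlier renames
        if free:                          # reuse a dead variable's name
            out_final = free.pop()
            renamed[out] = out_final
        else:
            out_final = out
        new_ops.append((op_type, inp, out_final))
        free.append(inp)                  # naive: input dead after this op
    return new_ops

# After the pass, "transpose_4.tmp_0" names both the scale op's input and
# the later transpose2 op's output: two different tensors, one name.
print(reuse_pass(ops))
```

Any consumer that keys metadata (such as quantization scales) by variable name will then conflate the two tensors, which matches the observation in the GLOG excerpt.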

@luotao1
Contributor

luotao1 commented Dec 13, 2019

Related: #21598. How about disabling this pass when using MKLDNN?

@Sand3r-
Contributor Author

Sand3r- commented Dec 18, 2019

> related #21598, how about disabling this pass when using MKLDNN?

We can surely do that. I've opened up a PR implementing this: #21826

@luotao1
Contributor

luotao1 commented Dec 20, 2019

#21826 disables the memory optimization pass when MKLDNN is on.

@luotao1 luotao1 closed this as completed Dec 20, 2019