Benchmark/mnist #5680

Closed
wants to merge 16 commits

Conversation

@dzhwinter (Contributor) commented Nov 15, 2017

Since this PR contains TensorFlow code, this benchmark will not be merged; the small fix has been included in our develop branch.

The comparison result is shown in #5862.

@helinwang (Contributor) previously approved these changes Nov 15, 2017

LGTM

@reyoung (Collaborator) left a comment

Do not change the default random seed to 1.

@qingqing01 (Contributor)

It would be better to add a test set to the demo.

@QiJune (Member) commented Nov 16, 2017

What is the purpose of this PR? Does it aim to compare accuracy with other frameworks, such as TensorFlow or PyTorch?

@dzhwinter (Contributor, Author)

I compare our refactorization with Paddle V1 / TensorFlow.
The reason I changed the random seed to 1: Paddle V1 cannot use a random seed of 0, because in the old config 0 means the seed is selected by the system. The change is just for comparing the results.
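As a reference, a minimal sketch of pinning the seeds on both sides of such a comparison (assuming TensorFlow 1.x and NumPy; the Paddle-side setting is indicated only as a comment, since the exact config option depends on the version):

import numpy as np
import tensorflow as tf

SEED = 1  # 0 is reserved in the old Paddle config to mean "seed chosen by the system"

np.random.seed(SEED)      # seeds any NumPy-generated initial values or data shuffling
tf.set_random_seed(SEED)  # graph-level seed for TensorFlow 1.x initializers
# Paddle side: set the same seed (1) in the training config so both runs start
# from comparable parameter initializations.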

@dzhwinter (Contributor, Author)

I will merge the TensorFlow code into this PR and make it clearer. Sorry for the confusion.

@QiJune (Member) commented Nov 16, 2017

And what is the result of this benchmark? Does this PR compare the computation batch by batch? If we initialize the parameters the same way, is each batch's result the same as TensorFlow's? We may need to keep the error against TensorFlow under 1e-6 with a double-precision build.
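A minimal sketch of such a batch-by-batch check, assuming the per-batch losses from each framework have already been collected into two lists (the framework-specific run loops are not shown):

import numpy as np

def compare_per_batch(paddle_losses, tf_losses, atol=1e-6):
    """Return True if the per-batch losses agree within atol in float64."""
    a = np.asarray(paddle_losses, dtype=np.float64)
    b = np.asarray(tf_losses, dtype=np.float64)
    max_err = float(np.max(np.abs(a - b)))
    print("max per-batch error: %.2e" % max_err)
    return max_err < atol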

attr=param_attr,
shape=param_shape,
dtype=dtype,
initializer=NormalInitializer(0.0, std, 1))
A Collaborator commented on the diff above:

How much effect does this initialization have?
I know this change makes it the same as the V2 initialization, but perhaps the default XavierInitializer is the more common choice. We should allow the user to supply an initializer; in fact, it can be supplied in param_attr. Adding an initializer here will ignore the initializer setting in param_attr and cause confusion.

To keep the MNIST sample code clean, it is better to change the example to set the initializer through param_attr. And if the different std does not have much effect, we don't even need to bother changing the MNIST example.
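A sketch of what setting the initializer through param_attr could look like, assuming a ParamAttr-style API with an initializer field as Fluid later exposed it; the module paths, input layer, and std value below are illustrative, not the exact API at the time of this PR:

import paddle.fluid as fluid

# Illustrative input layer and std value; the benchmark would use its own.
image = fluid.layers.data(name='image', shape=[784], dtype='float32')
std = 0.01

# The initializer is supplied through param_attr instead of being hard-coded
# inside the layer, so a user-provided setting is not silently overridden.
hidden = fluid.layers.fc(
    input=image,
    size=128,
    act='relu',
    param_attr=fluid.ParamAttr(
        initializer=fluid.initializer.NormalInitializer(loc=0.0, scale=std)))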

@dzhwinter (Contributor, Author)

Sorry for such a late reply; I was on a trip. I compared the forward cost, accuracy, and loss with V2/TensorFlow batch by batch. Because our default initializer is UniformInitializer, there is a difference of about -0.02 against the benchmark in the first several batches. With the NormalInitializer we have reached the same result as Paddle V2, down to the last decimal. I think we can conclude that our implementation achieves the same accuracy as Paddle V2.

TensorFlow uses a special random number generator algorithm, Philox, which we don't have. We can reach approximately the same result when we fill each Variable with the same random values, but there is a difference at the 6th to 8th decimal place caused by differences in the conv2d operator implementations.
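A minimal sketch of that shared-initialization setup: draw the initial weights once with NumPy and feed the same values into both frameworks, so any remaining per-batch difference comes from the operators rather than from the random number generators (TensorFlow 1.x shown; the Paddle side would copy the same array into its parameter tensor through its own API, which is not shown here):

import numpy as np
import tensorflow as tf

np.random.seed(1)
w_init = np.random.normal(0.0, 0.01, size=(784, 128)).astype(np.float64)

# TensorFlow 1.x: build the variable from the pre-drawn values instead of
# letting TF's Philox-based generator draw its own.
w_tf = tf.get_variable(
    'fc_w', shape=w_init.shape, dtype=tf.float64,
    initializer=tf.constant_initializer(w_init))
# Paddle side: load the same w_init array into the corresponding parameter
# before training, then compare the per-batch forward outputs.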

> We should allow the user to supply an initializer; in fact, it can be supplied in param_attr. Adding an initializer here will ignore the initializer setting in param_attr and cause confusion.

We are working on making the Initializer configurable. Sure, I will only change the MNIST example configuration in the benchmark test and set the fc layer to use XavierInitializer by default.
#5819
#5805
#5760

@dzhwinter closed this Jun 6, 2018