
Performance issues in examples/mnist/estimator (by P3) #573

Closed
DLPerf opened this issue Aug 22, 2021 · 3 comments

Comments

@DLPerf

DLPerf commented Aug 22, 2021

Hello! I've found a performance issue in examples/mnist/estimator: batch() should be called before map(), which can make your program more efficient. See the TensorFlow documentation on input-pipeline performance (vectorizing mapped functions) for support.

A detailed description is listed below:

  • in mnist_spark_streaming.py: .batch(BATCH_SIZE) should be called before .map(scale).
  • in mnist_spark.py: .batch(BATCH_SIZE) should be called before .map(scale).
  • in mnist_pipeline.py: .batch(BATCH_SIZE) should be called before .map(scale).

Besides, you need to check whether the function passed to map() (e.g., scale in .map(scale)) is affected by the change, so that the modified code still works properly. For example, if scale expected input of shape (x, y, z) before the fix, it will receive input of shape (batch_size, x, y, z) afterwards.
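To illustrate the suggested reordering, here is a minimal sketch. The scale function below is a hypothetical stand-in for the one in the examples (a simple elementwise normalization, which happens to work both per-element and per-batch); the actual function in the repo may differ:

```python
import tensorflow as tf

# Hypothetical elementwise preprocessing function (stand-in for the
# repo's scale): casts images to float32 and normalizes to [0, 1].
def scale(image, label):
    return tf.cast(image, tf.float32) / 255.0, label

# Dummy MNIST-shaped data for illustration.
images = tf.zeros([100, 28, 28, 1], dtype=tf.uint8)
labels = tf.zeros([100], dtype=tf.int64)

# Before: map then batch -- scale is invoked once per element.
ds_slow = (tf.data.Dataset.from_tensor_slices((images, labels))
           .map(scale)
           .batch(32))

# After: batch then map -- scale is invoked once per batch, so the
# cast/divide ops run vectorized over 32 images at a time.
ds_fast = (tf.data.Dataset.from_tensor_slices((images, labels))
           .batch(32)
           .map(scale))

for x, y in ds_fast.take(1):
    print(x.shape)  # (32, 28, 28, 1)
```

Because scale here only applies elementwise ops, it accepts the extra leading batch dimension unchanged; a map function that assumes a rank-3 input (e.g., one using tf.reshape with a fixed shape) would need to be updated as described above.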

Looking forward to your reply. By the way, I would be glad to create a PR to fix this if you are too busy.

@leewyang
Contributor

Closing as suspected bot activity.

@DLPerf
Author

DLPerf commented Aug 31, 2021

I'm not a robot! @tmielika

@DLPerf
Author

DLPerf commented Nov 4, 2021

fuck u!


2 participants