Wrong predictions when using BatchNormalization with training flag set #562

zaidalyafeai · 2018-07-31T12:39:28Z

TensorFlow.js version

latest

Browser version

Version 66.0.3359.139

Describe the problem or feature request

BatchNormalization gives wrong predictions when the layer is called with training = 1.

Code to reproduce the bug / link to feature request

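I created this simple Keras model:

# Imports assumed to be from standalone Keras
from keras.layers import Input, BatchNormalization, Flatten, Dense
from keras.models import Model

def SimpleModel():
  x = Input(shape = (2, 2, 3))
  # training = 1 makes the layer normalize with batch statistics, even in predict()
  y = BatchNormalization()(x, training = 1)
  y = Flatten()(y)
  z = Dense(units = 1)(y)
  return Model(inputs = x, outputs = z)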

After training, the batch norm layer weights (gamma, beta, moving mean, moving variance) are

[array([0.99683774, 0.99683774, 0.99683774], dtype=float32),
 array([-0.00316227,  0.00316227, -0.00316228], dtype=float32),
 array([-0.08008331,  0.01483306,  0.12279604], dtype=float32),
 array([1.0677528, 1.0555032, 0.9067482], dtype=float32)]

After running the prediction

model.predict(np.zeros((1, 2, 2, 3)))

the output is

array([[[[-0.00316227,  0.00316227, -0.00316228],
         [-0.00316227,  0.00316227, -0.00316228]],

        [[-0.00316227,  0.00316227, -0.00316228],
         [-0.00316227,  0.00316227, -0.00316228]]]], dtype=float32)

In the browser, the weights are the same, but the activations are

Tensor
    [[[[0.0740574, -0.0112231, -0.1316395],
       [0.0740574, -0.0112231, -0.1316395]],

      [[0.0740574, -0.0112231, -0.1316395],
       [0.0740574, -0.0112231, -0.1316395]]]]

Explanation

In Keras, when training = 1 is set, the layer normalizes with the statistics of the prediction sample.

TensorFlow.js normalizes with the stored moving mean and variance from the training data.
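The two behaviours can be checked numerically against the weights reported above. The following is a minimal NumPy sketch, not the Keras or TensorFlow.js implementation: `batch_norm` is a hypothetical helper, and the epsilon of 1e-3 is an assumption (the Keras default for BatchNormalization).

```python
import numpy as np

# Weights as reported in the issue: gamma, beta, moving mean, moving variance.
gamma = np.array([0.99683774, 0.99683774, 0.99683774], dtype=np.float32)
beta = np.array([-0.00316227, 0.00316227, -0.00316228], dtype=np.float32)
moving_mean = np.array([-0.08008331, 0.01483306, 0.12279604], dtype=np.float32)
moving_var = np.array([1.0677528, 1.0555032, 0.9067482], dtype=np.float32)

EPSILON = 1e-3  # assumption: Keras' default epsilon for BatchNormalization


def batch_norm(x, training):
    """Hypothetical helper: channels-last batch norm over a 4-D input."""
    if training:
        # Training mode: per-channel statistics of the current batch.
        mean = x.mean(axis=(0, 1, 2))
        var = x.var(axis=(0, 1, 2))
    else:
        # Inference mode: stored moving statistics.
        mean, var = moving_mean, moving_var
    return gamma * (x - mean) / np.sqrt(var + EPSILON) + beta


x = np.zeros((1, 2, 2, 3), dtype=np.float32)
training_out = batch_norm(x, training=True)
inference_out = batch_norm(x, training=False)

print(training_out[0, 0, 0])   # one pixel, training-mode statistics
print(inference_out[0, 0, 0])  # one pixel, moving statistics
```

With an all-zero input the batch mean and variance are both zero, so the training-mode result reduces to beta, which matches the Keras output. The inference-mode result should come out close to the values seen in the browser.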

zaidalyafeai · 2018-08-08T01:36:19Z

The same problem happens in TensorFlow as well.

caisq · 2018-08-08T01:37:51Z

@zaidalyafeai you mean tf.keras?

zaidalyafeai · 2018-08-08T01:47:38Z

No, this definition

https://www.tensorflow.org/api_docs/python/tf/layers/batch_normalization

zaidalyafeai · 2018-08-08T02:24:38Z

@caisq This is a quote from the TensorFlow page

training: Either a Python boolean, or a TensorFlow boolean scalar tensor (e.g. a placeholder). Whether to return the output in training mode (normalized with statistics of the current batch) or in inference mode (normalized with moving statistics). NOTE: make sure to set this parameter correctly, or else your training/inference will not work properly.

zaidalyafeai · 2018-08-09T00:46:30Z

@caisq, I am trying to understand the source code. Could you please explain what broadcasting is?
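For readers with the same question, here is a small sketch of broadcasting in its usual NumPy sense: a per-channel vector of shape (3,) is applied to every position of a (1, 2, 2, 3) tensor without being copied by hand. How the term is used in the TensorFlow.js source is not shown in this thread, so treating it as the same idea is an assumption, and the variable names are illustrative only.

```python
import numpy as np

# A channels-last activation tensor and one value per channel.
activations = np.ones((1, 2, 2, 3), dtype=np.float32)
per_channel_shift = np.array([10.0, 20.0, 30.0], dtype=np.float32)

# The (3,) vector is stretched along the leading axes to match (1, 2, 2, 3).
shifted = activations + per_channel_shift

print(shifted.shape)
print(shifted[0, 0, 0])
```

This is the same mechanism that lets per-channel gamma, beta, mean, and variance vectors be combined with a 4-D batch of activations.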

nsthorat · 2018-10-29T21:20:32Z

@caisq did we ever resolve this issue? I assume tf.layers is doing the right thing in TensorFlow..

zaidalyafeai · 2018-12-13T10:19:23Z

I resolved this issue by modifying the source code and changing the definition of batch norm during inference time. My pix2pix demo is based on that!

caisq · 2018-12-13T14:02:57Z

Training with BatchNormalization should be working. See the ACGAN example under review at
tensorflow/tfjs-examples#187

I'd like to see the code you're using and the change you made in order for it to work, @zaidalyafeai , if possible.

zaidalyafeai · 2018-12-13T22:48:01Z

@caisq, I may have accidentally deleted the source code :/ but the idea is simple: I forced the batch norm layer to use the statistics of the input sample, as if it were training. So I didn't add any code, I just re-routed the existing code path.

rthadur · 2020-04-28T16:55:06Z

Closing this due to lack of activity, feel free to reopen. Thank you

zaidalyafeai changed the title ~~Wrting predictions when using BatchNorm with training flag set~~ Wrong predictions when using BatchNormalization with training flag set Jul 31, 2018

nsthorat assigned caisq Jul 31, 2018

zaidalyafeai mentioned this issue Aug 14, 2018

Pix2pix improvements ml5js/ml5-library#198

Closed

rthadur added the stat:awaiting tensorflower label Nov 16, 2018

zaidalyafeai mentioned this issue Apr 2, 2020

Warning occurred while load the model zaidalyafeai/ml-projects#10

Closed

clkruse mentioned this issue Apr 27, 2020

Evaluate network in training=True configuration #3152

Closed

rthadur closed this as completed Apr 28, 2020
