Fix different results over three backends for ResNet50 and MobileNet #9473

Merged: 1 commit into keras-team:master from taehoonlee:fix_resnet_mobilenet on Feb 24, 2018

@taehoonlee (Member) commented Feb 24, 2018

As discussed in #9457, there are discrepancies between backends for ResNet50 and MobileNet. The test script is:

import numpy as np
from keras.preprocessing import image
from keras.applications.mobilenet import decode_predictions

from keras.applications.mobilenet import MobileNet
from keras.applications.mobilenet import preprocess_input as p1
from keras.applications.resnet50 import ResNet50
from keras.applications.resnet50 import preprocess_input as p2

model1 = MobileNet(weights='imagenet')
model2 = ResNet50(weights='imagenet')

img = image.load_img('cat.png', target_size=(224, 224))
x = image.img_to_array(img)
x = np.expand_dims(x, axis=0)
x1 = p1(x.copy())
x2 = p2(x.copy())

print('Predicted:', decode_predictions(model1.predict(x1), top=3)[0])
print('Predicted:', decode_predictions(model2.predict(x2), top=3)[0])

The outputs of CNTK and Theano are:

 $ KERAS_BACKEND=cntk python test_mobilenet.py
Using CNTK backend
Selected GPU[0] Tesla P100-PCIE-16GB as the process wide default device.
('Predicted:', [(u'n02123159', u'tiger_cat', 0.25886607), (u'n02124075', u'Egyptian_cat', 0.14433344), (u'n02123045', u'tabby', 0.06473238)])
('Predicted:', [(u'n02124075', u'Egyptian_cat', 0.17055313), (u'n02123597', u'Siamese_cat', 0.11803201), (u'n03482405', u'hamper', 0.055440076)])

 $ KERAS_BACKEND=theano python test_mobilenet.py
Using Theano backend.
('Predicted:', [(u'n02123159', u'tiger_cat', 0.2602814), (u'n02124075', u'Egyptian_cat', 0.14234008), (u'n02123045', u'tabby', 0.06521355)])
('Predicted:', [(u'n02124075', u'Egyptian_cat', 0.17173238), (u'n02123597', u'Siamese_cat', 0.10258683), (u'n02130308', u'cheetah', 0.0565808)])

The problem is that TensorFlow differs as follows:

 $ KERAS_BACKEND=tensorflow python test_mobilenet.py
Using TensorFlow backend.
('Predicted:', [(u'n03482405', u'hamper', 0.21035317), (u'n02123159', u'tiger_cat', 0.113607496), (u'n04074963', u'remote_control', 0.09508652)])
('Predicted:', [(u'n02124075', u'Egyptian_cat', 0.116303556), (u'n02123597', u'Siamese_cat', 0.075559795), (u'n02130308', u'cheetah', 0.048775144)])

The main problem stems from the different padding behaviour of Conv2D(filters, kernel, padding='same', strides=(2, 2)) across backends. With explicit ZeroPadding2D layers, TensorFlow matches the other backends up to small numerical differences.
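The divergence can be sketched in one dimension with plain NumPy (my own illustration, not code from Keras or the backends): TensorFlow's 'SAME' mode computes a total padding amount and places any odd leftover pixel at the end, while Theano/CNTK pad a fixed symmetric halo, which is exactly what an explicit ZeroPadding2D reproduces.

```python
import numpy as np

def conv1d(x, k, stride, pad_before, pad_after):
    """Minimal strided 'valid' correlation after explicit zero-padding."""
    x = np.concatenate([np.zeros(pad_before), x, np.zeros(pad_after)])
    n_out = (len(x) - len(k)) // stride + 1
    return np.array([np.dot(x[i * stride:i * stride + len(k)], k)
                     for i in range(n_out)])

x = np.ones(6)            # even-length input, as in many ResNet/MobileNet stages
k = np.array([1., 1., 1.])

# TensorFlow 'SAME' with stride 2: total padding is 1, placed at the end.
tf_out = conv1d(x, k, stride=2, pad_before=0, pad_after=1)
# Symmetric padding (Theano/CNTK, or an explicit ZeroPadding2D(1)).
sym_out = conv1d(x, k, stride=2, pad_before=1, pad_after=1)

print(tf_out)   # [3. 3. 2.]
print(sym_out)  # [2. 3. 3.]
```

Both schemes yield the same output length, but the window positions shift by one pixel, so the values (and hence the downstream predictions) differ.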

 $ git checkout fix_resnet_mobilenet
Switched to branch 'fix_resnet_mobilenet'
 $ KERAS_BACKEND=tensorflow python test_mobilenet.py
Using TensorFlow backend.
('Predicted:', [(u'n02123159', u'tiger_cat', 0.2602828), (u'n02124075', u'Egyptian_cat', 0.14233977), (u'n02123045', u'tabby', 0.0652139)])
('Predicted:', [(u'n02124075', u'Egyptian_cat', 0.17173262), (u'n02123597', u'Siamese_cat', 0.102586426), (u'n02130308', u'cheetah', 0.056580666)])
@fchollet (Collaborator) left a comment

Thanks for looking into this. LGTM.

@fchollet fchollet merged commit c2a8450 into keras-team:master Feb 24, 2018

1 check passed: continuous-integration/travis-ci/pr (The Travis CI build passed)

@taehoonlee taehoonlee deleted the taehoonlee:fix_resnet_mobilenet branch Feb 25, 2018

dschwertfeger added a commit to dschwertfeger/keras that referenced this pull request Feb 26, 2018

Merge remote-tracking branch 'upstream/master' into tensorboard-callback-embeddings-from-layer-outputs

* upstream/master: (443 commits)
  Fix `pool_2d` of Theano for backend compatibility (#9479)
  Clean up conv backend tests (#9478)
  Refactoring of ConvLSTM2D. Added ConvRNN2D and ConvLSTM2DCell. (#9094)
  Fix different results over three backends for ResNet50 and MobileNet (#9473)
  Add depthwise conv2d for Theano and CNTK (#9457)
  Fix ImageDataGenerator preprocessing_function (#9273)
  Add separable conv2d for CNTK (#9442)
  Remove word “shuffled” from comments in examples (#9453)
  Add random brightness to Image Preprocessing (Code Cleanup) (#9390)
  Only print Theano RNN dropout warning when needed.
  Add train test split to DirectoryIterator (#6152)
  Misc: Slight optimisation (#9445)
  Force update of Sequences for Windows (#9436)
  Move tests for datasets (#9439)
  Style fixes (#9441)
  Increase test coverages by excluding several lines (#9428)
  Fixing minor bug in pretrained_word_embeddings example (#9438)
  Optimizer - set_weights : check weights length (#9435)
  Add `conv_utils_test` (#9429)
  Enable `variable_input_channels` test for `InceptionV3` (#9425)
  ...
@ozabluda (Contributor) commented Mar 11, 2018

@taehoonlee, since you already understand the issue (I don't), could you please document the difference in behavior of Keras Conv2D(padding='same') and friends across backends? Maybe this will answer my question of why we synchronize this way (instead of, say, "fixing" Theano/CNTK Conv2D and friends).

@fchollet

@taehoonlee (Member) commented Mar 12, 2018

@ozabluda, here is a toy example:

import numpy as np

from keras.layers import Conv2D
from keras.models import Sequential
from keras import backend as K

w = [np.arange(3*3*5*2).reshape(3, 3, 5, 2)]
if K.backend() == 'theano':
    w = [np.flip(np.flip(w[0], 0), 1)]

model = Sequential()
model.add(Conv2D(2, kernel_size=3, strides=2, input_shape=(6, 6, 5),
                 padding='SAME', use_bias=False))
model.set_weights(w)
model.predict(np.ones((1, 6, 6, 5)))

The results are:

 $ KERAS_BACKEND=theano python test.py
array([[[[1280., 1300.],
         [1770., 1800.],
         [1770., 1800.]],

        [[1470., 1500.],
         [1980., 2025.],
         [1980., 2025.]],

        [[1470., 1500.],
         [1980., 2025.],
         [1980., 2025.]]]], dtype=float32)
 $ KERAS_BACKEND=cntk python test.py
array([[[[1280., 1300.],
         [1770., 1800.],
         [1770., 1800.]],

        [[1470., 1500.],
         [1980., 2025.],
         [1980., 2025.]],

        [[1470., 1500.],
         [1980., 2025.],
         [1980., 2025.]]]], dtype=float32)
 $ KERAS_BACKEND=tensorflow python test.py
array([[[[1980., 2025.],
         [1980., 2025.],
         [1170., 1200.]],

        [[1980., 2025.],
         [1980., 2025.],
         [1170., 1200.]],

        [[ 870.,  900.],
         [ 870.,  900.],
         [ 480.,  500.]]]], dtype=float32)

The difference comes from the padding behavior when padding='same', strides=2. Here is a more detailed breakdown:

|                          | Theano vs. CNTK vs. TensorFlow |
| ------------------------ | ------------------------------ |
| odd kernel / odd input   | identical                      |
| odd kernel / even input  | differs                        |
| even kernel / odd input  | differs                        |
| even kernel / even input | differs                        |
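The odd/even breakdown above can be reproduced from the padding arithmetic alone. The following is a sketch under the assumption that TensorFlow uses its 'SAME' rule and that Theano/CNTK pad a symmetric kernel // 2 halo (Theano's 'half' mode); the helper names are mine, not code taken from either library.

```python
import math

def tf_same_pad(in_size, kernel, stride=2):
    """(before, after) padding as TensorFlow's 'SAME' mode computes it."""
    out = math.ceil(in_size / stride)
    total = max((out - 1) * stride + kernel - in_size, 0)
    return total // 2, total - total // 2  # any odd leftover pixel goes after

def half_pad(kernel):
    """(before, after) symmetric padding, kernel // 2 on each side."""
    return kernel // 2, kernel // 2

rows = []
for kernel, in_size in [(3, 7), (3, 6), (4, 7), (4, 6)]:
    agree = tf_same_pad(in_size, kernel) == half_pad(kernel)
    rows.append('identical' if agree else 'differs')

print(rows)  # ['identical', 'differs', 'differs', 'differs']
```

Only the odd-kernel / odd-input case produces the same (before, after) padding under both rules, matching the table.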
@goravkaul commented Jun 7, 2018

I'm getting the same output shape for a 1×1 kernel with different padding, on the TensorFlow backend:

from keras.layers import Input, Conv2D

inputs = Input(shape=(32, 32, 3))  # matches the shapes printed below

conv5 = Conv2D(16, kernel_size=(1, 1), strides=1, padding='same')
conv6 = Conv2D(16, kernel_size=(1, 1), strides=1, padding='valid')
conv7 = Conv2D(16, kernel_size=(1, 1), strides=2, padding='same')
conv8 = Conv2D(16, kernel_size=(1, 1), strides=2, padding='valid')

x10 = inputs
xx5 = conv5(x10)
xx6 = conv6(x10)
xx7 = conv7(x10)
xx8 = conv8(x10)

print("Input shape is:", inputs.shape)
print("kernel_size=(1,1),strides=1,padding='same'", xx5.shape)
print("kernel_size=(1,1),strides=1,padding='valid'", xx6.shape)
print("kernel_size=(1,1),strides=2,padding='same'", xx7.shape)
print("kernel_size=(1,1),strides=2,padding='valid'", xx8.shape)

Output:
('Input shape is:', TensorShape([Dimension(None), Dimension(32), Dimension(32), Dimension(3)]))
("kernel_size=(1,1),strides=1,padding='same'", TensorShape([Dimension(None), Dimension(32), Dimension(32), Dimension(16)]))
("kernel_size=(1,1),strides=1,padding='valid'", TensorShape([Dimension(None), Dimension(32), Dimension(32), Dimension(16)]))
("kernel_size=(1,1),strides=2,padding='same'", TensorShape([Dimension(None), Dimension(16), Dimension(16), Dimension(16)]))
("kernel_size=(1,1),strides=2,padding='valid'", TensorShape([Dimension(None), Dimension(16), Dimension(16), Dimension(16)]))

In the above results, xx5 and xx6 have the same shape (as do xx7 and xx8), even though different padding is used for the convolution.
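That matching is expected: a 1×1 kernel never requires any padding, so 'same' and 'valid' coincide for every stride. A small sketch of the standard output-length formulas (my own helper, not Keras code) makes this concrete:

```python
import math

def out_len(in_size, kernel, stride, padding):
    """Output length of a 1-D convolution along one spatial axis."""
    if padding == 'same':
        return math.ceil(in_size / stride)
    return (in_size - kernel) // stride + 1  # 'valid'

# With a 1x1 kernel, 'same' and 'valid' always agree:
print(out_len(32, 1, 1, 'same'), out_len(32, 1, 1, 'valid'))  # 32 32
print(out_len(32, 1, 2, 'same'), out_len(32, 1, 2, 'valid'))  # 16 16
# With a larger kernel the two modes diverge again:
print(out_len(32, 3, 2, 'same'), out_len(32, 3, 2, 'valid'))  # 16 15
```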

@masoodmortazavi commented Aug 30, 2018

It is really amazing to me that padding, which should be an integer, has been left to be interpreted from a strange set of strings (for ease of use?). When ease of use destroys clarity, it is time for an upgrade :-) .
