
Subtract mean image or pixel in both training and inference #169

Closed
lukeyeager opened this issue Jul 16, 2015 · 7 comments

@lukeyeager
Member

Thanks to Yuguang Lee for pointing this out to me on digits-users.

Problem

Currently, DIGITS subtracts the mean image during training (see here) and the mean pixel during inference (see here). That's both illogical and incorrect [but still works pretty well, which is why I hadn't noticed it].

Explanation

I moved to subtracting the mean pixel during inference because it simplifies the inference path. I'll give an example to explain:

Let's say you have 256x256 images in your dataset and your crop size is 227. During training, Caffe first subtracts the 256x256 mean image from the input image, and then takes a random 227x227 crop of the image before passing it to your network. But at inference time, the network only knows that images are supposed to be 227x227. So DIGITS resizes your test image to 227x227 and then subtracts the mean pixel across the whole image before passing it to the network.
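The mismatch can be sketched in plain NumPy (synthetic arrays stand in for the dataset image and the mean file; `center_crop` is a hypothetical helper, and a center crop stands in for Caffe's random training crop):

```python
import numpy as np

def center_crop(img, size):
    """Take a centered size x size crop from an HxWxC array."""
    h, w = img.shape[:2]
    top, left = (h - size) // 2, (w - size) // 2
    return img[top:top + size, left:left + size]

rng = np.random.default_rng(0)
image = rng.uniform(0, 255, (256, 256, 3))         # stand-in for a dataset image
mean_image = rng.uniform(100, 150, (256, 256, 3))  # stand-in for the 256x256 mean

# Training path: subtract the full 256x256 mean image, then crop to 227x227.
train_input = center_crop(image - mean_image, 227)

# Old DIGITS inference path: crop/resize to 227x227 first, then subtract a
# single per-channel mean pixel.
mean_pixel = mean_image.mean(axis=(0, 1))
infer_input = center_crop(image, 227) - mean_pixel

# The two paths disagree pixel-by-pixel, which is the hybrid described above.
print(np.abs(train_input - infer_input).max() > 0)  # → True
```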

Solution

DIGITS should allow you to do either mean pixel subtraction or mean image subtraction, but it shouldn't do this strange hybrid.

Mean pixel

  1. For training - provide transform_param.mean_value (see here).
  2. For inference - resize test image to crop size and then subtract the mean pixel. DIGITS already does this.

Mean image

  1. For training - provide transform_param.mean_file (see here). DIGITS already does this.
  2. For inference - resize test image to the mean file size, subtract mean, resize again to the crop size.
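A sketch of that mean-image inference path in NumPy (constant arrays for clarity; `nn_resize` is a hypothetical nearest-neighbor stand-in for whatever interpolating resize DIGITS actually uses):

```python
import numpy as np

def nn_resize(img, out_h, out_w):
    """Nearest-neighbor resize of an HxWxC array (stand-in for a real resize)."""
    rows = np.arange(out_h) * img.shape[0] // out_h
    cols = np.arange(out_w) * img.shape[1] // out_w
    return img[rows][:, cols]

mean_image = np.full((256, 256, 3), 120.0)  # hypothetical 256x256 mean file
test_image = np.full((480, 640, 3), 200.0)  # arbitrary-size test image

# 1. Resize the test image to the mean file size.
resized = nn_resize(test_image, 256, 256)
# 2. Subtract the full mean image.
centered = resized - mean_image
# 3. Resize again to the network's crop size.
net_input = nn_resize(centered, 227, 227)
print(net_input.shape)  # → (227, 227, 3)
```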
@lukeyeager lukeyeager added the bug label Jul 16, 2015
lukeyeager added a commit that referenced this issue Jul 17, 2015
In the future, DIGITS should allow the user to subtract EITHER the mean
image OR the mean pixel. For now, let's at least be consistent.
@lukeyeager
Member Author

Temporary fix for master in 3ea12e6 and for digits-2.0 in 7143260.

Still want to enable selection of either option in the UI.

@andredubbel

Hi @lukeyeager

Not sure if this is the correct way of bringing this up, but I stumbled upon this while trying to figure out why my pycaffe implementation gives different output than DIGITS at test time, and I had some thoughts I wanted to share:

Since the test image is being resized without cropping, I guess the assumption is that the test image has already been cropped to the proper scale. Wouldn't it then be more correct to crop the mean image before subtraction rather than resizing it and thus changing the scale of whatever "mean object" it depicts?

Probably won't have a very big impact either way but feels like this way should be closer to what happens during training.
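For what it's worth, the difference between the two treatments of the mean shows up in a few lines of NumPy (a gradient array stands in for the mean file; the nearest-neighbor index trick is a stand-in for a real resize):

```python
import numpy as np

mean_image = np.arange(256 * 256 * 3, dtype=float).reshape(256, 256, 3)

# Cropping the mean preserves the scale of whatever "mean object" it depicts.
top = (256 - 227) // 2
cropped_mean = mean_image[top:top + 227, top:top + 227]

# Resizing the mean rescales its content to 227x227 instead.
idx = np.arange(227) * 256 // 227
resized_mean = mean_image[idx][:, idx]

print(cropped_mean.shape == resized_mean.shape)  # → True
print(np.allclose(cropped_mean, resized_mean))   # → False
```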

@lukeyeager
Member Author

Good question @andredubbel. I dug into it some more and learned a little more:

Training with caffe -train:

Using caffe.io.Transformer:

  • The mean that you provide must already be cropped
    • Again, not an issue for this discussion, but what do you do if you want to train with pycaffe? It seems you wouldn't be able to replicate the command-line results exactly without random crop windows for the mean image.

In order to match the VAL phase of training exactly when we test an image in DIGITS, we would need to do the following:

  1. Crop the mean according to crop_size
  2. Initialize the Transformer object with the cropped mean
  3. Resize test images to the original dataset size (i.e. 256x256)
  4. Crop test image to crop size (i.e. 227x227)
    • Can't let the Transformer do the 256->227 conversion because it resizes instead of crops
  5. Use the Transformer to preprocess the image
    • Subtract the mean, etc.
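Ignoring the Transformer object itself, the arithmetic of those five steps can be sketched in NumPy (sizes follow the 256/227 example; the helpers are hypothetical, and nearest-neighbor resizing stands in for real interpolation):

```python
import numpy as np

def center_crop(img, size):
    """Centered size x size crop of an HxWxC array (Caffe's VAL-phase crop)."""
    h, w = img.shape[:2]
    top, left = (h - size) // 2, (w - size) // 2
    return img[top:top + size, left:left + size]

def nn_resize(img, out_h, out_w):
    """Nearest-neighbor resize (stand-in for a real resize)."""
    rows = np.arange(out_h) * img.shape[0] // out_h
    cols = np.arange(out_w) * img.shape[1] // out_w
    return img[rows][:, cols]

mean_image = np.random.default_rng(1).uniform(100, 150, (256, 256, 3))
test_image = np.random.default_rng(2).uniform(0, 255, (480, 640, 3))

# 1. Crop the mean according to crop_size.
cropped_mean = center_crop(mean_image, 227)
# 2. (A real implementation would initialize the Transformer with this mean.)
# 3. Resize the test image to the original dataset size.
resized = nn_resize(test_image, 256, 256)
# 4. Crop the test image to crop size, so the Transformer's resize is a noop.
cropped = center_crop(resized, 227)
# 5. Preprocess: subtract the cropped mean.
net_input = cropped - cropped_mean
print(net_input.shape)  # → (227, 227, 3)
```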

Here's what we're currently doing instead:

  1. Resize mean to crop_size
  2. Initialize the Transformer with the resized mean
  3. Resize test image to the original dataset size
  4. Use the Transformer to preprocess the image
    • Resize to crop_size
    • Subtract the resized mean

Things we would need to change:

  1. Crop the mean image instead of resizing to crop_size
  2. Crop images before passing them to Transformer.preprocess (then the resize within Transformer will be a noop)

Those seem like pretty reasonable changes to me.

@andredubbel

Wow, thanks for a very thorough response @lukeyeager

Things we would need to change:

  1. Crop the mean image instead of resizing to crop_size
  2. Crop images before passing them to Transformer (then the resize within Transformer will be a noop)

I definitely agree with point 2, point 1 I think is a question of preference if not correctness.

I tend to think of the training data's crop size as its "actual" size, and of the input size as that size plus padding. The question then is whether you expect test images to have the same padding or not.

When deploying the model you most likely won't be doing any padding and if you wanted DIGITS to match a deployed system you might be better off just sticking to the squashing. That is, unless you want to have the option of oversampling in which case some padding might be needed :)

In the end I think I would be happy with either solution, as long as it's clear what's expected of the test images.

@willishf

Looks like I have run into the mean pixel vs. mean image problem while comparing results from the DIGITS 5.0 interface to the method used in example.py. It appears to have a significant impact, and searching the forum I can only find a discussion with no solution.

I have already trained a model with mean image subtraction, which was the default, and it took 3 days to train. Testing the model through the DIGITS web interface, I am very happy with the results: for the primary target I want to identify there are minimal false positives on the other test images and near-perfect results on the primary image. I'm working toward an April 1st deadline for a cancer tumor classification challenge, so I don't have time to do anything significantly different at this point.

The images I am submitting to example.py are already 256x256, so they don't need to be resized. The images I submitted to Caffe for training were also 256x256, so resizing shouldn't be the source of the problem.

I am passing the mean file on the command line via --mean mean.binaryproto, which triggers the `if mean_file` branch in get_transformer(deploy_file, mean_file=None).

The comment there clearly indicates `# set mean pixel` as the method. Looking through the code (I have no familiarity with the API) I don't see an obvious path to subtracting the mean image, though I assume it should be a trivial change.

Is it possible that someone who is familiar with the API could look at the code and provide the changes required to do mean image subtraction instead of mean pixel subtraction?

The model that looks very good through the DIGITS web interface is not performing well on test images via the example.py approach. I already have a pipeline tested with example.py over many, many images, so I'm hoping a one- or two-line code change will improve the results.

@karimhasebou

@willishf were you able to fix the problem? I have a model that was trained in DIGITS and gets 96% validation accuracy in DIGITS. Once I load it into pycaffe, it gets 10% on the same validation data.

@willishf

willishf commented Aug 30, 2017 via email
