
Low Test Accuracy using Inception_V3 #422

Closed
soumendukrg opened this issue Nov 7, 2019 · 5 comments

@soumendukrg
Contributor

soumendukrg commented Nov 7, 2019

I found that the test accuracy of the pretrained Inception_V3 model on the ImageNet dataset in Distiller is not the same as reported in the paper and in the torchvision (PyTorch) documentation. Here are the results:
Top1: 69.538, Top5: 88.654
Expected Top1: 77.45, Top5: 93.56

I am using the following code to evaluate:
$ python compress_classifier.py /path_to_imagenet2012/ -a=inception_v3 --gpu 0 -e --pretrained

However, the accuracy for other networks such as ResNet, DenseNet, and AlexNet is close to the numbers reported in the PyTorch docs. Do I have to specify any additional argument for Inception_V3?

@soumendukrg
Contributor Author

I found that the data_loaders file hard-codes the classification input size for ImageNet as (1, 3, 224, 224). However, PyTorch's inception_v3 requires an input size of (1, 3, 299, 299).

When I modified the file to use the input size that Inception expects, the reported test accuracy is:
Top1: 77.318, Top5: 93.402, which is almost the same as reported for the torchvision classification models.

This file and the files linked to it need to be modified, @nzmora, unless there is another way to fix this issue.
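
For reference, here is a minimal sketch of how one could verify the pretrained inception_v3 accuracy outside Distiller with a 299x299 input pipeline. The 342-resize / 299-crop values follow common torchvision evaluation practice for this model, and the dataset path is a placeholder, so treat this as an assumption-laden check rather than Distiller's actual code:

    # Standalone evaluation sketch (assumes torchvision is installed and
    # /path_to_imagenet2012/val holds the standard ImageNet validation layout).
    import torch
    import torchvision
    from torchvision import transforms

    model = torchvision.models.inception_v3(pretrained=True)
    model.eval().cuda()

    # Inception_V3 expects 299x299 inputs; resize to 342 then center-crop
    # to 299, instead of the usual 256 -> 224 pipeline.
    preprocess = transforms.Compose([
        transforms.Resize(342),
        transforms.CenterCrop(299),
        transforms.ToTensor(),
        transforms.Normalize(mean=[0.485, 0.456, 0.406],
                             std=[0.229, 0.224, 0.225]),
    ])

    val_set = torchvision.datasets.ImageFolder('/path_to_imagenet2012/val', preprocess)
    val_loader = torch.utils.data.DataLoader(val_set, batch_size=64, num_workers=4)

    correct = total = 0
    with torch.no_grad():
        for images, targets in val_loader:
            outputs = model(images.cuda())
            correct += (outputs.argmax(dim=1).cpu() == targets).sum().item()
            total += targets.size(0)
    print('Top1: {:.3f}%'.format(100.0 * correct / total))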

@nzmora
Contributor

nzmora commented Nov 7, 2019

Hi @soumendukrg,

Thanks - this is an important bug to fix!
Would you care to issue a PR with the fix? It would help speed up delivering the fix to others.
Thanks!
Neta

@soumendukrg
Contributor Author

Thanks for responding. I will submit a PR with the fix. In fact, for training we need to change both the data size and the loss computation in image_classifier.py, at line 501.

    if not early_exit_mode(args):
        # Inception models return a tuple (main logits, aux logits) in
        # training mode; sum the criterion over all heads.
        if isinstance(output, tuple):
            loss = sum(criterion(o, target) for o in output)
        else:
            loss = criterion(output, target)

        # Measure accuracy using only the main classifier's output
        if isinstance(output, tuple):
            classerr.add(output[0].data, target)
        else:
            classerr.add(output.data, target)

This adds the auxiliary-logits loss to the main loss, which improves training, but I read somewhere that we can omit that loss, since during validation/test that branch is essentially cut off from the network. In that case, the above code will change. What do you suggest I do for this part?
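
For comparison, here is a sketch of the alternative used in the PyTorch fine-tuning tutorial, where the auxiliary loss is down-weighted by 0.4 instead of being summed at full weight. The unpacking and the 0.4 factor are assumptions drawn from that tutorial, not Distiller's code:

    # Weighted aux-loss variant (loss = loss_main + 0.4 * loss_aux), as in
    # the PyTorch fine-tuning tutorial; the tuple only appears in training mode.
    if isinstance(output, tuple):
        main_out, aux_out = output[0], output[1]
        loss = criterion(main_out, target) + 0.4 * criterion(aux_out, target)
    else:
        loss = criterion(output, target)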

nzmora added a commit that referenced this issue Apr 27, 2020
* Merge pytorch 1.3 commits

This PR is a fix for issue #422.

1. ImageNet models usually use input size [batch, 3, 224, 224], but all Inception models require an input image size of [batch, 3, 299, 299].

2. Inception models have auxiliary branches that contribute to the loss only during training. The reported classification loss considers only the main branch's loss.

3. Inception_V3 normalizes the input inside the network itself.  More details can be found in @soumendukrg's PR #425 [comments](#425 (comment)).

NOTE: Training with Inception_V3 is currently only possible on a single GPU. The issue linked below describes the problem; I have checked, and it persists in torch 1.3.0:
[inception_v3 of vision 0.3.0 does not fit in DataParallel of torch 1.1.0 #1048](pytorch/vision#1048)

Co-authored-by: Neta Zmora <neta.zmora@intel.com>
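
As an illustration of points 1 and 3 in the commit message above, the sketch below constructs the model the way torchvision's inception_v3 signature allows; the exact flag values Distiller ended up using are in PR #425, so this is an assumption-labeled example, not the merged code:

    # With transform_input=True, inception_v3 re-scales inputs that were
    # normalized with the standard ImageNet mean/std into the normalization
    # the original model was trained with (point 3 above).
    import torch
    from torchvision.models import inception_v3

    model = inception_v3(pretrained=True, transform_input=True, aux_logits=True)
    model.eval()

    # 299x299 input, per point 1 above; dummy batch for a quick shape check.
    dummy = torch.randn(1, 3, 299, 299)
    with torch.no_grad():
        out = model(dummy)   # in eval mode, a plain [1, 1000] logits tensor
    print(out.shape)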
@nzmora
Contributor

nzmora commented Apr 27, 2020

Thanks @soumendukrg for the fix! It took some time, but we finally merged the PR.

@nzmora nzmora closed this as completed Apr 27, 2020
@soumendukrg
Contributor Author

Thanks @nzmora for the merge. I have been working on object detection compression and have one suggestion/fix, which I will put up in a separate issue.

michaelbeale-IL pushed a commit that referenced this issue Apr 24, 2023