
Batchnormtrack Flag setting for cifar10 #10

Closed
jizongFox opened this issue May 8, 2019 · 11 comments



jizongFox commented May 8, 2019

Hi @xu-ji,
Thanks for this wonderful work. I am re-running your code and noticed that in commands.txt the CIFAR10 setting omits --batchnorm_track, while most of the other commands include it. I can understand freezing BN in a fine-tuning setting, but that is apparently not the case here. Could you tell me why BN has been frozen for this particular setting, training CIFAR10 from scratch?
Thanks in advance for your help.

xu-ji commented May 9, 2019

Setting --batchnorm_track means each batchnorm module in the network has track_running_stats set to True. So after calling net.eval(), inference on the evaluation data uses the batchnorm statistics accumulated during training. If it is False, the batchnorm statistics are computed from the evaluation batches on the fly (docs). There's no particular reason to prefer one over the other. It makes a negligible difference, e.g. 0.7% for CIFAR10, because the training and test sets are the same for most of the datasets in the fully unsupervised setting.
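The two modes can be sketched with a minimal pure-Python 1-D batch norm (an illustration of the concept, not the PyTorch implementation; the class name and momentum value are made up here):

```python
import statistics

class MiniBatchNorm:
    """Minimal 1-D batch norm sketch illustrating track_running_stats."""

    def __init__(self, track_running_stats=True, momentum=0.1, eps=1e-5):
        self.track_running_stats = track_running_stats
        self.momentum = momentum
        self.eps = eps
        self.running_mean = 0.0
        self.running_var = 1.0
        self.training = True

    def __call__(self, batch):
        if self.training or not self.track_running_stats:
            # Use statistics of the current batch ("on the fly").
            mean = statistics.fmean(batch)
            var = statistics.pvariance(batch)
        else:
            # eval() with tracking: reuse statistics accumulated in training.
            mean, var = self.running_mean, self.running_var
        if self.training and self.track_running_stats:
            m = self.momentum
            self.running_mean = (1 - m) * self.running_mean + m * mean
            self.running_var = (1 - m) * self.running_var + m * var
        return [(x - mean) / (var + self.eps) ** 0.5 for x in batch]

# With tracking, eval-time normalisation uses training-time statistics;
# without it, each eval batch is normalised by its own statistics.
bn = MiniBatchNorm(track_running_stats=True)
bn([0.0, 2.0, 4.0])          # one "training" step updates running stats
bn.training = False           # net.eval()
out_tracked = bn([10.0, 12.0, 14.0])

bn_free = MiniBatchNorm(track_running_stats=False)
bn_free.training = False
out_batch = bn_free([10.0, 12.0, 14.0])
```

When the eval data is drawn from the same distribution as the training data, the two normalisations converge, which is why the flag matters so little in the fully unsupervised setting.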

The exception is STL, where the full training set contains many distractor classes in the unlabelled images that are not present in the test set. This is why, when setting --batchnorm_track for STL, it makes sense to set --double_eval too: it makes an additional pass through the test data (= training data for the main output head) without any training on the IIC loss. This updates the batchnorm statistics to the main output head's (= test) data before they are used in evaluation.
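The effect of that extra pass can be sketched as follows (a stdlib-only toy with illustrative numbers and momentum, not the actual IIC code): batchnorm stays in training mode so the running statistics are updated, but no loss is computed and no weights change.

```python
momentum = 0.1
running_mean, running_var = 0.0, 1.0   # left over from training on distractor-heavy data

# "Test" data with a shifted mean, seen in forward passes only (no backprop).
test_batches = [[9.0, 10.0, 11.0]] * 50

for batch in test_batches:
    mean = sum(batch) / len(batch)
    var = sum((x - mean) ** 2 for x in batch) / len(batch)
    # Standard exponential-moving-average update of the running statistics.
    running_mean = (1 - momentum) * running_mean + momentum * mean
    running_var = (1 - momentum) * running_var + momentum * var

# running_mean has drifted to ~10.0, matching the evaluation data,
# before net.eval() freezes it for the actual evaluation pass.
```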

@xu-ji xu-ji closed this as completed May 13, 2019
@jizongFox

Hi,
Thanks for the reply.
I would like to ask you about another detail.

assert (all_imgs.requires_grad and all_imgs_tf.requires_grad)

In this line you require the input images to be differentiable, while in cluster_greyscale.py you don't. Could you explain why the gradient is required here?


xu-ji commented Jun 2, 2019

Hi, that's redundant, thanks for pointing it out. I've removed it.
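(Presumably the assert is redundant because training the weights only uses the input's value, never gradients with respect to the input itself. A toy sketch with y = w·x and loss L = y², computing the chain rule by hand:)

```python
def weight_grad(w, x):
    """Gradient of L = (w * x)**2 with respect to w, by the chain rule."""
    y = w * x            # forward pass
    return 2 * y * x     # dL/dw = dL/dy * dy/dw; x appears only as a value

# The weight gradient is well defined with x treated as a constant,
# i.e. the input never needs requires_grad for the parameters to train.
g = weight_grad(w=3.0, x=2.0)  # 2 * (3.0 * 2.0) * 2.0 = 24.0
```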


jizongFox commented Jun 14, 2019

Hi @xu-ji
Attached is a training summary for MNIST using your provided code. Is this normal?
[plots attached]


xu-ji commented Jun 14, 2019

Yes, it is.

@jizongFox

Is it normal to have an average accuracy of 84%?


xu-ji commented Jun 18, 2019

Sorry, I skimmed over that second graph. It’s ok but not as good as my reported model. My trained model:

[plots attached]

(By the way if you download the models you can see the plots and records.)

As you can see in that graph, the average is 98.4. Other MNIST models I've trained averaged 96.6, 92.0, 92.5 and 95.9.


jizongFox commented Jun 27, 2019

If I understand correctly, you ran several experiments and chose to show the best one.


xu-ji commented Jun 27, 2019

Yes, I ran a few experiments and show the best model.

@primecai

Can we say that as long as the distributions of the train and test sets stay the same, setting this flag or not should not make much difference? For example, if we split CIFAR10 into a 7:3 train-test partition, train on the train set, and test on the unseen test set with a batch size of 660, this flag should not affect performance much?


xu-ji commented Apr 13, 2020

If the test batches' statistics are representative of the training batches' statistics (same class distribution, same input distribution, same size) and training is given enough time for batchnorm stats to reflect the latest features, then yes theoretically there should not be a material difference between taking the test time batchnorm statistics from training batches or test batches. In practice, there may be a small difference.
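The "representative statistics" condition can be illustrated numerically. A small stdlib-only sketch (hypothetical distribution and momentum) shows that with 660-sample batches drawn from one distribution, a fresh test batch's mean lands very close to the running mean accumulated over training batches:

```python
import random

random.seed(0)

def draw_batch():
    # Both "train" and "test" batches come from the same distribution.
    return [random.gauss(5.0, 2.0) for _ in range(660)]

# Running mean accumulated over "training" batches (illustrative momentum).
momentum, running_mean = 0.1, 0.0
for _ in range(200):
    batch = draw_batch()
    running_mean = (1 - momentum) * running_mean + momentum * (sum(batch) / 660)

# A test batch from the same distribution has nearly the same mean, so
# normalising with either estimate gives almost identical outputs.
test_batch = draw_batch()
test_mean = sum(test_batch) / 660
gap = abs(running_mean - test_mean)
```

The gap shrinks with batch size (the batch mean's spread scales as sigma/sqrt(n)), which is why large batches like 660 make the flag even less consequential; distribution shift or tiny batches would widen it.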
