Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some problems about baseline #5

Closed
ZzBros opened this issue Sep 13, 2021 · 1 comment
Closed

Some problems about baseline #5

ZzBros opened this issue Sep 13, 2021 · 1 comment

Comments

@ZzBros
Copy link

ZzBros commented Sep 13, 2021

When I was training the baseline network (useing exp_cifar10_wrn2810_1net_standard_bar1.yaml) , I want to know if I understand the following correctly?

  1. mixmo uses random sampling (learning every sample once on one epoch)
  2. mixmo uses DADataset
  3. mixmo uses wideRsenet28-10
  4. mixmo with warmup lr in first epoch and reduce to 0.1 * lr in [101,201,226] epoch
  5. mixmo with L2 regular for net params
  6. mixmo uses the specified initialization

I look forward to your comment if I have missed anything !!!
Thanks!!!

@alexrame
Copy link
Owner

alexrame commented Sep 13, 2021

Yes, all this is true!
Note that config/cifar10/exp_cifar10_wrn2810_1net_standard_bar1.yaml is a configuration file for a standard baseline network without data augmentation nor ensembling, so should not be called "mixmo" but rather "vanilla".
In detail, the critical code sections for the different points are:

  1. vanilla uses random sampling (learning every sample once on one epoch)

  2. vanilla uses MSDADataset with msda_mix_method == None, which is strictly equivalent to DADataset

    class_dataset_wrapper = dataset_wrapper.MSDADataset

  3. vanilla uses wide Resnet28-10

    class WideResNet(PreActResNet):

  4. vanilla with warmup lr in first epoch (=782 steps) and reduce to 0.1 * lr in [101,201,226] epoch


    def _init_scheduler(self, epoch):

  5. vanilla with L2 regular for net params


    if self.config_loss.get("l2_reg"):

  6. vanilla uses the specified initialization

    def _init_weights_resnet(self):

If you have any questions please let me know.
Best regards

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants