
How to use the pretrained model uniformer_base_in1k.pth as my backbone ? #20

Closed

hongsheng-Z opened this issue Feb 25, 2022 · 16 comments

@hongsheng-Z

There are some problems when I use the pre-trained model uniformer_base_in1k.pth as my backbone:
missing keys: ['patch_embed1.norm.weight', 'patch_embed1.norm.bias', 'patch_embed1.proj.weight', 'patch_embed1.proj.bias', 'patch_embed2.norm.weight', .....
unexpected keys: ['model']

@Andy1621
Collaborator

Have you used the latest version? The bug has already been fixed there, as follows:

def get_pretrained_model(self, cfg):
    if cfg.UNIFORMER.PRETRAIN_NAME:
        checkpoint = torch.load(model_path[cfg.UNIFORMER.PRETRAIN_NAME], map_location='cpu')
        if 'model' in checkpoint:
            checkpoint = checkpoint['model']
        elif 'model_state' in checkpoint:
            checkpoint = checkpoint['model_state']
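
In case it helps, here is a minimal sketch (the function name is just a placeholder, not part of this repo) of loading uniformer_base_in1k.pth into a custom backbone outside this codebase; the key point is unwrapping the 'model' entry before calling load_state_dict:

import torch

def load_uniformer_pretrain(backbone, ckpt_path='uniformer_base_in1k.pth'):
    checkpoint = torch.load(ckpt_path, map_location='cpu')
    # The released checkpoints wrap the weights under a 'model' (or 'model_state')
    # key; without unwrapping it, every real key shows up as missing and
    # 'model' as unexpected.
    if 'model' in checkpoint:
        checkpoint = checkpoint['model']
    elif 'model_state' in checkpoint:
        checkpoint = checkpoint['model_state']
    # strict=False tolerates task-specific heads that have no pre-trained weights.
    missing, unexpected = backbone.load_state_dict(checkpoint, strict=False)
    print('missing keys:', missing)
    print('unexpected keys:', unexpected)
    return backbone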

@Andy1621
Collaborator

As there is no more activity, I am closing the issue; don't hesitate to reopen it if necessary.

@hongsheng-Z
Author

OK, thanks. I have applied your model (ImageNet-1K pre-trained with Token Labeling, 224x224: uniformer_base_tl_224.pth) as the backbone for my visual tracking task. But from the current training logs, it seems that your model is not as good as other backbones (such as Swin-T and ResNet-50) on this task.

@Andy1621
Collaborator

Andy1621 commented Mar 1, 2022

@hongsheng-Z Can you try uniformer_small_224? It has the same FLOPs as Swin-T, and I'm not sure whether you used the proper hyperparameters for the larger model. In our experience, UniFormer works well for most downstream tasks.

Moreover, I am not sure whether you have used the new version of the code, since I have updated the model config to head_dim=32. The previous head_dim=64 does not match the pre-trained weights, so the performance will be poor.

Besides, head_dim=32 requires more GPU memory for downstream tasks, so I'm retraining the models with head_dim=64.
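
For illustration (the stage widths below are placeholders only, not taken from this issue; check the repo's model config for the actual values): head_dim only determines how each stage's channels are split into attention heads, so a mismatched value typically loads without errors but degrades accuracy.

# Placeholder stage widths for illustration; see the repo's model config
# for the real values of your variant.
stage_dims = [64, 128, 320, 512]

for head_dim in (32, 64):
    # num_heads per stage is dim // head_dim, so the same pre-trained projection
    # weights get split into a different number of heads depending on the config.
    num_heads = [dim // head_dim for dim in stage_dims]
    print(f'head_dim={head_dim} -> num_heads per stage: {num_heads}')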

@Andy1621
Collaborator

Andy1621 commented Mar 1, 2022

Someone else also ran into similar problems because of a wrong model config, but the performance is normal with the right config.

I suggest you check the model config.

@Andy1621
Collaborator

Andy1621 commented Mar 1, 2022

By the way, for downstream tasks, you'd better freeze BN.
I forgot to freeze BN in my experiments; such a common trick will also improve the performance.

@Andy1621 Andy1621 reopened this Mar 1, 2022
@hongsheng-Z
Author

Thank you very much for your careful reply, but I don't know how to freeze BN. Can you provide the relevant reference code?

@Andy1621
Collaborator

Andy1621 commented Mar 1, 2022

@Andy1621
Collaborator

Andy1621 commented Mar 5, 2022

@hongsheng-Z Hi! Does the new pre-trained model work for your task?

@hongsheng-Z
Author

Yes, it seems to have worked. But I still don't know how to freeze BN, and I'm not sure which BatchNorm layers in UniFormer should be frozen. Thanks for your excellent work.

@Andy1621
Collaborator

Andy1621 commented Mar 5, 2022

@hongsheng-Z Freezing BN is a trick for downstream tasks. BN should be frozen if your batch size is too small, e.g., 2 per GPU for object detection. If your batch size is large enough (>8 per GPU), freezing BN does not help. Besides, you can use SyncBN as well.

To freeze BN, you can simply set eval() for all the BN layers in the backbone. BN is used in CBlock in UniFormer.
You can find the reference code in MMDetection:

  # _BatchNorm here is torch.nn.modules.batchnorm._BatchNorm
  def train(self, mode=True):
      """Convert the model into training mode while keeping the normalization
      layers frozen."""
      super(ResNet, self).train(mode)
      self._freeze_stages()
      if mode and self.norm_eval:
          for m in self.modules():
              # trick: eval() only has an effect on BatchNorm layers
              if isinstance(m, _BatchNorm):
                  m.eval()
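
Adapted to this question, a minimal sketch (the helper name is a placeholder, not from the repo) of freezing every BN layer in a backbone:

import torch.nn as nn

def freeze_bn(backbone):
    # Put every BN layer in eval mode so its running stats stay at the
    # ImageNet pre-trained values; in UniFormer, BN lives in the CBlocks.
    for m in backbone.modules():
        if isinstance(m, (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d, nn.SyncBatchNorm)):
            m.eval()
            # Optionally also freeze the affine parameters (gamma/beta).
            for p in m.parameters():
                p.requires_grad = False

# Re-apply after every .train() call, since train() flips BN back to training mode:
# backbone.train()
# freeze_bn(backbone)

If the per-GPU batch is large enough, nn.SyncBatchNorm.convert_sync_batchnorm(backbone) can be used instead of freezing.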

@Andy1621
Collaborator

@hongsheng-Z Hi! Does UniFormer work for your task now?

@hongsheng-Z
Author

Yeah! Thank you very much for your patience in replying.

@hongsheng-Z
Author

Thanks for your excellent work; I have used it as the backbone for tracking tasks. To illustrate its role, I would like to reuse the structure diagram from your paper, such as Figure 3 (perhaps slightly changed, in the same way that Swin-based methods reuse the Swin Transformer diagram). I'm not sure whether this is allowed or not.

@hongsheng-Z hongsheng-Z reopened this Mar 31, 2022
@Andy1621
Collaborator

Andy1621 commented Apr 1, 2022

Thanks! Feel free to do it!

@Andy1621
Collaborator

As there is no more activity, I am closing the issue; don't hesitate to reopen it if necessary.
