
About Checkpoints #9

Closed
WY-2022 opened this issue Apr 12, 2022 · 1 comment
WY-2022 commented Apr 12, 2022

Hi! I have another question. If I just pip install the package and then do:

import torch.nn as nn
from swin_transformer_v2 import SwinTransformerV2, swin_transformer_v2_t

class SWIN(nn.Module):
    def __init__(self, num_classes=4):
        super().__init__()
        self.num_classes = num_classes
        # self.pool = nn.MaxPool2d(2, 2)
        self.encoder: SwinTransformerV2 = swin_transformer_v2_t(in_channels=3,
                                                                window_size=8,
                                                                input_resolution=(1024, 1280),
                                                                sequential_self_attention=False,
                                                                use_checkpoint=True)
        self.p = self.encoder.patch_embedding
        self.encoder0 = self.encoder.stages[0]
        ... ...

How do I use the checkpoints now? And is there a pre-trained model for v2_base?
(Also, when I just run the code above, a weird problem arises: `warnings.warn("None of the inputs have requires_grad=True. Gradients will be None")`.)
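For context on that warning: it is emitted by `torch.utils.checkpoint` when none of the tensors entering a checkpointed segment have `requires_grad=True`, in which case no gradients can flow through the segment. A minimal sketch of the cause and a fix (standalone `nn.Linear` stand-in, not the Swin code itself):

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint

layer = nn.Linear(4, 4)

# A plain input tensor has requires_grad=False; feeding it through
# torch.utils.checkpoint is what triggers
# "None of the inputs have requires_grad=True. Gradients will be None".
x = torch.randn(2, 4)

# Fix: make sure the tensor entering the checkpointed segment requires
# grad (and/or use the non-reentrant implementation).
x.requires_grad_()
y = checkpoint(layer, x, use_reentrant=False)
y.sum().backward()
print(x.grad is not None)  # True
```

During normal training this is usually harmless noise from the earliest checkpointed block, since the raw input images do not require gradients.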

@ChristophReich1996 (Owner) commented

To load the provided checkpoints you need to initialize the network in the training configuration, load the state dict, and then change the resolution/window size to your needs. Here is an example for the CIFAR10 checkpoint:

import torch
from swin_transformer_v2 import swin_transformer_v2_t, SwinTransformerV2

swin_transformer: SwinTransformerV2 = swin_transformer_v2_t(input_resolution=(32, 32),
                                                            window_size=8,
                                                            sequential_self_attention=False,
                                                            use_checkpoint=True)
swin_transformer.load_state_dict(torch.load("path_to_weights/cifar10_swin_t_best_model_backbone.pt"))
swin_transformer.update_resolution(new_window_size=8, new_input_resolution=(1024, 1280))

Here is an example for the Places365 checkpoint:

import torch
from swin_transformer_v2 import swin_transformer_v2_b, SwinTransformerV2

swin_transformer: SwinTransformerV2 = swin_transformer_v2_b(input_resolution=(256, 256),
                                                            window_size=8,
                                                            sequential_self_attention=False,
                                                            use_checkpoint=True)
swin_transformer.load_state_dict(torch.load("path_to_weights/places365_swin_b_best_model_backbone.pt"))
swin_transformer.update_resolution(new_window_size=8, new_input_resolution=(1024, 1280))

The CIFAR10 checkpoint is for the tiny model and the Places365 checkpoint is for the base model.

Please note that pre-trained ImageNet-1k weights are available in the timm library!
