Hi, there are a number of bugs that prevent me from running `train_supernet.py` successfully.

Model settings issue in train_supernet.py:

In addition, the model is imported with `from model.supermodel import BertForSequenceClassification`, and then both `teacher_model` and `super_model` are created with the same SuperModel (SuperEmbedding, SuperLinear, SuperLayerNorm) settings. Is that intended?
The `teacher_model` is only there for further distillation; in our current implementation we do not actually use it, as you can see in the code, so please ignore it. We set the config of the superBERT with the function `set_sample_config`, so the two models end up different. I will make the code clearer in the next few days.
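To make the distinction concrete: the idea behind `set_sample_config` is that a "super" layer holds max-size weights and slices out a sub-layer at run time, so two models built from the same SuperModel classes behave differently once different sample configs are set. The snippet below is a toy illustration of that mechanism, not the repo's actual `SuperLinear` implementation; the class name and method signature here are assumptions for demonstration.

```python
import torch
import torch.nn as nn


class SuperLinear(nn.Linear):
    """Toy super-layer: stores max-size weights, slices a sub-layer at run time."""

    def set_sample_config(self, in_dim, out_dim):
        # Record which slice of the full weight matrix to use.
        self.sample_in, self.sample_out = in_dim, out_dim

    def forward(self, x):
        # Use only the sampled sub-matrix of the full (max-size) weights.
        w = self.weight[: self.sample_out, : self.sample_in]
        b = self.bias[: self.sample_out]
        return nn.functional.linear(x, w, b)


layer = SuperLinear(8, 8)
layer.set_sample_config(4, 2)   # sample a 4->2 sub-layer from the 8->8 super-layer
out = layer(torch.randn(3, 4))
print(out.shape)                # torch.Size([3, 2])
```

So even if the teacher and the supernet were built from the same classes, calling `set_sample_config` with different dimensions gives them different effective architectures.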
OK, I see that now. Thank you for the quick reply.
A few more minor mistakes in this code:

- `torch.nn.DataParallel` is called twice, once in the main function and again in `train()`, so `super_model` is doubly wrapped; as a result, line 253 `super_model.module.set_sample_config()` is ineffective, because `.module` is still a `DataParallel`, not the supernet.
- In multi-GPU training this raises an error: `RuntimeError: Input, output and indices must be on the current device.`
- `train_dataset` is loaded twice.

Hope this helps you speed things up.
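The double-wrapping pitfall above is easy to reproduce in isolation. In the sketch below, `SuperModel` is a stand-in for the actual supernet (only the `set_sample_config` method name is taken from the issue); it shows why `.module` no longer reaches the supernet after wrapping twice, and how a single wrap fixes it.

```python
import torch.nn as nn


class SuperModel(nn.Module):
    """Stand-in for the supernet; set_sample_config is the method train_supernet.py calls."""

    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(4, 2)

    def set_sample_config(self, hidden_size=4):
        self.sampled_hidden = hidden_size


model = SuperModel()

# Wrapping twice nests DataParallel: .module is the inner DataParallel,
# not the SuperModel, so .module.set_sample_config() raises AttributeError.
twice = nn.DataParallel(nn.DataParallel(model))
print(hasattr(twice.module, "set_sample_config"))   # False

# Wrap once: .module is the SuperModel and set_sample_config works as intended.
once = nn.DataParallel(model)
once.module.set_sample_config(hidden_size=3)
print(once.module.sampled_hidden)                   # 3
```

Dropping one of the two `DataParallel` calls (keeping the one in the main function is the usual convention) should make the `set_sample_config` call at line 253 take effect again.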