
Some questions about training #9

Closed
zbw0329 opened this issue Sep 23, 2021 · 10 comments

Comments

@zbw0329

zbw0329 commented Sep 23, 2021

Hi! Thanks for your excellent work!
I have a machine with two 1080 Ti GPUs, and I want to train your model on CIFAR10 with resnet18.

I used the parameters shown below, but it doesn't seem to work.
[screenshot]

The program gets stuck in this state:

[screenshot]

@zbw0329
Author

zbw0329 commented Sep 23, 2021

If I use these parameters:
[screenshot]
I get this output:

[screenshot]

@akuxcw
Collaborator

akuxcw commented Sep 26, 2021

Hi, if you want to train models on CIFAR10, you have to change the dataset and the data augmentation. The loading pipelines for ImageNet and CIFAR10 are different.

@zbw0329
Author

zbw0329 commented Sep 27, 2021

What is the difference between '--resume' and '--pretrained'?
How do I evaluate my model?

@zbw0329
Author

zbw0329 commented Sep 27, 2021

I changed the dataset-loading function and it works now.
[screenshot]

But when I evaluate my model, the output tensor has shape [1, 1000] and the target tensor has shape [1].
I think this is caused by the difference between CIFAR10 and ImageNet.
Do you think I should add a softmax to the end of the model?

@akuxcw
Collaborator

akuxcw commented Sep 28, 2021

> What is the difference between '--resume' and '--pretrained'?
> How do I evaluate my model?

"--resume" also loads the optimizer state, which is used to resume training after an unexpected interruption.
You can use linear evaluation or downstream tasks to evaluate the model.
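The resume/pretrained distinction can be sketched as follows (an illustrative checkpoint layout, not necessarily this repo's exact format): resuming restores the optimizer state and epoch counter so training continues where it left off, while loading pretrained weights initializes only the model.

```python
# Sketch: checkpoint handling for "--resume" vs "--pretrained".
import torch
import torch.nn as nn

def save_checkpoint(path, model, optimizer, epoch):
    torch.save({"epoch": epoch,
                "state_dict": model.state_dict(),
                "optimizer": optimizer.state_dict()}, path)

def resume(path, model, optimizer):
    ckpt = torch.load(path, map_location="cpu")
    model.load_state_dict(ckpt["state_dict"])
    # Restoring the optimizer keeps momentum buffers and LR schedule state,
    # so an interrupted run continues seamlessly.
    optimizer.load_state_dict(ckpt["optimizer"])
    return ckpt["epoch"] + 1  # continue from the next epoch

def load_pretrained(path, model):
    # Weights only; a fresh optimizer is created for the new task.
    ckpt = torch.load(path, map_location="cpu")
    model.load_state_dict(ckpt["state_dict"])
```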

@akuxcw
Collaborator

akuxcw commented Sep 28, 2021

> But when I evaluate my model, the output tensor has shape [1, 1000] and the target tensor has shape [1].
> Do you think I should add a softmax to the end of the model?

I suspect the dataloader is wrong. You do need to change the model's classifier to fit CIFAR10, but the shapes of the output and target tensors are weird. Are you using batch_size = 1?

@zbw0329
Author

zbw0329 commented Sep 28, 2021

Yes, I set batch_size to 1 by mistake.
Thanks for your help.
I wish you success in your research.

@zbw0329 zbw0329 closed this as completed Sep 28, 2021
@zbw0329
Author

zbw0329 commented Sep 29, 2021

My Loss_clu stays the same throughout training. What could be happening?
[screenshot]

@zbw0329 zbw0329 reopened this Sep 29, 2021
@zbw0329
Author

zbw0329 commented Oct 12, 2021

Could you release the code for CIFAR10?
I notice that you have shown a CIFAR10 results table in your paper.

@akuxcw
Collaborator

akuxcw commented Oct 20, 2021

Hi, the results on CIFAR are linear evaluation / finetune results. We didn't try to pretrain models on CIFAR. The losses during training do decrease very slowly. Are your linear evaluation results on CIFAR reasonable?

@zbw0329 zbw0329 closed this as completed Nov 20, 2021