Error(s) in loading state_dict for BARTModel #8

engindeniz · 2020-12-04T10:37:44Z

I have the same problem as in #6. I installed fairseq as you suggested. Two paths are required for the pretrained model; I defined them as follows:

checkpoint_file: I downloaded your model from here.
data_name_or_path: "Multi-View-Seq2Seq/train_sh/cnn_dm-bin_2/" (It is already in the repo)

Could you suggest any solution?

jiaaoc · 2020-12-05T00:25:33Z

The errors suggest that the model classes in your installed fairseq lack some modules that the pre-trained model contains (like "w_proj"). I would suggest checking the installed fairseq.

jiaaoc · 2020-12-05T00:28:21Z

For a quick check, I would suggest randomly initializing a BART model and checking its named parameters.
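
A minimal sketch of that comparison, in case it helps. It only diffs two lists of parameter names, so it runs without fairseq; the helper name `diff_parameter_names` and the example names are hypothetical. In practice the two lists would come from something like `model.state_dict().keys()` on the freshly initialized model and from the weights stored in the checkpoint (assuming they sit under the `"model"` key of the loaded file, which you should verify for your checkpoint).

```python
def diff_parameter_names(checkpoint_names, model_names):
    """Return (unexpected, missing) parameter names, each as a sorted list.

    unexpected: names present in the checkpoint but not in the model
    missing:    names present in the model but not in the checkpoint
    """
    checkpoint_names = set(checkpoint_names)
    model_names = set(model_names)
    unexpected = sorted(checkpoint_names - model_names)
    missing = sorted(model_names - checkpoint_names)
    return unexpected, missing


# Hypothetical names for illustration only; real ones come from your
# checkpoint and from the randomly initialized model.
checkpoint_names = ["encoder.layers.0.fc1.weight", "encoder.w_proj.weight"]
model_names = ["encoder.layers.0.fc1.weight"]

unexpected, missing = diff_parameter_names(checkpoint_names, model_names)
print("in checkpoint only:", unexpected)
print("in model only:", missing)
```

If names such as "w_proj" show up only on the checkpoint side, the model class being instantiated most likely comes from a fairseq build that does not define those modules.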

jiaaoc · 2020-12-07T15:01:06Z

When I load the BART pretrained model, it loads without any error. I think the mismatch of named parameters is between your pretrained model and the BART model. Could you check it, please? By the way, I am using "Eval_Sum.ipynb".

In this case, it seems that the fairseq you are using in the notebook is not the correct one, since our model is indeed different from the original BART.
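
One way to see which fairseq the notebook actually picks up is to ask Python where the package resolves from. This is only a sketch using the standard library; `locate_module` is a hypothetical helper name, and it reports the location without importing the package.

```python
import importlib.util


def locate_module(name):
    """Return the file a top-level module would be loaded from, or None."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        # The module is not installed in the current environment.
        return None
    return spec.origin


# Run this inside the same kernel/environment as the notebook.
print("fairseq resolves to:", locate_module("fairseq"))
```

If the printed path points to a different environment or a different fairseq copy than the one you installed for this repo, the notebook is probably running against the wrong installation.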

engindeniz · 2020-12-08T10:13:10Z

I uninstalled it and installed it again. It is working now. I don't know exactly what happened before. It might have been my conda configuration.
Thank you so much for your response.

jiaaoc closed this as completed Dec 5, 2020
