-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error(s) in loading state_dict for BARTModel #8
Comments
The errors look like the models you installed from fairseq do not have some modules in the pre-trained model (like the "w_proj"). I would suggest checking the installed faieseq. |
For a quick check, I would suggest random initializing a Bart model and checking its named parameters. |
In this case, it seems that the fairseq you are using in the notebook is not correct. As our model indeed is different from original BART. |
I uninstalled and installed it again. It is working now. I don't know what happened exactly before. It might about my conda configuration. |
I have the same problem with #6. I installed fairseq as you suggested. There are required two paths for pretrained model, I defined them as follows:
checkpoint_file: I downloaded your model from here.
data_name_or_path: "Multi-View-Seq2Seq/train_sh/cnn_dm-bin_2/" (It is already in the repo)
Could you suggest any solution?
The text was updated successfully, but these errors were encountered: