Update conversion script to transformers v4.2.0 #166

Closed
wants to merge 3 commits into from

adamwawrzynski

I have updated the conversion script from transformers 3.0.2 to 4.2.0. Any suggestions and comments are welcome.

Signed-off-by: Adam Wawrzyński <adamwawrzynski@wp.pl>


HerrKrishna commented May 7, 2021

Hi Adam,

I used your script to convert a model to Longformer, and the conversion works well. But when I load the saved model, I get the warning:

Some weights of the model checkpoint at output/xlm-roberta-longformer-unsupervised/0_TransformerWrapper were not used when initializing RobertaLongModel:

followed by a list of all weights and biases in the attention layers. Also, after saving and reloading, the model performs drastically worse than it did before. Do you know how to fix this?
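One way to narrow down a warning like this is to compare the parameter names stored in the saved checkpoint with the names the model class expects. The sketch below is only an illustration under stated assumptions: the helper `diff_state_dict_keys` and every parameter name in it are hypothetical and are not taken from the conversion script.

```python
def diff_state_dict_keys(checkpoint_keys, model_keys):
    """Return (unused, missing) parameter names as sorted lists.

    unused:  names present in the checkpoint but not expected by the model
    missing: names expected by the model but absent from the checkpoint
    """
    checkpoint_keys = set(checkpoint_keys)
    model_keys = set(model_keys)
    unused = sorted(checkpoint_keys - model_keys)
    missing = sorted(model_keys - checkpoint_keys)
    return unused, missing


# Hypothetical parameter names, chosen only for illustration.
# With PyTorch, the real names would typically come from the keys of the
# saved state dict and from model.state_dict().keys().
checkpoint_keys = [
    "embeddings.weight",
    "layer.0.attention.self.query.weight",
]
model_keys = [
    "embeddings.weight",
    "layer.0.attention.self.inner.query.weight",
]

unused, missing = diff_state_dict_keys(checkpoint_keys, model_keys)
print("unused in checkpoint:", unused)
print("missing from checkpoint:", missing)
```

If the same layers show up in both lists under slightly different names, the saved and expected module paths may differ. That would mean those layers are re-initialized on load, which could also explain a drop in quality.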

adamwawrzynski closed this by deleting the head repository on Dec 8, 2022