Cannot load model after weight generation #27

Closed
R4ZZ3 opened this issue Nov 12, 2022 · 6 comments

Comments

@R4ZZ3

R4ZZ3 commented Nov 12, 2022

Hi and thanks for creating this,

I am trying to use https://huggingface.co/Finnish-NLP/roberta-large-finnish-v2?text=Moikka+olen+%3Cmask%3E+kielimalli. with this repo. I have successfully run the weight generation:

python gen_weight_roberta_like.py --lilt lilt-only-base/pytorch_model.bin --text roberta-large-finnish-v2/roberta-large-finnish-v2/pytorch_model.bin --config roberta-large-finnish-v2/roberta-large-finnish-v2/config.json --out lilt-roberta-large-finnish-v2

But when I try to load the model, I get the following error:
[image: screenshot of the error]

Do you have any idea what might cause this and how it could be fixed?

@R4ZZ3
Author

R4ZZ3 commented Nov 12, 2022

Is it ok to use roberta-large models also?

@jpWang
Owner

jpWang commented Nov 21, 2022

Hi,
Due to limited computing resources, we haven't trained a large LiLT model yet. It is being considered as future work.
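
Because only a base-size LiLT checkpoint exists, pairing it with a large text model is likely to produce mismatched shapes when the merged weights are loaded. The sketch below is one way to check this before generating weights: it compares the size fields of a text model's config.json against the dimensions assumed for the base LiLT checkpoint. The values 768 and 12 are assumptions (typical of base-size RoBERTa models, not stated in this thread), and `find_mismatches` and `load_config` are hypothetical helper names.

```python
import json

# ASSUMPTION: dimensions of the base-size LiLT checkpoint. These are not
# stated in the thread; verify them against the config shipped with the model.
ASSUMED_LILT_BASE = {"hidden_size": 768, "num_hidden_layers": 12}


def load_config(path):
    """Read a Hugging Face style config.json into a dict."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def find_mismatches(text_config, lilt_dims=ASSUMED_LILT_BASE):
    """Return {field: (text_value, lilt_value)} for every differing field."""
    mismatches = {}
    for field, expected in lilt_dims.items():
        actual = text_config.get(field)
        if actual != expected:
            mismatches[field] = (actual, expected)
    return mismatches


# Illustrative values of the kind found in a large RoBERTa config.json.
roberta_large_like = {"hidden_size": 1024, "num_hidden_layers": 24}
print(find_mismatches(roberta_large_like))
```

An empty result suggests the text model is at least the same size class as the LiLT checkpoint; any entry in the result means the two cannot simply be merged.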

jpWang closed this as completed on Nov 21, 2022

@logan-markewich

+1 for training LiLT-Large 👍🏻

@AnQueth

AnQueth commented Jun 6, 2023

Will this ever be done? I am trying to use LiLT, but some of our forms exceed the 512-token limit, and we can't simply truncate because the relevant data could be anywhere in the tokens.

@logan-markewich

@AnQueth it seems this project is mostly dead in terms of development.

My solution was to tokenize longer documents in chunks with some overlap.

The stride parameter of the Hugging Face tokenizers (named doc_stride in some example scripts), used together with return_overflowing_tokens=True, is very useful for this.
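
The overlapping-chunk idea can be sketched in plain Python as follows. This is an illustration only, not code from this repo: `chunk_with_overlap` is a hypothetical helper, and it ignores the special tokens that a real tokenizer reserves room for. As in the Hugging Face tokenizers, `stride` here means the number of tokens shared by consecutive chunks; the library equivalent is calling the tokenizer with `truncation=True`, `max_length=512`, `stride=128` and `return_overflowing_tokens=True`.

```python
def chunk_with_overlap(token_ids, boxes, max_len=512, stride=128):
    """Split parallel token and bounding-box lists into overlapping windows.

    `stride` is the number of tokens repeated between consecutive chunks,
    so each new window starts `max_len - stride` tokens after the last one.
    """
    if len(token_ids) != len(boxes):
        raise ValueError("token_ids and boxes must have the same length")
    if not 0 <= stride < max_len:
        raise ValueError("stride must satisfy 0 <= stride < max_len")

    step = max_len - stride
    chunks = []
    start = 0
    while True:
        end = start + max_len
        chunks.append({
            "start": start,  # offset of the chunk in the full document
            "input_ids": token_ids[start:end],
            "bbox": boxes[start:end],
        })
        if end >= len(token_ids):
            break
        start += step
    return chunks


# Tiny example: 10 tokens, windows of 4 with 1 token of overlap.
ids = list(range(10))
dummy_boxes = [[0, 0, 0, 0] for _ in ids]
for chunk in chunk_with_overlap(ids, dummy_boxes, max_len=4, stride=1):
    print(chunk["start"], chunk["input_ids"])
```

Keeping the `start` offset with each chunk makes it possible to map predictions back to positions in the full document and to de-duplicate predictions in the overlapping regions.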

@AnQueth

AnQueth commented Jun 6, 2023

I ran across the stride parameter 30 minutes ago, but I haven't figured out how to get around the error it causes. Do you have working code that uses it, to save me some time lol?

Couldn't cast array of type
list<item: int64>
to
int64
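
The failing code isn't shown, so the following is only a guess at the cause: with overflowing tokens enabled, each document yields a list of chunks, and a column that is expected to hold one integer sequence per row ends up holding a list of sequences. One way around that is to emit one row per chunk. The sketch below shows that flattening step in plain Python; `flatten_chunks` is a hypothetical helper. With the `datasets` library, a function like this would typically be applied through `Dataset.map` with `batched=True` and `remove_columns` set to the original column names, so that the output may contain more rows than the input.

```python
def flatten_chunks(batch):
    """Turn a batch whose cells are lists of chunks into one row per chunk.

    `batch` maps column names to lists; each cell is itself a list of
    per-chunk values (for example, several `input_ids` windows per document).
    """
    flat = {name: [] for name in batch}
    for name, column in batch.items():
        for chunks in column:
            # Each chunk becomes its own row in the flattened column.
            flat[name].extend(chunks)
    return flat


# Two documents: the first produced two chunks, the second only one.
nested = {
    "input_ids": [[[1, 2], [2, 3]], [[4, 5]]],
    "bbox": [[[[0, 0, 1, 1]] * 2, [[0, 0, 1, 1]] * 2], [[[0, 0, 1, 1]] * 2]],
}
flat = flatten_chunks(nested)
print(flat["input_ids"])
```

After flattening, every row holds a single sequence of integers, which is the shape a fixed-length model input expects.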
