From pytorch model (with hugging_face library) to XLM model #42

Open
ArthurVanSchendel opened this issue Apr 15, 2022 · 1 comment

@ArthurVanSchendel

Hello,

I currently have a problem with fine-tuning Flaubert on FLUE. I re-trained a model on custom data, so I have new weights for it, stored as a .json file and .bin files. However, when I try to fine-tune this model on the FLUE tasks, I am asked for the vocab and codes files from pretraining, which I don't have when using the Hugging Face library. I see that there is a module to go from XLM to hugging_face, but not the other way around. Is it possible to convert a model in .json and .bin format into vocab, codes and .pth files?

Or maybe there is a clever workaround for this problem?

Many thanks in advance

@formiel
Contributor

formiel commented Nov 1, 2022

Hello @ArthurVanSchendel!

I'm really sorry for the very late reply! I was on leave for several months and have been quite overwhelmed since coming back to work. I'm not sure whether this is still helpful to you, but I think the HuggingFace datasets repo offers fine-tuning on FLUE, so maybe you don't need to do the conversion?
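A minimal sketch of that route, staying entirely on the Hugging Face side: check that the re-trained checkpoint directory has what the `transformers` loaders look for, then load it with a classification head. The file names are assumptions about what `save_pretrained()` wrote for this checkpoint (they are not given in the question), and `missing_checkpoint_files`, `load_for_classification` and the default `num_labels=2` are hypothetical choices for illustration. Loading the FLUE data and the training loop are left out.

```python
from pathlib import Path

# Assumed layout of a Flaubert checkpoint saved with save_pretrained():
# a config, the tokenizer's vocabulary and BPE merges, and one weights file.
REQUIRED_FILES = ("config.json", "vocab.json", "merges.txt")
WEIGHT_FILES = ("pytorch_model.bin", "model.safetensors")


def missing_checkpoint_files(checkpoint_dir):
    """Return the names of expected files that are absent from checkpoint_dir."""
    root = Path(checkpoint_dir)
    missing = [name for name in REQUIRED_FILES if not (root / name).is_file()]
    # Either weights format is acceptable; report both names if neither exists.
    if not any((root / name).is_file() for name in WEIGHT_FILES):
        missing.append(" or ".join(WEIGHT_FILES))
    return missing


def load_for_classification(checkpoint_dir, num_labels=2):
    """Load the re-trained encoder weights with a new classification head."""
    # Imported lazily so the file check above works without transformers installed.
    from transformers import FlaubertForSequenceClassification, FlaubertTokenizer

    missing = missing_checkpoint_files(checkpoint_dir)
    if missing:
        raise FileNotFoundError(f"missing from {checkpoint_dir}: {missing}")

    tokenizer = FlaubertTokenizer.from_pretrained(checkpoint_dir)
    model = FlaubertForSequenceClassification.from_pretrained(
        checkpoint_dir, num_labels=num_labels
    )
    return tokenizer, model
```

If `missing_checkpoint_files("path/to/checkpoint")` returns an empty list, `load_for_classification` should be able to pick up the custom weights directly, so the vocab, codes and .pth files would not be needed for this route.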

Please feel free to let me know if there is something else that I can help you with.
