Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generating weights for efk/mend for new model #12

Closed
salemohamedo opened this issue Apr 17, 2022 · 1 comment
Closed

Generating weights for efk/mend for new model #12

salemohamedo opened this issue Apr 17, 2022 · 1 comment

Comments

@salemohamedo
Copy link

Hi, I was wondering if you guys can tell me how I can generate weights for distilgpt2 for the mend/efk baselines, similar to what you have for gpt2-xl here: https://rome.baulab.info/data/weights/. I'm trying to run these baselines but don't have the saved weights. I tried simply loading and saving huggingface's weights for distilgpt2 but it looks like the code is looking for something a bit different. If you guys have a script/suggestions, that would be great.

Thanks!

@kmeng01
Copy link
Owner

kmeng01 commented Apr 27, 2022

Hi @salemohamedo, you'll want to refer to the MEND repository for more information.

They have instructions for training a MEND baseline for a variety of GPT models; I don't recall whether DistilGPT is supported out-of-the-box, but I'm sure Eric can help you with this if not! After training a model, simply place the trained .pt model in baselines/mend/weights. You can consult mend_main.py for details on naming conventions, so that the code registers the new model file.

Let me know if you have further questions!

@kmeng01 kmeng01 closed this as completed Sep 21, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants