Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed GPT-125m #37

Merged
merged 2 commits into from
Jul 6, 2023
Merged

Fixed GPT-125m #37

merged 2 commits into from
Jul 6, 2023

Conversation

aksj98
Copy link
Contributor

@aksj98 aksj98 commented Jul 6, 2023

GPT-Neo on huggingface has is gpt-neo-125m , training scripts in biencoder/nli-msmacro listed it as gpt-neo-125M throwing errors while training - fixed it to say gpt-neo-125m

Hugginface doesn't have 125M, instead has 125m
@Muennighoff
Copy link
Owner

Nice! Can you leave the SGPT models with the capitalized M, since that's how they're named on the hub (e.g. SGPT-125M-mean-nli) - then we can merge it 👍

Fix SGPT-125m to M
@aksj98
Copy link
Contributor Author

aksj98 commented Jul 6, 2023

Done @Muennighoff ! Also, on a sidenote, is it possible to make public the wandb runs of your model training and share the link? I am trying to train some custom models and it would be helpful! Thank you!

@Muennighoff
Copy link
Owner

Sure they're here: https://wandb.ai/muennighoff/sgpt?workspace=

@Muennighoff Muennighoff merged commit 0b0ec10 into Muennighoff:main Jul 6, 2023
@aksj98
Copy link
Contributor Author

aksj98 commented Jul 6, 2023

Thank you, much appreciated!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants