-
Notifications
You must be signed in to change notification settings - Fork 949
GPT3XL training #109
Comments
there are some incompatibility between the tokenizers to the transformers version (it's installing the current transformers version, but the old tokenizers one).
|
@srulikbd I asked to Thomas Wolf from HF about this, and his suggestion was to use the latest version of both. Could you be more specific about the tokenizer's version issue? |
hey. it seems that it's working right now.
|
Great! Can you put these changes on a branch and open a PR? That way we can verify that it doesn’t break anything on the TPUs and merge it. |
yeah, of course. I'll do that as soon as possible. |
@StellaAthena hey. here is the output after running on google colab the GPTNEO example:
|
Where are you running this code? Are you using your own GPUs? |
I tried both GPU and tpu on colab. I tried also on aws AMI instance with V100. |
Sorry this slipped through the cracks. I assume you got everything working based on your PR? |
actually it might still not work. I saw that you are focused on gptneox, so I switched over there :) |
It's not clear to me how to train the GPT3XL via GPU/Colab.
Could you add more details?
Thank you.
The text was updated successfully, but these errors were encountered: