New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TpuEmbeddingEngine_WriteParameters not available in this library. #202
Comments
check jax version ig you are using 0.2.16 but the correct version in order to run the training is 0.2.12 |
If I run
|
Also @whoislimshady when I run it, it crashes and prints out
|
jax 0.2.12 won't run and crashes with this error. Using
works but introduces new issues. |
@nikhilanayak i also faced the same issue but somehow it got resolved by just changing version from 0.2.16 to 12 |
@whoislimshady It looks like you are using torch 1.8.1? I got 1.11.1... |
Just to be sure that everyone is on the correct page: What version of TPU-VM are you actually running? It might be that the version that people are running is actually incorrect, and that a lower version actually performs better than the newer ones. I am running with |
For fine tuning I have always used tpu version |
Here's a full set of commands to reproduce the error:
|
Okay, I fixed the issue this way:
This seemed to fix a LOT of my issues I was having, and its now working :) |
@mrseeker Your solution fixed my problem too. Thank you for sharing it! |
I followed all of the instructions in the training guide but when I run the device_train script, I get this error:
This is my exact command for the training process:
The text was updated successfully, but these errors were encountered: