New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Examples Timeout #2
Comments
Hi, I found out about the timeout flag and increased it, but now I got a different error 137 (maybe OOM), can you help me out please? Cloud Build log for torch example
|
Hi Joao, I see that these errors all use Google Cloud Build and Kaniko. There are two separate fixes you can try.
This also has the added benefit of caching builds so iterative builds become faster.
|
How do I build them locally? @andrewluchen |
Localy builds are automatically done if Docker is installed as seen in this line: Do you see anything in your output logs like:
|
Yeah was having that problem, looks like my docker wasn't initializing on boot. It worked now, thanks! |
Hi! I'm trying to use xmanager and while the setup went well all of the examples are timing out before even running the network. Any ideas what the error could be?
cifar10 pytorch log
Tensorflow take 1
Tensorflow take 2
Torch XLA
Thank you in advance for your help
The text was updated successfully, but these errors were encountered: