We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Custom training job completed successfully.
Job failed with error "Replica exited with a non-zero status code 1"
When running the notebook: https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/community-content/pytorch_text_classification_using_vertex_sdk_and_gcloud/pytorch-text-classification-vertex-ai-train-tune-deploy.ipynb, the custom training job always failed with "Replica exited with a non-zero status code" error. This error codes are potentially caused by problems in the training code.
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Expected Behavior
Custom training job completed successfully.
Actual Behavior
Job failed with error "Replica exited with a non-zero status code 1"
Steps to Reproduce the Problem
Specifications
When running the notebook: https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/community-content/pytorch_text_classification_using_vertex_sdk_and_gcloud/pytorch-text-classification-vertex-ai-train-tune-deploy.ipynb, the custom training job always failed with "Replica exited with a non-zero status code" error. This error codes are potentially caused by problems in the training code.
The text was updated successfully, but these errors were encountered: