New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What's the function of tfReplicaType: Master ? #1442
Comments
Master now is replaced by chief. |
Earlier versions of TensorFlow used "Master" to indicate the process that coordinates distributed training. TensorFlow then changed to calling this "Chief". We added "Chief" to be consistent with TensorFlow terminology; but we've kept "Master" for backwards compatibility with older versions of TF. |
As @jlewi said, Master type is still kept for backwards compatibility. https://github.com/kubeflow/tf-operator/blob/master/pkg/apis/tensorflow/v1alpha2/types.go#L135 |
@johnugeorge |
Why and when kubeflow removed Master?
In a doc, we see: The master only acts as the chief and doesn't do any training. When master started and intilized, how master waits ps and worker to completed?
The text was updated successfully, but these errors were encountered: