K8s Custom Resource and Operator For TensorFlow jobs
TFJob provides a Kubernetes custom resource that makes it easy to run distributed or non-distributed TensorFlow jobs on Kubernetes.
- For a complete reference of the custom resource definitions, please refer to the API Definition.
- For details on its design, please refer to the v1alpha2 design doc.
- For details on its obersibility, please refer to the monitoring design doc.
- Version >= 1.16 of Kubernetes
kustomize build manifests/overlays/standalone | kubectl apply -f -
Please refer to API Documentation
This is a part of Kubeflow, so please see readme in kubeflow/kubeflow to get in touch with the community.
Please refer to the developer_guide
Please refer to CHANGELOG
The following table lists the most recent few versions of the operator.
|Operator Version||API Version||Kubernetes Version|