Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Labels in Python client are out of date #1587

Closed
person142 opened this issue May 5, 2022 · 2 comments · Fixed by #1588
Closed

Labels in Python client are out of date #1587

person142 opened this issue May 5, 2022 · 2 comments · Fixed by #1588

Comments

@person142
Copy link
Contributor

person142 commented May 5, 2022

With the latest version of the Python sdk and operator, a call like

client = TFJobClient()
# ... do some stuff to launch a job
client.get_logs(<job-name>, namespace=<namespace>, follow=True)

will fail with an error like

Not found Pods of the TFJob <job-name> with the labels {'group-name': 'kubeflow.org', 'job-name': <job-name>, 'job-role': 'master'}

This is because the Python client still looks for the label job-role:

https://github.com/kubeflow/training-operator/blob/master/sdk/python/kubeflow/training/constants/constants.py#L34

whereas in kubeflow/common@6c9fe11 the label was updated to the fully-qualified training.kubeflow.org/job-role.

Some testing reveals that the 1.3 release is the last version to work with the current version of the sdk.

person142 added a commit to person142/training-operator that referenced this issue May 5, 2022
Currently certain operations like tailing logs from the Python sdk
against the latest version of the operator due to a label mistmatch;
fix that.

Closes kubeflow#1587.
@gaocegege
Copy link
Member

Thanks for the issue. Would you help fix it? I saw a commit person142@c1c155e here.

@person142
Copy link
Contributor Author

person142 commented May 6, 2022

Yes indeed - #1588 is the PR. Had to wait a bit to get approval on my end to make it.

person142 added a commit to person142/training-operator that referenced this issue Jun 2, 2022
Currently certain operations like tailing logs from the Python sdk
against the latest version of the operator due to a label mistmatch;
fix that.

Closes kubeflow#1587.
google-oss-prow bot pushed a commit that referenced this issue Jun 3, 2022
Currently certain operations like tailing logs from the Python sdk
against the latest version of the operator due to a label mistmatch;
fix that.

Closes #1587.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants