fix(deps): pin below kubernetes 36.0.0 (multiple client regressions)#511
Conversation
There was a problem hiding this comment.
Pull request overview
Note
Copilot was unable to run its full agentic suite in this review.
Updates the Python dependency constraint for the Kubernetes client to avoid known regressions in version 36.0.0.
Changes:
- Replaces an exclusion specifier (
!=36.0.0) with an upper bound (<36.0.0) forkubernetes.
Signed-off-by: tariq-hasan <mmtariquehsn@gmail.com>
2defe53 to
e0accd6
Compare
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: andreyvelich The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
@tariq-hasan Could you check why E2Es are failing ? |
|
The E2E tests are failing due to timeout. This is unrelated to the kubernetes python client error. The original error |
|
@tariq-hasan For this SDK PR, since the |
|
The notebook-level exception is a timeout, but the pod logs show the TrainJob failed earlier while loading the SQuAD dataset. The failing line is: So the TrainJob never reaches Complete, and then the notebook reports: The likely reason this started failing now is that the notebook installs unpinned Hugging Face packages: |
|
@XploY04 Thanks for the detailed analysis. It would be helpful if you open a small PR on the Kubeflow Trainer notebook for the fix. That way it would help unblock the E2E tests here as they are reading from the Trainer notebooks. |
|
Let's manually merge it |
What this PR does / why we need it:
This is a followup to #507 to pin kubernetes python client to
<36.0.0.There has been a recent
36.0.1release which has been breaking the tests again because it does not address the error withread_namespaced_pod_log.Which issue(s) this PR fixes (optional, in
Fixes #<issue number>, #<issue number>, ...format, will close the issue(s) when PR gets merged):Fixes #
Checklist: