Revisit Pdb calls during the reconciles while job is completed #824

Closed
jian-he opened this issue Sep 5, 2018 · 7 comments

Comments

@jian-he
Contributor

jian-he commented Sep 5, 2018

Since reconcileTFJob is invoked continuously, this DeletePdb code keeps being invoked even after the job has completed.

As a result, FailedDeletePdb/SuccessfulDeletePdb events keep being generated after the job completes.
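For illustration only, here is a minimal sketch of one way to keep repeated reconciles from producing repeated events: make the delete a no-op when there is nothing left to delete. This is not the operator's actual code; `pdbStore` and `deletePdbOnce` are hypothetical names, and the map stands in for whatever lookup the controller would really use.

```go
package main

import "fmt"

// pdbStore is a hypothetical in-memory stand-in for the set of existing
// PodDisruptionBudgets, keyed by "namespace/name".
type pdbStore map[string]bool

// deletePdbOnce deletes the PDB for key and returns the event reason to
// record, or "" when there is nothing to delete.
func deletePdbOnce(store pdbStore, key string) string {
	if !store[key] {
		return "" // already gone: no delete call, no event
	}
	delete(store, key)
	return "SuccessfulDeletePdb"
}

func main() {
	store := pdbStore{"default/mnist": true}
	var events []string
	// Simulate three reconcile passes after the job has completed.
	for i := 0; i < 3; i++ {
		if ev := deletePdbOnce(store, "default/mnist"); ev != "" {
			events = append(events, ev)
		}
	}
	fmt.Println(events)
}
```

With a guard like this, only the first pass after completion records an event; later passes find nothing to delete and stay silent.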

@jian-he
Contributor Author

jian-he commented Sep 5, 2018

It's fine to continuously invoke reconcileTFJob while the job is running even with no updates, but I think it's better to NOT invoke it continuously for completed jobs with no updates.

@gaocegege @johnugeorge @jlewi @ScorpioCPH, what is your opinion?

@gaocegege
Member

SGTM. I think we should definitely do it, although it is not high priority, since performance is not the primary goal of this release.

@jian-he Are you running into performance problems?

@gaocegege
Member

gaocegege commented Sep 5, 2018

IMO this is a duplicate of #804. @jian-he WDYT? We could close this one and discuss it in #804.

Other resources, such as pods and services, have the same problem.

@jlewi
Contributor

jlewi commented Sep 18, 2018

I agree that once a TFJob reaches a terminal state we no longer need to call reconcile periodically, but how would we implement that?

We rely on the informer to periodically generate update events for all TFJobs to ensure reconcile gets called periodically.
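One possible answer, sketched under assumptions: a periodic resync hands the update handler the same cached object as both "old" and "new", so their resource versions match, while a real change bumps the version. The handler could therefore drop resync events for jobs that are already terminal. The `Job` type and `shouldEnqueue` function below are hypothetical stand-ins, not the operator's types.

```go
package main

import "fmt"

// Job is a hypothetical, stripped-down stand-in for a TFJob object.
type Job struct {
	Name            string
	ResourceVersion string // changes whenever the stored object changes
	Terminal        bool   // true once the job has succeeded or failed
}

// shouldEnqueue decides whether an informer update event needs a reconcile.
// Matching resource versions are treated as a periodic resync rather than
// a real change.
func shouldEnqueue(oldJob, newJob Job) bool {
	resync := oldJob.ResourceVersion == newJob.ResourceVersion
	if resync && newJob.Terminal {
		return false // nothing changed and the job is finished
	}
	return true
}

func main() {
	running := Job{Name: "a", ResourceVersion: "7"}
	done := Job{Name: "a", ResourceVersion: "8", Terminal: true}
	fmt.Println(shouldEnqueue(running, running)) // resync of a running job
	fmt.Println(shouldEnqueue(running, done))    // real transition to terminal
	fmt.Println(shouldEnqueue(done, done))       // resync of a finished job
}
```

Running jobs still get their periodic reconcile, and the transition into a terminal state still triggers one because it is a real update. The trade-off is that cleanup for a finished job then depends on that last reconcile (or its retries) succeeding.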

@jian-he
Contributor Author

jian-he commented Sep 20, 2018

Hmm, I guess we could keep a cache that records whether a job has already completed and been reconciled once. The next time around, reconciliation can be skipped for that job.
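A rough sketch of what such a cache might look like, assuming jobs are keyed by a "namespace/name" string. All names here (`reconciledCache`, `MarkDone`, `reconcile`, and so on) are made up for illustration and are not existing operator code.

```go
package main

import (
	"fmt"
	"sync"
)

// reconciledCache remembers jobs that reached a terminal state and were
// already reconciled once. Keys are hypothetical "namespace/name" strings.
type reconciledCache struct {
	mu   sync.Mutex
	done map[string]struct{}
}

func newReconciledCache() *reconciledCache {
	return &reconciledCache{done: make(map[string]struct{})}
}

// MarkDone records that the final reconcile for key has succeeded.
func (c *reconciledCache) MarkDone(key string) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.done[key] = struct{}{}
}

// IsDone reports whether key was already reconciled after completion.
func (c *reconciledCache) IsDone(key string) bool {
	c.mu.Lock()
	defer c.mu.Unlock()
	_, ok := c.done[key]
	return ok
}

// Forget drops key, e.g. when the job is deleted, so the map does not
// grow without bound.
func (c *reconciledCache) Forget(key string) {
	c.mu.Lock()
	defer c.mu.Unlock()
	delete(c.done, key)
}

// reconcile runs work unless the job is terminal and already handled.
// It reports whether work was run.
func reconcile(c *reconciledCache, key string, terminal bool, work func() error) (bool, error) {
	if terminal && c.IsDone(key) {
		return false, nil
	}
	if err := work(); err != nil {
		return true, err // not marked done, so the next pass retries
	}
	if terminal {
		c.MarkDone(key)
	}
	return true, nil
}

func main() {
	c := newReconciledCache()
	calls := 0
	work := func() error { calls++; return nil }
	reconcile(c, "default/mnist", false, work) // job still running
	reconcile(c, "default/mnist", true, work)  // first pass after completion
	reconcile(c, "default/mnist", true, work)  // skipped by the cache
	fmt.Println("work ran", calls, "times")
}
```

The cache is in-memory only, so after a controller restart each completed job would be reconciled once more before being skipped again.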

@johnugeorge
Member

This should be fixed in #1012.
Closing the issue.
/close

@k8s-ci-robot

@johnugeorge: Closing this issue.
