controller should reconcile the status of PodGroup #166

Huang-Wei · 2021-03-19T20:13:19Z

In the latest code, the controller doesn't quite reconcile the status of PodGroup:

Field status.Scheduled not displayed - fixed by controller should reconcile the status of PodGroup #166

If a PodGroup is created, we'd expect all status fields to display properly. But for now it's only status.phase.
```
$ k get pg pg1 -o yaml 
...
status:
  phase: pending
```
status.phase not reconciled well

If a PodGroup doesn't have any associated Pods, it should stay in Pending state. However, it sometimes stays in Running (or other state). We's expect status to be always reconciled.

/kind bug

The text was updated successfully, but these errors were encountered:

Huang-Wei · 2021-03-19T20:15:09Z

@cwdsuzhou @denkensk could you take some time to look into the above 2 issues?

cwdsuzhou · 2021-03-21T02:17:09Z

I will open a PR to resolve these issues

cwdsuzhou · 2021-03-21T02:17:21Z

/assign

cwdsuzhou · 2021-03-30T12:47:44Z

@Huang-Wei

to resolve the 1st issue, we just need remove omitempty of these fields.

about the 2nd issue, I have different opinions. If a group has been Running , it should not go back to Pending again. IMO, we should covert it to Success, Failed or Unknown. If no associated Pods exist, Unknown would be better. For other status, conversion should be similar.

WDYT?

Huang-Wei · 2021-03-30T17:04:39Z

to resolve the 1st issue, we just need remove omitempty of these fields.

Thanks. Please raise a PR fixing that.

IMO, we should covert it to Success, Failed or Unknown. If no associated Pods exist, Unknown would be better. For other status, conversion should be similar.

It sounds strange to use different states to show whether a PodGroup has been scheduled successfully once or not. Shouldn't the state just stick to its actual state, in a "stateless" manner?

cwdsuzhou · 2021-03-31T04:05:09Z

to resolve the 1st issue, we just need remove omitempty of these fields.

Thanks. Please raise a PR fixing that.

This does not work. I raise a PR, just add default value to status

cwdsuzhou · 2021-03-31T04:09:04Z

It sounds strange to use different states to show whether a PodGroup has been scheduled successfully once or not. Shouldn't the state just stick to its actual state, in a "stateless" manner?

Usually, a PG would be related to a job, from the perspective of a job, job status should not be from Running to Pending

Huang-Wei · 2021-03-31T19:41:47Z

/reopen
as the 2nd item hasn't been resolved.

I think the workflow of how status changes need to rethink a bit. Probably it should be accompanied by informational messages, so that users can know what phase a PodGropu is in, for what reason.

k8s-ci-robot · 2021-03-31T19:41:52Z

@Huang-Wei: Reopened this issue.

In response to this:

/reopen
as the 2nd item hasn't been resolved.

I think the workflow of how status changes need to rethink a bit. Probably it should be accompanied by informational messages, so that users can know what phase a PodGropu is in, for what reason.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

fejta-bot · 2021-06-29T20:10:09Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

k8s-triage-robot · 2021-07-29T21:00:42Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten

Huang-Wei · 2021-08-02T19:35:05Z

/remove-lifecycle rotten

JaneLiuL · 2021-09-07T06:55:36Z

/assign I would like to have a try.

JaneLiuL · 2021-09-07T07:44:32Z

Is it scenario as below is correct? Actually I am really confuse about the PodGroup status. If my understand is correct, I would like to create a doc to describe first. Then I would like to fix the code issue for it :)
@Huang-Wei

Case 1:
Create podGroup minMember=2 , podGroup.status.Phase=pending
PodNum=0 , podGroup.status.Phase=pending
PodNum=2 (Pod all success) , podGroup.status.Phase=pending-->PreScheduling-->Scheduling-->Finished-->Scheduled
PodNum=1 (Pod all success) , podGroup.status.Phase=Scheduled-->Failed
PodNum=0 , podGroup.status.Phase=Failed-->Pending

Case 2:
Create podGroup minMember=2 , podGroup.status.Phase=pending
PodNum=0 , podGroup.status.Phase=pending
PodNum=2 (Pod all fail to start, like imagePullFail) , podGroup.status.Phase=pending--->PreScheduling-->Scheduling-->Failed
PodNum=2 (Pod all success) , podGroup.status.Phase=Failed--->PreScheduling-->Scheduling-->Scheduled
PodNum=0 , podGroup.status.Phase=Scheduled-->Pending

k8s-triage-robot · 2021-12-06T08:29:51Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-ci-robot added the kind/bug Categorizes issue or PR as related to a bug. label Mar 19, 2021

Huang-Wei mentioned this issue Mar 19, 2021

Update install doc to match release v0.19.8 #163

Merged

k8s-ci-robot assigned cwdsuzhou Mar 21, 2021

cwdsuzhou mentioned this issue Mar 31, 2021

Add default value to pg status #170

Merged

k8s-ci-robot closed this as completed in #170 Mar 31, 2021

k8s-ci-robot reopened this Mar 31, 2021

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 29, 2021

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jul 29, 2021

k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Aug 2, 2021

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 6, 2021

lianghao208 mentioned this issue Dec 7, 2021

Fix controller reconcile PodGroup status #308

Merged

k8s-ci-robot closed this as completed in #308 Dec 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

controller should reconcile the status of PodGroup #166

controller should reconcile the status of PodGroup #166

Huang-Wei commented Mar 19, 2021 •

edited

Loading

Huang-Wei commented Mar 19, 2021

cwdsuzhou commented Mar 21, 2021

cwdsuzhou commented Mar 21, 2021

cwdsuzhou commented Mar 30, 2021

Huang-Wei commented Mar 30, 2021

cwdsuzhou commented Mar 31, 2021

cwdsuzhou commented Mar 31, 2021

Huang-Wei commented Mar 31, 2021

k8s-ci-robot commented Mar 31, 2021

fejta-bot commented Jun 29, 2021

k8s-triage-robot commented Jul 29, 2021

Huang-Wei commented Aug 2, 2021

JaneLiuL commented Sep 7, 2021

JaneLiuL commented Sep 7, 2021 •

edited

Loading

k8s-triage-robot commented Dec 6, 2021

controller should reconcile the status of PodGroup #166

controller should reconcile the status of PodGroup #166

Comments

Huang-Wei commented Mar 19, 2021 • edited Loading

Huang-Wei commented Mar 19, 2021

cwdsuzhou commented Mar 21, 2021

cwdsuzhou commented Mar 21, 2021

cwdsuzhou commented Mar 30, 2021

Huang-Wei commented Mar 30, 2021

cwdsuzhou commented Mar 31, 2021

cwdsuzhou commented Mar 31, 2021

Huang-Wei commented Mar 31, 2021

k8s-ci-robot commented Mar 31, 2021

fejta-bot commented Jun 29, 2021

k8s-triage-robot commented Jul 29, 2021

Huang-Wei commented Aug 2, 2021

JaneLiuL commented Sep 7, 2021

JaneLiuL commented Sep 7, 2021 • edited Loading

k8s-triage-robot commented Dec 6, 2021

Huang-Wei commented Mar 19, 2021 •

edited

Loading

JaneLiuL commented Sep 7, 2021 •

edited

Loading