joelanford
Member

No description provided.

@openshift-ci-robot openshift-ci-robot added jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. jira/valid-bug Indicates that a referenced Jira bug is valid for the branch this PR is targeting. labels Feb 3, 2025
@openshift-ci-robot

@joelanford: This pull request references Jira Issue OCPBUGS-49683, which is valid.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target version (4.19.0) matches configured target version for branch (4.19.0)
  • bug is in the state POST, which is one of the valid states (NEW, ASSIGNED, POST)

Requesting review from QA contact:
/cc @Xia-Zhao-rh

The bug has been updated to refer to the pull request using the external bug tracker.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

Contributor

openshift-ci bot commented Feb 3, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: joelanford

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 3, 2025
@joelanford joelanford changed the title from "OCPBUGS-49683: CA cert and catalog cache bug fixes (round 2)" to "OCPBUGS-49683: cache bug fixes (round 2)" Feb 3, 2025
Contributor

openshift-ci bot commented Feb 3, 2025

@joelanford: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| ci/prow/okd-scos-e2e-aws-ovn | bbb79e1 | link | false | /test okd-scos-e2e-aws-ovn |

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@openshift-merge-robot openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 3, 2025
oceanc80 and others added 3 commits February 3, 2025 13:06
…#1669)

Signed-off-by: Per Goncalves da Silva <pegoncal@redhat.com>
…#1677)

* Rename util package name and file

Signed-off-by: Per Goncalves da Silva <pegoncal@redhat.com>

* Refactor and add missing unit tests

Signed-off-by: Per Goncalves da Silva <pegoncal@redhat.com>

---------

Signed-off-by: Per Goncalves da Silva <pegoncal@redhat.com>
Co-authored-by: Per Goncalves da Silva <pegoncal@redhat.com>
…e perms prior to delete (#1691)

Signed-off-by: Joe Lanford <joe.lanford@gmail.com>
@openshift-merge-robot openshift-merge-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 3, 2025
@joelanford
Member Author

/payload-aggregate periodic-ci-openshift-release-master-ci-4.19-e2e-aws-upgrade-ovn-single-node 10
/payload-aggregate periodic-ci-openshift-release-master-ci-4.19-e2e-aws-ovn-upgrade 10
/payload-job periodic-ci-openshift-release-master-ci-4.19-e2e-aws-ovn-upgrade-out-of-change

Contributor

openshift-ci bot commented Feb 3, 2025

@joelanford: trigger 3 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command

  • periodic-ci-openshift-release-master-ci-4.19-e2e-aws-ovn-upgrade-out-of-change
  • periodic-ci-openshift-release-master-ci-4.19-e2e-aws-upgrade-ovn-single-node
  • periodic-ci-openshift-release-master-ci-4.19-e2e-aws-ovn-upgrade

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/62f2dad0-e262-11ef-981b-802b61a5ede3-0

@joelanford
Member Author

/hold
until we have some signal from some extra payload jobs.

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Feb 3, 2025
@neisw
Contributor

neisw commented Feb 4, 2025

Looks like we still have catalogd failures in the single-node (SNO) job.

@joelanford
Member Author

I looked at the first failure's must-gather.

Looking at the openshift-catalogd ReplicaSets, the older one appears to have been running the commit that we reverted. The newer ReplicaSet shows the expected pod spec from main, and its pod in the must-gather has no previous logs. However, I do see one restart of the new pod, and without the previous pod log it's hard to tell why.

Does "[sig-architecture] platform pods in ns/openshift-catalogd should not exit an excessive amount of times" capture restarts before and after the upgrade, or just after? I suspect it's counting failures from the known bad commit that was on main prior to the revert.

@joelanford
Member Author

Doh! It's right there! The log output of "[sig-architecture] platform pods in ns/openshift-catalogd should not exit an excessive amount of times" shows that the failures come from the old ReplicaSet's pod, which was running the code we reverted. That test must have picked up a "from" payload version that still had our buggy code.

Cluster version history shows the "from" version was 4.19.0-0.ci-2025-02-03-060046, and the catalogd commit there was c20afa6.

Once there's a new accepted CI payload with our revert, we should kick off the payload jobs again.
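
The attribution reasoning above (deciding whether the exits belong to the old ReplicaSet or the new one) could be sketched roughly as follows. This is only an illustration: the function names, the tuple layout, and the sample data are all hypothetical, and it does not reflect how the sig-architecture test actually counts exits.

```python
from collections import Counter


def exits_by_replicaset(exit_events):
    """Count container exits per owning ReplicaSet.

    exit_events is an iterable of (replicaset_name, pod_name) tuples,
    one tuple per observed container exit (hypothetical layout).
    """
    return Counter(replicaset for replicaset, _pod in exit_events)


def exits_excluding(exit_events, known_bad_replicasets):
    """Return per-ReplicaSet exit counts, dropping ReplicaSets known to run bad code."""
    counts = exits_by_replicaset(exit_events)
    return {rs: n for rs, n in counts.items() if rs not in known_bad_replicasets}


# Made-up sample data: three exits from the pre-upgrade ReplicaSet,
# one restart from the post-upgrade ReplicaSet.
events = [
    ("catalogd-old", "catalogd-old-abc"),
    ("catalogd-old", "catalogd-old-abc"),
    ("catalogd-old", "catalogd-old-abc"),
    ("catalogd-new", "catalogd-new-xyz"),
]

print(exits_by_replicaset(events))
print(exits_excluding(events, {"catalogd-old"}))
```

Grouping by owner first makes it easy to see whether an "excessive exits" total is dominated by pods from the "from" payload rather than by the code under test.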

@tmshort
Contributor

tmshort commented Feb 4, 2025

I'm seeing the excessive restarts in #260 as well; I might need to wait until this merges.

@tmshort
Contributor

tmshort commented Feb 14, 2025

@joelanford this can probably be closed now that #266 finally merged from upstream?

@openshift-merge-robot openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 14, 2025
@openshift-merge-robot
Contributor

PR needs rebase.


@joelanford joelanford closed this Feb 14, 2025
@openshift-ci-robot

@joelanford: This pull request references Jira Issue OCPBUGS-49683. The bug has been updated to no longer refer to the pull request using the external bug tracker.


Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. jira/valid-bug Indicates that a referenced Jira bug is valid for the branch this PR is targeting. jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD.
7 participants