Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug 1804284: Revert "RHCOS: bump to 44.81.202002071430-0" #3117

Conversation

miabbott
Copy link
Member

@miabbott miabbott commented Feb 17, 2020

This reverts commit 5654e02.

Seeing reports via Slack that networking is not coming up properly on masters. This is preventing bare metal installs from completing successfully.

See: https://bugzilla.redhat.com/show_bug.cgi?id=1803926

NOTE: This is not a fix for the issue but a return to a previously working state for ipv4 usage on bare metal.

@openshift-ci-robot openshift-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Feb 17, 2020
@hardys
Copy link
Contributor

hardys commented Feb 17, 2020

Thanks @miabbott - I've reproduced the issue and am attempting to get some clues as to what's going wrong - it seems the external network doesn't come up on the masters, so they get stuck trying to download the ignition config from the bootstrap VM - reverting to the previous image does solve that problem

@hardys
Copy link
Contributor

hardys commented Feb 17, 2020

I tested a modified image, and can confirm that just reverting the ignition_network_kcmdline to ip=dhcp does fix the problem on ipv4 - obviously we then still have issues with ipv6 (not yet tested the latest image in an ipv6 setup yet)

@miabbott
Copy link
Member Author

/test e2e-metal
/test e2e-azure
/test e2e-vsphere
/test e2e-gcp

@ashcrow
Copy link
Member

ashcrow commented Feb 17, 2020

Makes sense to roll back for the moment ... but I'm a little confused as I would think this would have happened as well for "ipv4 only" based on the dracut documentation.

@ashcrow
Copy link
Member

ashcrow commented Feb 17, 2020

/test e2e-aws
/test e2e-ovirt

@ashcrow
Copy link
Member

ashcrow commented Feb 17, 2020

/retest

@sdodson
Copy link
Member

sdodson commented Feb 17, 2020

/approve
/lgtm
/hold
Clear the hold when RHCOS folks are happy with this.

@openshift-ci-robot openshift-ci-robot added do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. lgtm Indicates that a PR is ready to be merged. labels Feb 17, 2020
@openshift-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: sdodson

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 17, 2020
@ashcrow
Copy link
Member

ashcrow commented Feb 17, 2020

could not wait for build: the build openstack-installer failed after 7m1s with reason PushImageToRegistryFailed: Failed to push the image to the registry.

@ashcrow
Copy link
Member

ashcrow commented Feb 17, 2020

/retest

@ashcrow
Copy link
Member

ashcrow commented Feb 17, 2020

level=fatal msg="failed to fetch Cluster: failed to generate asset \"Cluster\": failed to create cluster: failed to apply using Terraform"

and

Throttling: Rate exceeded\n\tstatus code: 400,

@eranco74
Copy link
Contributor

Seems to work 👍

@sdodson
Copy link
Member

sdodson commented Feb 18, 2020

Critical platforms have all passed, gcp, metal, aws. Azure is an e2e flake.
/bugzilla refresh
/hold cancel

@openshift-ci-robot
Copy link
Contributor

@sdodson: No Bugzilla bug is referenced in the title of this pull request.
To reference a bug, add 'Bug XXX:' to the title of this pull request and request another bug refresh with /bugzilla refresh.

In response to this:

Critical platforms have all passed, gcp, metal, aws. Azure is an e2e flake.
/bugzilla refresh
/hold cancel

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot openshift-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Feb 18, 2020
@sdodson
Copy link
Member

sdodson commented Feb 18, 2020

/refresh

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

1 similar comment
@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@ashcrow
Copy link
Member

ashcrow commented Feb 18, 2020

 level=fatal msg="Bootstrap failed to complete: failed to wait for bootstrapping to complete: timed out waiting for the condition" 

@sdodson
Copy link
Member

sdodson commented Feb 18, 2020

@miabbott @ashcrow Can we get a bug for the reported problems?

@ashcrow
Copy link
Member

ashcrow commented Feb 18, 2020

failed to acquire lease: resources not found

@ashcrow
Copy link
Member

ashcrow commented Feb 18, 2020

@sdodson The original one or the large amount of CI issues? 😄 The former has a bug and we'll update the PR to denote it.

Updated 👍

@ashcrow ashcrow changed the title Revert "RHCOS: bump to 44.81.202002071430-0" Bug1803926: Revert "RHCOS: bump to 44.81.202002071430-0" Feb 18, 2020
@ashcrow
Copy link
Member

ashcrow commented Feb 18, 2020

level=info msg="Cluster operator kube-controller-manager Progressing is True with NodeInstaller: NodeInstallerProgressing: 2 nodes are at revision 0; 1 nodes are at revision 3; 0 nodes have achieved new revision 6"

@sdodson
Copy link
Member

sdodson commented Feb 18, 2020

/bugzilla refresh

@openshift-ci-robot
Copy link
Contributor

@sdodson: No Bugzilla bug is referenced in the title of this pull request.
To reference a bug, add 'Bug XXX:' to the title of this pull request and request another bug refresh with /bugzilla refresh.

In response to this:

/bugzilla refresh

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@ashcrow
Copy link
Member

ashcrow commented Feb 18, 2020

/test e2e-ovirt
/test e2e-vsphere

@sdodson sdodson changed the title Bug1803926: Revert "RHCOS: bump to 44.81.202002071430-0" Bug 1803926: Revert "RHCOS: bump to 44.81.202002071430-0" Feb 18, 2020
@openshift-ci-robot openshift-ci-robot added the bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. label Feb 18, 2020
@openshift-ci-robot
Copy link
Contributor

@miabbott: This pull request references Bugzilla bug 1803926, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

In response to this:

Bug 1803926: Revert "RHCOS: bump to 44.81.202002071430-0"

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot
Copy link
Contributor

@miabbott: This pull request references Bugzilla bug 1803926, which is valid.

In response to this:

Bug 1803926: Revert "RHCOS: bump to 44.81.202002071430-0"

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@ashcrow
Copy link
Member

ashcrow commented Feb 18, 2020

Eh sorry @sdodson, forgot the space.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@miabbott
Copy link
Member Author

Err...I don't think we want to link this PR to that BZ. Merging this PR doesn't fix the root cause of that BZ, which we want to get to.

I'm going to create a separate BZ to indicate we need to unblock CI and appease the BZ gods.

@miabbott
Copy link
Member Author

/retitle Bug 1804284: Revert "RHCOS: bump to 44.81.202002071430-0"

@openshift-ci-robot openshift-ci-robot changed the title Bug 1803926: Revert "RHCOS: bump to 44.81.202002071430-0" Bug 1804284: Revert "RHCOS: bump to 44.81.202002071430-0" Feb 18, 2020
@openshift-ci-robot
Copy link
Contributor

@miabbott: This pull request references Bugzilla bug 1804284, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

In response to this:

Bug 1804284: Revert "RHCOS: bump to 44.81.202002071430-0"

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-merge-robot openshift-merge-robot merged commit 3a140b3 into openshift:master Feb 18, 2020
@openshift-ci-robot
Copy link
Contributor

@miabbott: All pull requests linked via external trackers have merged. Bugzilla bug 1804284 has been moved to the MODIFIED state.

In response to this:

Bug 1804284: Revert "RHCOS: bump to 44.81.202002071430-0"

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot
Copy link
Contributor

@miabbott: The following tests failed, say /retest to rerun all failed tests:

Test name Commit Details Rerun command
ci/prow/e2e-vsphere 1951df7 link /test e2e-vsphere
ci/prow/e2e-libvirt 1951df7 link /test e2e-libvirt
ci/prow/e2e-aws-scaleup-rhel7 1951df7 link /test e2e-aws-scaleup-rhel7

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. lgtm Indicates that a PR is ready to be merged. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

8 participants