Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

*: conditionally populate the images configmap with cluster-etcd-operator image #1288

Merged
merged 4 commits into from Jan 22, 2020

Conversation

alaypatel07
Copy link
Contributor

@alaypatel07 alaypatel07 commented Nov 24, 2019

This adds a flag to MCO for cluster-etcd-operator image and hardcodes
the value to "".

This PR conditionally populates the cluster-etcd-operator image only
if the etcd custom resource exists and if it has a spec field set as
managementState: Managed. It is the same mechanism used
in the installer. This will make sure that the MC rendered during the initial.
boot with the value coming in from the installer flag is always same
as one generate in the sync loop.

With this, we can use a single knob in cluster-etcd-operator
project to test the bootstrap process in CI with cluster-etcd-operator
enabled.

The existence of cluster-etcd-operator image is used to render static
etcd pod yaml. This PR will be followed by another PR which introduces
a boolean flag to be used for conditional rendering of the static pod image.
It will be a more readable way of achieving the same means.

@openshift-ci-robot openshift-ci-robot added the size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. label Nov 24, 2019
@alaypatel07 alaypatel07 changed the title *: install *: enable cluster-etcd-operator Nov 24, 2019
@alaypatel07
Copy link
Contributor Author

/hold testing this manually

@openshift-ci-robot openshift-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 24, 2019
@kikisdeliveryservice
Copy link
Contributor

/skip

@hexfusion
Copy link
Contributor

/retest

@kikisdeliveryservice
Copy link
Contributor

at a quick glance this seems to be failing on masters only so prob the pr? seems etcd server is taking too long: https://storage.googleapis.com/origin-ci-test/pr-logs/pull/openshift_machine-config-operator/1288/pull-ci-openshift-machine-config-operator-master-e2e-aws/5908/artifacts/e2e-aws/pods/openshift-etcd_etcd-member-ip-10-0-137-114.ec2.internal_etcd-member.log

@openshift-ci-robot openshift-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Dec 7, 2019
@alaypatel07 alaypatel07 changed the title *: enable cluster-etcd-operator [WIP]: *: enable cluster-etcd-operator Dec 7, 2019
@openshift-ci-robot openshift-ci-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Dec 7, 2019
@alaypatel07 alaypatel07 changed the title [WIP]: *: enable cluster-etcd-operator *: enable cluster-etcd-operator Dec 7, 2019
@openshift-ci-robot openshift-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Dec 7, 2019
@openshift-ci-robot openshift-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Dec 7, 2019
@alaypatel07 alaypatel07 force-pushed the enable_ceo branch 2 times, most recently from da8bbf6 to 8d9a25c Compare December 7, 2019 16:02
@openshift-ci-robot openshift-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Dec 7, 2019
@alaypatel07
Copy link
Contributor Author

clusterrolebinding.rbac.authorization.k8s.io/system:openshift:controller:resourcequota-controller
E1207 16:34:33.205040       1 watcher.go:214] watch chan error: etcdserver: mvcc: required revision has been compacted
E1207 16:36:17.108350       1 watcher.go:214] watch chan error: etcdserver: mvcc: required revision has been compacted
E1207 16:36:36.023379       1 watcher.go:214] watch chan error: etcdserver: mvcc: required revision has been compacted

Etcd was up but openshift-apiserver had the above error messages, compacted data store, trying again

/test e2e-aws

@openshift-ci-robot openshift-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Dec 9, 2019
@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-ci-robot openshift-ci-robot removed the lgtm Indicates that a PR is ready to be merged. label Jan 13, 2020
@hexfusion
Copy link
Contributor

hexfusion commented Jan 13, 2020

level=error msg="Error: Error waiting for AMI (ami-0123e5f5f322fe54f) to be ready: timeout while waiting for state to become 'available' (last state: 'pending', timeout: 40m0s)"

/retest

@hexfusion
Copy link
Contributor

level=fatal msg="failed to fetch Cluster: failed to generate asset "Cluster": failed to create cluster: failed to apply using Terraform"

/test e2e-aws-scaleup-rhel7

@alaypatel07
Copy link
Contributor Author

digging into the error e2e-gcp-op

/hold
/wip

@openshift-ci-robot openshift-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jan 13, 2020
@alaypatel07
Copy link
Contributor Author

/test e2e-gcp-op

@hexfusion
Copy link
Contributor

hexfusion commented Jan 18, 2020

level=error msg="Error: Error applying IAM policy for project "openshift-gce-devel-ci": Error setting IAM policy for project "openshift-gce-devel-ci": googleapi: Error 400: The number of members in the policy (1,501) is larger than the maximum allowed size 1,500., badRequest"

/retest

EtcdInformer is only used by the MCO pod to reconcile image names.
This pulls out the logic of creating etcd informer from the generic
controller context to MCO pod start method.
@hexfusion
Copy link
Contributor

@hexfusion
Copy link
Contributor

/joke

@openshift-ci-robot
Copy link
Contributor

@hexfusion: A red and a blue ship have just collided in the Caribbean. Apparently the survivors are marooned.

In response to this:

/joke

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@hexfusion
Copy link
Contributor

/lgtm

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Jan 22, 2020
@openshift-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: alaypatel07, hexfusion, kikisdeliveryservice

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [kikisdeliveryservice]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@hexfusion
Copy link
Contributor

/hold cancel

@openshift-ci-robot openshift-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jan 22, 2020
@hexfusion
Copy link
Contributor

--- FAIL: TestKernelArguments (448.05s)
mcd_test.go:259: Created kargs-1ac503cd-b15f-43e7-9316-403fe39db164
mcd_test.go:115: Pool worker has rendered config kargs-1ac503cd-b15f-43e7-9316-403fe39db164 with rendered-worker-98c738161d53257c99ec59c65bcaa218 (waited 4.007773128s)
mcd_test.go:263: pool worker didn't report updated to rendered-worker-98c738161d53257c99ec59c65bcaa218: Get https://api.ci-op-7jwmbk77-1354f.origin-ci-int-gce.dev.openshift.com:6443/apis/machineconfiguration.openshift.io/v1/machineconfigpools/worker: dial tcp 35.185.78.239:6443: connect: connection refused

New, I have not see this... last round passed

/test e2e-gcp-op

@openshift-ci-robot
Copy link
Contributor

openshift-ci-robot commented Jan 22, 2020

@alaypatel07: The following test failed, say /retest to rerun all failed tests:

Test name Commit Details Rerun command
ci/prow/e2e-vsphere 6f850a18762ee5a1c6a4d1d709fc47938faec5b8 link /test e2e-vsphere

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-merge-robot openshift-merge-robot merged commit d7e308d into openshift:master Jan 22, 2020
@vrutkovs
Copy link
Member

/cherrypick fcos

@openshift-cherrypick-robot

@vrutkovs: new pull request created: #1413

In response to this:

/cherrypick fcos

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. lgtm Indicates that a PR is ready to be merged. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet