[release-4.10] Fast-Forward from main #1233

alvaroaleman · 2022-04-05T13:31:53Z

What this PR does / why we need it:

Which issue(s) this PR fixes (optional, use fixes #<issue_number>(, fixes #<issue_number>, ...) format, where issue_number might be a GitHub issue, or a Jira story:
Fixes #

Checklist

Subject and description added to both, commit and PR.
Relevant issues have been referenced.
This change includes docs.
This change includes unit tests.

This is completely expected during deletion, logging it at error level makes it look like an issue, which it is not.

HO: Don't report NotFund for hostedcluster as error

The KAS needs the proxy settings to communicate with the cloud provider. However, the egress transport it uses wraps another transport that respects proxy settings which is why we need to excempt pod and service CIDR of the guest cluster to not break Konnektivity. I also tried to stop using the egress config and use the konnektivity-socks5-proxy, but that breaks SPDY connections (exec, port-forward). Ref https://issues.redhat.com/browse/HOSTEDCP-333

KAS: Set proxy, but exempt pod and service CIDR

add external-dns flags to CI install make target

sync MaxConcurrentReconciles across all controllers

enable external-dns registry

HostedCluster can optionally reference a configmap, in which case we copy the configmap to the HostedControlPlane namespace (similar to SSHKey and other fields).

When AdditionalTrustBundle is defined we create this ConfigMap to align with the behavior of regular OCP clusters and enable consumption of user-defined CA certs by the guest cluster.

When AdditionalTrustBundle is specified, we serialize the configmap and pass to the MCO bootstrap command via the default user-ca-bundle-config.yaml location - this means the MCO bootstrap will read the file when included, (the code already ignores the case where the file doesn't exist, since openshift/installer only conditionally creates the manifest)

This can be used to reference a ConfigMap that contains a user CA bundle.

The CPO and ignition server need the user CA so the registryclient can access a local registry with a self-signed cert

Adds a CLI option and corresponding volume to the operator pod, this is needed so the operator can look up release image metadata when the release image specified is locally mirrored. Note the mount path/filename were chosen to align with the expected defaults ref https://go.dev/src/crypto/x509/root_linux.go (and also current OCP docs for cert injection using operators)

In its current state, the hosted cluster config operator overwrites any changes made by the guest cluster admin to the registry configuration. This prevents changes like enabling a route or increasing the number of replicas. This commit limits what we change to things we need to change and leave everything else as is.

Currently, dump just drops a lot of files. This is useful for browsing them in the CI job output, but terrible for downloading them for local inspection, as downloading a lot of files is extremely slow, even if the files aren't big. This change makes us always create an archive of the dump to not require extending every CI job to do this manually.

The version we currently use can not compile anything and fails with errors like this: could not load export data: cannot import "math/bits" (unknown iexport format version 2), export data is newer version - update tool (compile) Note that this doesn't mean staticcheck supports generics, it just means it can be compiled with go 1.18.

Signed-off-by: David Vossel <davidvossel@gmail.com>

Registry configuration: reconcile only what we need to changes

…s-v1 Unique OpenShift vxlan port for KubeVirt Platform

Update staticcheck to a version that works with go 1.18

Dump: Always create an archive

This upgrades mkdocs/material to fix Netlify docs compilation breakages resulting from mkdocs/mkdocs#2799.

These components watch both management cluster (Machine scalable resources) and guest cluster. Originally we were pinning the images to a version that would cross any HostedCluster. This PR let us pick them from each particular payload resulting in some benefits: Each hostedCluster runs the component version that was tested with that particular kube/ocp version No additional work needed to productise the images as they com from the payload. Since CAPI CRDs should be backward compatible, having different controller versions shouldn't cause an issue. Once the CAPI image is in the payload we can do the same for it.

docs: Upgrade mkdocs/material to fix Netlify breakages

Signed-off-by: David Vossel <davidvossel@gmail.com>

…rom-payload read apiserver-network-proxy image from ocp payload

The single-hyphen flags do not work anymore due to operator-framework/operator-lifecycle-manager#2362

Fix CPO to work with 4.11

Signed-off-by: David Vossel <davidvossel@gmail.com>

Before this commit, EIP tagging failures resulting from the EIP not being found after the EIP was successfully created led to infra creation failing overall because the tagging operation was not retried. This commit adds retry logic to EIP tagging to account for the case when EIP creation succeeds but tagging fails because the AWS tagging API doesn't yet see the new EIP.

Retry EIP tagging failures during infra creation

…y-v1 AntiAffinity rules to spread KubeVirt VMs across mgmt nodes

Signed-off-by: David Vossel <davidvossel@gmail.com>

…ss-v1 Document KubeVirt Platform Ingress Setup

The olm cronjob had a prioryClass of openshift-user-critical which has a priority that is above all other controlplane components in the management cluster. Downgrade it to the standard hypershift-control-plane and add an e2e test that verifies that no pod has a priority higher than the etcd priority.

Get autoscaler/machine-approver images from the payload

Before this commit, calls to `WaitForConditionsOnHostedControlPlane()` could fail a test if an API lookup fails even though that lookup is recoverable and retried automatically. This made the test flaky. This commit fixes the code so that these retriable errors are logged but do not fail the test. This commit also moves a log message which was intended to emit during retries but was instead placed at the exit point.

…lane component

Before this commit, UWM was enabled by the e2e `setup` command, which was used in the past but is no longer used. The UWM stack is thus wasting resources on management clusters used for e2e runs. This commit removes UWM from the monitoring setup for e2e tests.

Hypershift operator: Give a priority that is higher than any controlplane component

e2e: Don't fail test on transient recoverable API lookup

Fix priority class for olm cronjob and verify priorityclasses in e2e

e2e: Don't enable user workload monitoring on management clusters

openshift-ci · 2022-04-05T13:31:56Z

@alvaroaleman: No Bugzilla bug is referenced in the title of this pull request.
To reference a bug, add 'Bug XXX:' to the title of this pull request and request another bug refresh with /bugzilla refresh.

In response to this:

[release-4.10] Fast-Forward from main

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

openshift-ci · 2022-04-05T13:34:03Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: alvaroaleman

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [alvaroaleman]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci · 2022-04-05T14:25:55Z

@alvaroaleman: all tests passed!

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

sjenning · 2022-04-06T01:56:33Z

/lgtm

sjenning and others added 30 commits March 17, 2022 14:44

add external-dns flags to CI install make target

325c5a1

HO: Don't report NotFund for hostedcluster as error

87bfc95

This is completely expected during deletion, logging it at error level makes it look like an issue, which it is not.

enable external-dns registry

32fb191

Merge pull request openshift#1192 from alvaroaleman/fix-logging

3bb8c53

HO: Don't report NotFund for hostedcluster as error

increase MaxConcurrentReconciles on AWS PrivateLink controllers

3f84cd0

Merge pull request openshift#1200 from alvaroaleman/fix

e506d36

KAS: Set proxy, but exempt pod and service CIDR

Merge pull request openshift#1163 from sjenning/dns-e2e

f3e420e

add external-dns flags to CI install make target

Merge pull request openshift#1199 from sjenning/sync-worker-count

82dcf21

sync MaxConcurrentReconciles across all controllers

Merge pull request openshift#1198 from sjenning/dns-enable-registry

339bea9

enable external-dns registry

Add additionalTrustBundle to HostedCluster API

09dfcbb

Copy HostedCluster additionalTrustBundle to HostedControlPlane

40d4dbc

HostedCluster can optionally reference a configmap, in which case we copy the configmap to the HostedControlPlane namespace (similar to SSHKey and other fields).

Create HostedCluster user-ca-bundle configmap

12773d4

When AdditionalTrustBundle is defined we create this ConfigMap to align with the behavior of regular OCP clusters and enable consumption of user-defined CA certs by the guest cluster.

Add create cli option for additional-trust-bundle

0dd4049

This can be used to reference a ConfigMap that contains a user CA bundle.

Add trust bundle volumes to hostedcluster_controller

cdfc64b

The CPO and ignition server need the user CA so the registryclient can access a local registry with a self-signed cert

Add trust bundle to hosted-cluster-config-operator

5626d38

Add default vxlan port for kubevirt clusters

35f8381

Signed-off-by: David Vossel <davidvossel@gmail.com>

Merge pull request openshift#1202 from csrwng/fix-registry-config

e43d7b4

Registry configuration: reconcile only what we need to changes

Merge pull request openshift#1206 from davidvossel/kubevirt-vxlanport…

7b735b8

…s-v1 Unique OpenShift vxlan port for KubeVirt Platform

Merge pull request openshift#1207 from alvaroaleman/upgrade-staticheck

276f01c

Update staticcheck to a version that works with go 1.18

Merge pull request openshift#1204 from alvaroaleman/dump-archive

1b39da1

Dump: Always create an archive

docs: Upgrade mkdocs/material to fix Netlify breakages

7f4d236

This upgrades mkdocs/material to fix Netlify docs compilation breakages resulting from mkdocs/mkdocs#2799.

Merge pull request openshift#1212 from ironcladlou/docs-build-fix

a4fdbc5

docs: Upgrade mkdocs/material to fix Netlify breakages

docs for DNS indirection

799822b

davidvossel and others added 20 commits March 28, 2022 16:59

Update to referencing 4.10 disks and documentation for KV guide

da5972c

Signed-off-by: David Vossel <davidvossel@gmail.com>

Merge pull request openshift#1215 from Basavaraju-G/apiserver-image-f…

cd441b8

…rom-payload read apiserver-network-proxy image from ocp payload

Fix CPO to work with 4.11

2810e6e

The single-hyphen flags do not work anymore due to operator-framework/operator-lifecycle-manager#2362

Merge pull request openshift#1217 from alvaroaleman/fix-4.11

f36feb4

Fix CPO to work with 4.11

default AntiAffinity rules to spread KubeVirt VMs across mgmt nodes

d505b9e

Signed-off-by: David Vossel <davidvossel@gmail.com>

Merge pull request openshift#1219 from ironcladlou/tagging-retries

9706cfc

Retry EIP tagging failures during infra creation

Merge pull request openshift#1218 from davidvossel/vm-default-affinit…

3a35046

…y-v1 AntiAffinity rules to spread KubeVirt VMs across mgmt nodes

Document KubeVirt Platform Ingress/DNS options

2d3945f

Signed-off-by: David Vossel <davidvossel@gmail.com>

Merge pull request openshift#1213 from davidvossel/doc-kubevirt-ingre…

5598e5c

…ss-v1 Document KubeVirt Platform Ingress Setup

Merge pull request openshift#1090 from enxebre/images

337698c

Get autoscaler/machine-approver images from the payload

Hypershift operator: Give a priority that is higher than any controlp…

c49ee9e

…lane component

Merge pull request openshift#1229 from alvaroaleman/prio

9ed6c33

Hypershift operator: Give a priority that is higher than any controlplane component

Merge pull request openshift#1230 from ironcladlou/none-flake-fix

ba6b9c2

e2e: Don't fail test on transient recoverable API lookup

Merge pull request openshift#1226 from alvaroaleman/priority-class

ba2ce8d

Fix priority class for olm cronjob and verify priorityclasses in e2e

Merge pull request openshift#1231 from ironcladlou/disable-uwm

c729c42

e2e: Don't enable user workload monitoring on management clusters

Merge branch 'main' into ff4

f63e3b7

openshift-ci bot requested review from enxebre and sjenning April 5, 2022 13:33

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Apr 5, 2022

openshift-ci bot assigned sjenning Apr 6, 2022

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Apr 6, 2022

openshift-merge-robot merged commit c6ce37a into openshift:release-4.10 Apr 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[release-4.10] Fast-Forward from main #1233

[release-4.10] Fast-Forward from main #1233

alvaroaleman commented Apr 5, 2022

openshift-ci bot commented Apr 5, 2022

openshift-ci bot commented Apr 5, 2022

openshift-ci bot commented Apr 5, 2022

sjenning commented Apr 6, 2022

[release-4.10] Fast-Forward from main #1233

[release-4.10] Fast-Forward from main #1233

Conversation

alvaroaleman commented Apr 5, 2022

openshift-ci bot commented Apr 5, 2022

openshift-ci bot commented Apr 5, 2022

openshift-ci bot commented Apr 5, 2022

sjenning commented Apr 6, 2022