e2e tests: Be resilient to temporary unavailability of k8s #28

JAORMX · 2019-11-27T17:03:21Z

The tests are flaky due to leader re-elections in etcd. The following
error is logged every once in a while:

 single_scan_test.go:109: rpc error: code = Unavailable desc = etcdserver: leader changed

So lets try to be more resilient in our tests.

The tests are flaky due to leader re-elections in etcd. The following error is logged every once in a while: single_scan_test.go:109: rpc error: code = Unavailable desc = etcdserver: leader changed So lets try to be more resilient in our tests.

A timeout might happen cause the cluster might be too busy. So we also need to be resilient to timeouts.

JAORMX · 2019-11-27T18:18:04Z

/test e2e-aws

etcd failed again... but this time it was on the cleanup which is in the operator-sdk code.

JAORMX · 2019-11-27T18:49:03Z

pushed operator-framework/operator-sdk#2277 to avoid issues like this in the future.

jhrozek · 2019-11-27T19:55:38Z

/lgtm

openshift-ci-robot · 2019-11-27T19:55:51Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: JAORMX, jhrozek

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [JAORMX,jhrozek]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci-robot · 2019-11-27T21:12:37Z

@JAORMX: The following test failed, say /retest to rerun them all:

Test name	Commit	Details	Rerun command
ci/prow/e2e-aws	`876a86c`	link	`/test e2e-aws`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

JAORMX mentioned this pull request Nov 27, 2019

Add Dockerfile specifically for CI #26

Merged

openshift-ci-robot added the size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. label Nov 27, 2019

openshift-ci-robot requested review from jhrozek and mrogers950 November 27, 2019 17:04

openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 27, 2019

e2e: Be resilient to timeouts on the the API calls

876a86c

A timeout might happen cause the cluster might be too busy. So we also need to be resilient to timeouts.

JAORMX mentioned this pull request Nov 27, 2019

Make wget call to get operator-sdk for non-verbose #27

Merged

jhrozek approved these changes Nov 27, 2019

View reviewed changes

openshift-ci-robot assigned jhrozek Nov 27, 2019

openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Nov 27, 2019

JAORMX merged commit 596cfc9 into openshift:master Nov 27, 2019

JAORMX deleted the unavailable branch December 11, 2020 14:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

e2e tests: Be resilient to temporary unavailability of k8s #28

e2e tests: Be resilient to temporary unavailability of k8s #28

JAORMX commented Nov 27, 2019

JAORMX commented Nov 27, 2019

JAORMX commented Nov 27, 2019

jhrozek commented Nov 27, 2019

openshift-ci-robot commented Nov 27, 2019

openshift-ci-robot commented Nov 27, 2019

e2e tests: Be resilient to temporary unavailability of k8s #28

e2e tests: Be resilient to temporary unavailability of k8s #28

Conversation

JAORMX commented Nov 27, 2019

JAORMX commented Nov 27, 2019

JAORMX commented Nov 27, 2019

jhrozek commented Nov 27, 2019

openshift-ci-robot commented Nov 27, 2019

openshift-ci-robot commented Nov 27, 2019