e2e: test fixes and bug fixes from test runs #3716

phantomjinx · 2022-10-04T09:35:01Z

Release Note

NONE

* Uses yq to insert the test bundle into existing channels as well as appending bundle into any new channels * Better replicates what the bundle will actually do to the channels in the bundle index

* Bump the operator-sdk version to 1.16 as being used to perform the generation * Updates the samples to the correct syntax

essobedo

Very nice 👍

e2e/global/common/kamelet_test.go

.github/actions/e2e-builder/exec-tests.sh

tadayosi

Some small feedback

tadayosi · 2022-10-05T03:24:58Z

e2e/namespace/install/cli/run_test.go

+	Eventually(PlatformPhase(ns), TestTimeoutLong).Should(Equal(v1.IntegrationPlatformPhaseReady))
+}
+
+func TestKamelCLIRunGitHubExampleJava(t *testing.T) {


Why do we need to decompose the one TestKamelCLIRun() into smaller test funcs? Is it really necessary? Otherwise we'd keep one test func with multiple t.Run() style.

To me it looks a bit overkill to just fix one problematic test case "Run with http dependency".

In the same way as TestRunConfig, the single test with run functions requires the clearing of the namespace at the end of each run - avoiding test contamination. The test executions seems to be less reilable in these circumstances since the delete might not clear everything before the next run is started. Since the WithNewNamespace has its own delete mechanism (deleting the whole namespace) and each test takes a fresh namespace the danger of test contamination is reduced and reliability improved.

I am not convinced yet. From the point of view of test independence, it is obviously good to have separate test functions, but this creates a namespace each time, which then causes time efficiency problems due to the increased number of operator installations and the unavailability of downloaded jar caches and built kits in a new namespace. And we are already suffering from the long execution time of E2E tests. As a result of that trade-off, we've been trying to keep the tests together in E2E as much as possible (refer to the discussion #3298 if you are not aware of it). It is not consistent with the rest if we separate this test only here.

In the same way as TestRunConfig, the single test with run functions requires the clearing of the namespace at the end of each run - avoiding test contamination.

In the light of the above discussion, I don't see it particularly as a problem. And for this particular TestKamelCLIRun test, forcing to use a unique name for each test run would look like better improvement to avoid contamination for me.

If we had to separate the tests to solve the instability issue, then that would be a clear reason to separate them, but as far as I remember, this test hasn't been particularly unstable except this new one "Run with http dependency". Why can't we just fix the problematic test instead? Or has the TestKamelCLIRun test been flaky on OCP?

Looking at the test times between 2 test runs, I can see the time differential for both TestKamelCLIRun and TestRunConfig is large. That being the case, converting the tests is not an ideal solution, despite improvements in reliability. So, I'll modify this back and relook at TestRunConfig (think the kamel delete.... might have to be improved to remove some test contamination).

e2e/support/test_support.go

e2e/support/util/dump.go

* Extends the timeout for testing to 90m since if in debug mode, the extra logging increases the testing time and takes slightly longer than 60m * Adds in extra debug statements for logging when running in debug mode * Exposes the LOG_LEVEL parameter to e2e testsuite so tests can be switched to debug mode * Prints the call stack for kamel binary if running test in debug mode * Fixes the StructuredLogs test by better detection of invalid operator log entries by the test * Provide functions to heal a crashed catalogsource pod by waiting for its pull-secret to become available and then deleting the pod and allowing the source to reprovision a new one * Fixes uninstall test by recognising roles are not deleted when install was by the OLM * Fixes install test by checking the LOG_LEVEL env var rather than string to read the operator log which can be truncated * Fixes to tests by extending timeouts for integration pods coming up * Removes some cleanups from individual tests as these should be taken care of by the cleanup of the namespace * Better logging when building the bundle and bundle index images # Conflicts: # script/Makefile

* See issue apache#3667 for details * Can re-enable once solution has been concluded

* Separate test tasks into separate tests in different namespaces to avoid any contamination

* If test namespace exists then create a different one with extra suffix

* If operator is not uninstalled successfully then provide a warning rather than throw an error as we want to try to keep the tests going

* test_support.go * When installing for the tests, handle CAMEL_K_TEST_MAVEN_CLI_OPTIONS to inject maven-cli options specified by the tests * If e2e tests have a log-level of debug then set the maven-cli-options to "-X" to retrieve debugging from maven builds * Renames CAMEL_K_LOG_LEVEL with TEST prefix

* The default timeout for download timeouts was so quick as to make the test prone to failure. Increasing this timeout allows for more reliable tests.

* Rather than trying to delete resources at the end of each sub-test, it is simpler and more reliable to generate a new namespace for each and have it deleted * The sampleJar URL is changed for the http dependency tests to avoid the request have to do a redirect. This improves the reliability of its retrieval * Sets the http dependency tests to problematic since on OCP4, the repositories are not being detected by the maven build causing test failures. See apache#3708.

* The use of UpdateScale is unreliable in producing the well-known error "the object has been modified; please apply your changes to the latest version and try again". * Replacing UpdateScale with PatchScale avoids this error and allows the test to continue successfully

* While checking for the catalogue source pod, the function needs to assume that the pod may not yet exist.

tadayosi

One outstanding discussion, but otherwise it looks good to me.

phantomjinx · 2022-10-06T09:26:18Z

@tadayosi If you are happy with the explanations and changes then could you, please, merge this?

phantomjinx added 3 commits October 4, 2022 10:17

fix(bundle-index-gen): Improves generation of bundle index

547430d

* Uses yq to insert the test bundle into existing channels as well as appending bundle into any new channels * Better replicates what the bundle will actually do to the channels in the bundle index

fix(platform) Incorrect order of parameters

fb61362

fix(bundle): Small fixes for bundle generation

4bcb5ce

* Bump the operator-sdk version to 1.16 as being used to perform the generation * Updates the samples to the correct syntax

johnpoth approved these changes Oct 4, 2022

View reviewed changes

essobedo reviewed Oct 4, 2022

View reviewed changes

e2e/global/common/kamelet_test.go Outdated Show resolved Hide resolved

.github/actions/e2e-builder/exec-tests.sh Outdated Show resolved Hide resolved

phantomjinx force-pushed the main branch 2 times, most recently from 58e5d98 to af841d8 Compare October 4, 2022 12:42

essobedo approved these changes Oct 4, 2022

View reviewed changes

tadayosi requested changes Oct 5, 2022

View reviewed changes

phantomjinx force-pushed the main branch from af841d8 to caa77a8 Compare October 5, 2022 09:31

phantomjinx added 10 commits October 5, 2022 10:42

(e2e): Mark KameletClasspathLoading test as temporarily problematic

afec3b0

* See issue apache#3667 for details * Can re-enable once solution has been concluded

(e2e): Refactor config test

d7bf1a5

* Separate test tasks into separate tests in different namespaces to avoid any contamination

(e2e): check if namespace already exists

e3d8197

* If test namespace exists then create a different one with extra suffix

(e2e): Modify error of operator uninstall to warning

70b50ef

* If operator is not uninstalled successfully then provide a warning rather than throw an error as we want to try to keep the tests going

(e2e): Increases the download dependency timeout

5e467df

* The default timeout for download timeouts was so quick as to make the test prone to failure. Increasing this timeout allows for more reliable tests.

fix(e2e): Stop go panic if pod or pod status is not initialised

9ee6e08

* While checking for the catalogue source pod, the function needs to assume that the pod may not yet exist.

phantomjinx force-pushed the main branch from caa77a8 to 9ee6e08 Compare October 5, 2022 09:45

phantomjinx requested a review from tadayosi October 5, 2022 12:22

essobedo mentioned this pull request Oct 5, 2022

The --add-repo switch fails with a global operator #3667

Closed

oscerd approved these changes Oct 6, 2022

View reviewed changes

tadayosi reviewed Oct 6, 2022

View reviewed changes

tadayosi merged commit d82ae43 into apache:main Oct 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

e2e: test fixes and bug fixes from test runs #3716

e2e: test fixes and bug fixes from test runs #3716

phantomjinx commented Oct 4, 2022

essobedo left a comment

tadayosi left a comment

tadayosi Oct 5, 2022

phantomjinx Oct 5, 2022

tadayosi Oct 6, 2022 •

edited

Loading

phantomjinx Oct 6, 2022

tadayosi left a comment

phantomjinx commented Oct 6, 2022

e2e: test fixes and bug fixes from test runs #3716

e2e: test fixes and bug fixes from test runs #3716

Conversation

phantomjinx commented Oct 4, 2022

essobedo left a comment

Choose a reason for hiding this comment

tadayosi left a comment

Choose a reason for hiding this comment

tadayosi Oct 5, 2022

Choose a reason for hiding this comment

phantomjinx Oct 5, 2022

Choose a reason for hiding this comment

tadayosi Oct 6, 2022 • edited Loading

Choose a reason for hiding this comment

phantomjinx Oct 6, 2022

Choose a reason for hiding this comment

tadayosi left a comment

Choose a reason for hiding this comment

phantomjinx commented Oct 6, 2022

tadayosi Oct 6, 2022 •

edited

Loading