
connectors-ci: improve pytest result evaluation #28767

Conversation

@alafanechere (Contributor) commented Jul 27, 2023

What

Relates to #27425, #28423

Original problem

When running pytest in a dagger container we face two challenges:

  • When a pytest suite is empty or all tests are skipped, pytest returns exit code 5. Any exit code > 0 in Dagger raises an ExecError, but we want to mark this step as skipped, not failed.
  • When CAT (which is a pytest suite) fails, we still need to perform after-the-fact operations to retrieve updated connector configurations from the container. When a container raises an ExecError, we can't interact with its filesystem without raising the same ExecError again.
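The first bullet can be made concrete: pytest documents exit code 5 as "no tests were collected". A minimal sketch of the desired mapping, assuming a simple StepStatus enum (illustrative, not the PR's actual class):

```python
from enum import Enum

class StepStatus(Enum):
    SUCCESS = "success"
    FAILURE = "failure"
    SKIPPED = "skipped"

def pytest_exit_code_to_status(exit_code: int) -> StepStatus:
    # pytest exit codes: 0 = all tests passed, 5 = no tests were collected
    if exit_code == 0:
        return StepStatus.SUCCESS
    if exit_code == 5:
        return StepStatus.SKIPPED
    return StepStatus.FAILURE
```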

How we overcame this problem so far

We installed a pytest plugin, pytest-custom_exit_code, to make the pytest suite always exit with code 0. We inferred success / failure / skipping of tests by parsing the stdout log. This is not robust, because the stdout content can change with the pytest version, or a CAT run can output unexpected content to stdout. This led to the false-positive errors mentioned in #27425.

The new approach to overcome this problem

We still make the CAT exec artificially exit with a 0 exit code, but we store the original exit code in a file in the container filesystem: /exit_code. When get_step_result runs, it checks whether such a file exists in the container and returns the exit code value from the file instead of the container's own exit code. This allows us to capture the actual exit code that pytest returned.

How

The new approach mentioned above is mainly introduced via:

I tried to thoroughly unit test these changes to show how async tests with a dagger client work:

Additional refactor

Centralizing the AcceptanceTest logic

The logic to build the CAT container lived in the environment.with_connector_acceptance_test function. I decided to move it to common.AcceptanceTest._build_connector_acceptance_test, because environment functions are originally meant to be reused across multiple steps, while this function is very specific to AcceptanceTest.

Make all steps write stdout / stderr to a log file

(Relates to #28423)
We originally only wrote CAT output logs to a local directory. I figured this logic could easily apply to all steps and would give developers more debugging capabilities. Logs are written when the step results are built in get_step_results:

await self.write_log_files(stdout, stderr)
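A synchronous sketch of what such a helper could look like (the PR's write_log_files is awaited; the Step class and slugify here are simplified stand-ins built from the log-directory path quoted later in this review):

```python
from pathlib import Path

def slugify(name: str) -> str:
    # Simplified stand-in for the slugify helper used in the pipelines code
    return name.lower().replace(" ", "-").replace("/", "-")

class Step:
    def __init__(self, code_directory: str, pipeline_name: str, title: str):
        self.code_directory = code_directory
        self.pipeline_name = pipeline_name
        self.title = title

    def write_log_files(self, stdout: str, stderr: str) -> list:
        # One log directory per pipeline, under the connector's code directory
        log_directory = Path(self.code_directory) / "airbyte_ci_logs" / slugify(self.pipeline_name)
        log_directory.mkdir(exist_ok=True, parents=True)
        written = []
        for suffix, content in (("stdout", stdout), ("stderr", stderr)):
            log_file = log_directory / f"{slugify(self.title)}_{suffix}.log"
            log_file.write_text(content)
            written.append(log_file)
        return written
```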

Wrap acceptance test into its own dagger Pipeline.

(Relates to #28423)
I created a Step-specific dagger client in AcceptanceTest to showcase how we could wrap step execution in a dagger Pipeline: this groups the CAT dagger operations explicitly in the dagger UI.

def dagger_client(self) -> Client:
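The property above can be sketched as deriving a named sub-pipeline from the shared client. The pipeline(name) call exists in the dagger Python SDK of that era; the context and attribute names here are assumptions:

```python
class AcceptanceTests:
    def __init__(self, context):
        self.context = context
        self.title = "Acceptance tests"

    @property
    def dagger_client(self):
        # Grouping all CAT operations under a named dagger Pipeline makes
        # them show up as one explicit group in the dagger UI.
        return self.context.dagger_client.pipeline(self.title)
```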

Recommended reading order

  1. class AcceptanceTests(PytestStep):

@octavia-squidington-iii (Collaborator):

source-pokeapi test report (commit 9ae813c9e4) - ❌

⏲️ Total pipeline duration: 02mn32s

Step Result
Validate airbyte-integrations/connectors/source-pokeapi/metadata.yaml
Connector version semver check
QA checks
Code format checks
Connector package install
Build source-pokeapi docker image for platform linux/x86_64
Unit tests
Acceptance tests

🔗 View the logs here

Please note that tests are only run on PRs that are ready for review. Please set your PR to draft mode on subsequent commits to avoid flooding the CI engine and upstream services.
You can run the same pipeline locally on this branch with the airbyte-ci tool using the following command:

airbyte-ci connectors --name=source-pokeapi test

@octavia-squidington-iii (Collaborator):

source-openweather test report (commit 9ae813c9e4) - ✅

⏲️ Total pipeline duration: 03mn46s

Step Result
Validate airbyte-integrations/connectors/source-openweather/metadata.yaml
Connector version semver check
QA checks
Code format checks
Connector package install
Build source-openweather docker image for platform linux/x86_64
Unit tests
Acceptance tests

🔗 View the logs here

Please note that tests are only run on PRs that are ready for review. Please set your PR to draft mode on subsequent commits to avoid flooding the CI engine and upstream services.
You can run the same pipeline locally on this branch with the airbyte-ci tool using the following command:

airbyte-ci connectors --name=source-openweather test

This reverts commit 9ae813c.
@bnchrch (Contributor) left a comment:


A few comments, but the only one I think is a hard blocker is referencing context.connector in Step.

on:
  push:
    branches:
      - !master
Contributor:


are our tests now passing for pipelines? Did I miss that PR?!

Contributor Author:


Sorry, it was a WIP push from a different branch: #28857

@@ -124,19 +106,23 @@ def run_duration(self) -> timedelta:
@property
def logger(self) -> logging.Logger:
if self.should_log:
return self.context.logger
return logging.getLogger(f"{self.context.pipeline_name} - {self.title}")
Contributor:


👍

return []

written_log_files = []
log_directory = Path(f"{self.context.connector.code_directory}/airbyte_ci_logs/{slugify(self.context.pipeline_name)}")
Contributor:


Can we use connector here if this is our generic Step class?


written_log_files = []
log_directory = Path(f"{self.context.connector.code_directory}/airbyte_ci_logs/{slugify(self.context.pipeline_name)}")
await log_directory.mkdir(exist_ok=True, parents=True)
Contributor:


Doing all this manual stderr/stdout handling and log writing makes me nervous.

Particularly now that they write not just to our CI folder but also to the connector folder.

My worries generally come down to this: if we keep writing log handling at a low level,

  1. We will have to keep repeating our logging code.
  2. More importantly, we keep repeating if statements like if self.context.is_local and self.should_persist_stdout_stderr_logs:, which, if missed, lead to tricky bugs.
  3. And if we go this route for too long, it will be very hard to go back, as there will be a lot of code to refactor.

This leads me to ask: what is stopping us from moving all our log writing/handling to the dagger client boundary here:
https://github.com/airbytehq/airbyte/blob/master/airbyte-ci/connectors/pipelines/pipelines/pipelines/connectors.py#L85

For example, a possible solution could be:

  1. Write our own logger that extends the typical Python logger by adding pipeline-specific context:
    1. pipeline name
    2. pipeline step
    3. etc.
  2. Write our own subclass of TextIOWrapper that, based on the pipeline context we stuff into the logger:
    1. writes a log file per pipeline
    2. writes a log file per step
    3. writes a root log file representing everything
    4. decides if we are also writing to stdout/stderr
    5. decides if we need to send these logs anywhere else (e.g. Sentry, Datadog, CloudWatch, etc.)

If this is possible and is a good idea, can we make a small step towards this now by:

  1. Moving the decision of whether or not we write to a log file into a custom logger
  2. Moving the writing logic into the logger

e.g. https://stackoverflow.com/questions/6386698/how-to-write-to-a-file-using-the-logging-python-module
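A minimal sketch of that suggestion, using only the stdlib logging module (function and format names are illustrative): the logger's name carries the pipeline context, and a FileHandler owns the decision and mechanics of writing to a file.

```python
import logging
from pathlib import Path

def get_pipeline_logger(pipeline_name: str, step_title: str, log_dir: Path) -> logging.Logger:
    # The logger name embeds the pipeline context, mirroring the
    # f"{pipeline_name} - {title}" pattern seen elsewhere in this PR
    logger = logging.getLogger(f"{pipeline_name} - {step_title}")
    logger.setLevel(logging.INFO)
    if not logger.handlers:  # don't stack handlers on repeated lookups
        log_dir.mkdir(parents=True, exist_ok=True)
        handler = logging.FileHandler(log_dir / f"{step_title}.log")
        handler.setFormatter(logging.Formatter("%(asctime)s %(name)s %(levelname)s %(message)s"))
        logger.addHandler(handler)
    return logger
```

With this in place, steps would just call logger.info(...) and the file-writing policy lives in one spot instead of being repeated per step.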

Contributor Author:


@bnchrch let's keep the logging change for a different PR then.
But just to react to your comment:

> My worries generally come down to if we keep writing log handling at a low level

This is not low level IMO, because it's a change made at the Step class level, so we centralize all the logging logic in the base class.

> Write our own logger that extends the typical Python logger by adding pipeline-specific context

This is what this change did: the step has a specific logger with the pipeline and step name.

> Write our own subclass of TextIOWrapper that, based on the pipeline context we stuff into the logger

In this context you mean parsing the dagger client logs. Splitting them into different per-step/per-pipeline log files would be very brittle, as we have no control over the shape of the dagger logs. I'd rather ask the dagger team to provide such a feature, and IMO it directly overlaps with the log access in the Dagger Web UI.
Moreover, if steps are cached their stdout is not shown in the logs (it is replaced with CACHED).

> Can we make a small step towards this now by moving the decision of whether or not we write to a log file, and the writing logic itself, into a custom logger?

Yep, you're right that the combo of logging to stdout + the write logic can definitely be handled at the logger level. Let's do it later and groom #28423 a bit more.
I think we should be clearer in differentiating what we mean by dagger logs, step logs, pipeline logs, etc.

return self.pytest_logs_to_step_result(logs)
return step_result

def get_cache_buster(self) -> str:
Contributor:


👍

"""
return datetime.datetime.utcnow().strftime("%Y%m%d")

async def _build_connector_acceptance_test(self, connector_under_test_image_tar: File) -> Container:
Contributor:


Well done function.

But I'm starting to notice we have a lot of connector code that isn't obvious based on the file structure.

For example, this is in test/common.

Should we start making it more specific? e.g.:

  • test/connector_acceptance/common.py
  • actions/connectors/tests.py

Contributor Author:


The test/common module is originally meant to receive connector testing steps that are agnostic to the connector language. This is why AcceptanceTest belongs here.
The actions module was originally meant to receive reusable code. But I think I overused it and stuffed actions/environments with code that is not reused in multiple contexts.

@bnchrch (Contributor) left a comment:


lgtm!

@alafanechere alafanechere merged commit 502134f into master Aug 1, 2023
16 checks passed
@alafanechere alafanechere deleted the augustin/connectors-ci/change-pytest-step-result-evaluation branch August 1, 2023 09:29