
CI improvements #2080

Merged
merged 3 commits into main from move_opa_ga_ci on Apr 8, 2024

Conversation

@moukoublen moukoublen (Member) commented Apr 2, 2024

Summary of your changes

  • Move OPA CI to the new CI workflow. Update: reverted; moved to a separate PR.
  • Upload cloudbeat logs on failure or when debug logging is enabled (see the sketch below).
  • Delete Docker images from the artifacts only on successful runs (but set retention-days to 2).
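
A minimal sketch of the conditional-upload idea from the second bullet, assuming actions/upload-artifact@v4 and the logs/ path used elsewhere in this PR; runner.debug is '1' only when debug logging is enabled. Step and artifact names are placeholders, not the PR's exact code:

```yaml
# Hypothetical step: upload logs only on failure or when debug logging is on.
- name: Upload cloudbeat logs
  if: ${{ failure() || runner.debug == '1' }}
  uses: actions/upload-artifact@v4
  with:
    name: cloudbeat-logs   # placeholder artifact name
    path: logs/
```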

Screenshot/Data

Related Issues

Checklist

  • I have added tests that prove my fix is effective or that my feature works
  • I have added the necessary README/documentation (if appropriate)

Introducing a new rule?

@moukoublen moukoublen self-assigned this Apr 2, 2024

mergify bot commented Apr 2, 2024

This pull request does not have a backport label. Could you fix it @moukoublen? 🙏
To fix up this pull request, you need to add the backport labels for the needed
branches, such as:

  • backport-v\d.\d.\d is the label to automatically backport to the 8.\d branch (\d is a digit)
    NOTE: backport-skip has been added to this pull request.

@mergify mergify bot added the backport-skip label Apr 2, 2024

github-actions bot commented Apr 2, 2024

📊 Allure Report - 💚 No failures were reported.

Result      Count
🟥 Failed       0
🟩 Passed     359
⬜ Skipped     33

@moukoublen moukoublen force-pushed the move_opa_ga_ci branch 2 times, most recently from 9fa9fdc to beeaa52 on April 3, 2024 09:06
@moukoublen moukoublen changed the title Move opa ci to new ci workflow CI improvements Apr 3, 2024
@moukoublen moukoublen force-pushed the move_opa_ga_ci branch 6 times, most recently from 97debbb to 3d624d1 on April 3, 2024 13:14
@moukoublen moukoublen marked this pull request as ready for review April 3, 2024 13:15
@moukoublen moukoublen requested a review from a team as a code owner April 3, 2024 13:15
```diff
@@ -0,0 +1,39 @@
+name: 'OPA CI'
```
Collaborator

Why change workflow name?

Suggested change:

```diff
-name: 'OPA CI'
+Test OPA Policies
```

Member Author

This is mainly because it's not just testing; it involves fmt check, bundle building, test, opa check, and lint with Regal. So, I thought CI better describes the steps.

But I could change it to Test OPA Policies.
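
For reference, a rough sketch of steps covering the checks listed above, assuming the stock opa and regal CLIs are installed on the runner; the policies/ path and step names are placeholders, not the PR's actual workflow:

```yaml
# Illustrative only; mirrors the checks named above.
jobs:
  opa-ci:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Format check
        run: opa fmt --fail --list policies/   # placeholder path
      - name: OPA check
        run: opa check policies/
      - name: Unit tests
        run: opa test policies/
      - name: Build bundle
        run: opa build -b policies/
      - name: Lint with Regal
        run: regal lint policies/
```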

Collaborator

No, that's what I thought, sounds good. I just want to avoid more workflows in our actions logs, which are already pretty packed, and to avoid confusion, but I agree with you.

@moukoublen moukoublen force-pushed the move_opa_ga_ci branch 2 times, most recently from 6df73da to b1f64bf on April 8, 2024 10:35
```yaml
- name: Upload cloudbeat logs
  uses: actions/upload-artifact@v4
  with:
    name: cloubeat-logs-ci-aws
    path: logs/
    if-no-files-found: warn
    retention-days: 5
```
Collaborator

@moukoublen I'm not sure if we need to store debug logs for more than a day. Enabling debug mode in the workflow and generating logs is usually a proactive action, and typically, you'll use those logs on the same day.

Member Author
@moukoublen moukoublen Apr 8, 2024

Changed it to 1.
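
Presumably the step above then reads as follows (sketch; only the retention value changed):

```yaml
  with:
    name: cloubeat-logs-ci-aws
    path: logs/
    if-no-files-found: warn
    retention-days: 1
```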

```diff
@@ -51,15 +51,19 @@ runs:
        run: poetry run pytest -k "aws" --alluredir=./allure/results/ --clean-alluredir

      - name: Upload test results
-       if: ${{ success() || failure() }}
+       if: ${{ always() }}
```
Collaborator

Several consecutive commits will cancel the workflow run, and in such cases, we don't need to upload test results. So, we can use !cancelled() instead of always().

Member Author
@moukoublen moukoublen Apr 8, 2024

It used to be like that (success() || failure() is equivalent to !cancelled()), and I specifically changed it because when we have missing results, pytest retries, the run eventually gets canceled, and the job is marked as canceled.

In this case, we still want the logs/results. That's why I changed it.
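
For reference, a sketch of how the status-check functions differ in an if: condition; per GitHub Actions semantics, success() || failure() behaves the same as !cancelled(), while always() also fires when the run is canceled. Step names and commands are placeholders:

```yaml
# Illustrative steps only.
- name: Upload results unless the run was canceled
  if: ${{ !cancelled() }}   # same as success() || failure()
  run: echo "upload"

- name: Upload results no matter what, even on cancellation
  if: ${{ always() }}
  run: echo "upload"
```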

Collaborator

I'm not entirely clear on the scenario. Are you referring to situations where pytest is running and the results are incomplete when the job gets canceled? If the job is interrupted or canceled, I'm not convinced we need to retain the incomplete results.

Member Author

Let's say that because of some bug in the code, a test case, or a cloud resource configuration, we end up with some expected results missing.

Pytest retries, trying to find the expected rule with the expected status.

In that case, the job's timeout might be reached before pytest ends with an exit code that indicates an error.

If this happens, we must upload the test results to identify which test case didn't pass.

I don't remember the specific CI run where this happened; I could search for it.

Perhaps we could better sync the pytest timeout with the job timeout? (See the sketch below.)
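
One possible way to sync the two, sketched under the assumption that GNU timeout is available on the runner (the 60/50-minute values are placeholders): cap pytest below the job timeout so it exits with a real error code before the runner cancels the job.

```yaml
# Hypothetical job illustrating the timeout alignment.
jobs:
  test-aws:
    runs-on: ubuntu-latest
    timeout-minutes: 60
    steps:
      - name: Run tests
        # GNU timeout stops pytest with a non-zero exit code at 50 minutes,
        # before the 60-minute job timeout marks the job as canceled.
        run: timeout 50m poetry run pytest -k "aws" --alluredir=./allure/results/
```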

Collaborator

Not necessary to search for the specific run; I understand now. In this case, using always() will address the issue, but it might introduce noise when the job is canceled due to a new commit being submitted.
Okay, let's try using always() and see if it behaves as expected for our needs.

Member Author

Yes, unfortunately, it introduces a bit of noise and a small delay on consecutive commits. 😞

If it becomes a bummer, we can revert back to !cancelled().

```diff
@@ -306,7 +328,7 @@ jobs:
    timeout-minutes: 60
    permissions:
      pull-requests: write
-   if: ${{ success() || failure() }}
+   if: ${{ always() }}
```
Collaborator

The same applies here; I believe using !cancelled() is better than always()

Collaborator
@gurevichdmitry gurevichdmitry left a comment

Added some comments

@moukoublen moukoublen merged commit 84229c2 into main Apr 8, 2024
23 checks passed
@moukoublen moukoublen deleted the move_opa_ga_ci branch April 8, 2024 14:28