Enforcing correct CI workers for benchmarks #2693

eyalkoren · 2022-07-04T06:15:32Z

Context: benchmarks script failing on new CI processor - https://apm-ci.elastic.co/blue/organizations/jenkins/apm-agent-java%2Fapm-agent-java-mbp/detail/main/285/pipeline:

CPU_MODEL='12th Gen Intel(R) Core(TM) i5-12500 '
...
Cannot determine base frequency for CPU model [12th Gen Intel(R) Core(TM) i5-12500 ]. Please adjust the build script.

Based on Intel's processor's spec.

@kuisathaverat Do you think a ci:benchmarks label to tell CI to run benchmarks for PRs would be useful?

eyalkoren · 2022-07-04T06:20:29Z

scripts/jenkins/run-benchmarks.sh

        BASE_FREQ="1.9GHz"
+    elif [ "${CPU_MODEL}" == "12th Gen Intel(R) Core(TM) i5-12500 " ]
+    then
+        CORE_INDEX=11


Based on the other types, it seems to count the effective threads rather than physical cores, but I am not sure

eyalkoren · 2022-07-04T06:21:31Z

run benchmark tests

ghost · 2022-07-04T06:25:06Z

💚 Build Succeeded

the below badges are clickable and redirect to their specific view in the CI or DOCS

Expand to view the summary

Build stats

Start Time: 2022-07-05T18:39:44.993+0000
Duration: 50 min 6 sec

Test stats 🧪

Test	Results
Failed	0
Passed	3052
Skipped	36
Total	3088

💚 Flaky test report

Tests succeeded.

🤖 GitHub comments

To re-run your PR in the CI, just comment with:

/test : Re-trigger the build.
run benchmark tests : Run the benchmark tests.
run jdk compatibility tests : Run the JDK Compatibility tests.
run integration tests : Run the Agent Integration tests.
run end-to-end tests : Run the APM-ITs.
run windows tests : Build & tests on windows.
run elasticsearch-ci/docs : Re-trigger the docs validation. (use unformatted text in the comment!)

felixbarny · 2022-07-04T06:30:06Z

Why are the benchmarks suddenly executed on a new hardware? This would make any comparisons with past results meaningless.

eyalkoren · 2022-07-04T06:37:57Z

Why are the benchmarks suddenly executed on a new hardware? This would make any comparisons with past results meaningless.

Does this mean we cannot upgrade? Or is it enough to manually compare now and start accumulating new history?

felixbarny · 2022-07-04T06:40:51Z

I suppose it doesn't mean that we can never upgrade but we should carefully review when doing so. For example, is the new worker a dedicated bare metal machine like the old one or is it a cloud VM that may have noisy neighbours? Which background processes are running on the new worker that might interfere with the benchmark?

felixbarny · 2022-07-05T06:58:54Z

@cachedout do you know why the benchmarks aren't executed on the dedicated bare metal machines anymore?

kuisathaverat · 2022-07-05T10:02:43Z

Do you think a ci:benchmarks label to tell CI to run benchmarks for PRs would be useful?

I am not a fan of using labels to trigger CI jobs, I preferred it to be intentional using a GitHub comment because these additional jobs usually do not need to run on every commit, and if you do it you will waisting CI time.

cc @v1v

v1v · 2022-07-05T11:03:30Z

Let me look at what's going on

v1v · 2022-07-05T11:22:08Z

Do you think a ci:benchmarks label to tell CI to run benchmarks for PRs would be useful?

It's not supported,

apm-agent-java/Jenkinsfile

Lines 57 to 58 in 32ee207

    
           // disabled by default, not required for merge 
        
           booleanParam(name: 'bench_ci', defaultValue: false, description: 'Enable benchmarks')

You can enable it by using something like:

                expression { matchesPrLabel(label: 'ci:benchmarks') }

in

apm-agent-java/Jenkinsfile

Lines 300 to 304 in 32ee207

    
           anyOf { 
        
             branch 'main' 
        
             expression { return env.GITHUB_COMMENT?.contains('benchmark tests') } 
        
             expression { return params.bench_ci } 
        
           }

do you know why the benchmarks aren't executed on the dedicated bare metal machines anymore?

It uses metal and, recently there are a bunch of new CI workers for being used by the prodfiler.

Unfortunately, the metal label was added in the CI provisioner, and the CI Jenkinsifle for this project should use something else to be sure it uses the dedicated ones.

What baremetals can you use?

linux&&metal
benchmarks <--- that matches one worker -> worker-1213919 ~~therefore I wonder if this is the one to be used instead~~ and it's consumed by the load-testing as per https://github.com/elastic/apm-agent-java/pull/1467/files#diff-7b74176d56dcf2d85162c7e15a233c3115eca7999fc6744a9b3e59bda5a2861cR65

This reverts commit 6059312.

v1v · 2022-07-05T11:33:29Z

@eyalkoren , I just added some changes to this PR so:

I reverted the change for supporting a new CPU
Fixed the agent labels to filter only those workers that were used in the past.
Supported ci:benchmarks tag
Delete the workspace in the baremetals

Let's see how it goes, I didn't want to change the description of this PR, feel free to do it so 🙇

cachedout · 2022-07-05T17:25:44Z

It uses metal and, recently there are a bunch of new CI workers for being used by the prodfiler.

raises hand This is my fault. I requested workers for the prodfiler benchmarking project and had them labeled with a new label but missed the fact that they'd also be put in an existing pool. Apologies.

Adjusting benchmearks script to discover new core type

6059312

eyalkoren requested a review from kuisathaverat July 4, 2022 06:15

github-actions bot added the agent-java label Jul 4, 2022

eyalkoren commented Jul 4, 2022

View reviewed changes

trentm mentioned this pull request Jul 4, 2022

test: fix setup of benchmarks in CI on new CPU models elastic/apm-agent-nodejs#2804

Merged

v1v added 4 commits July 5, 2022 12:29

ci: use the right metal workers

a4fca73

ci: add ci:benchmarks tag support

c4d6d39

ci: cleanup leftovers

047f06c

Revert "Adjusting benchmearks script to discover new core type"

945a2b7

This reverts commit 6059312.

eyalkoren changed the title ~~Adjusting benchmarks script to discover new processor type~~ Enforcing correct CI workers for benchmarks Jul 5, 2022

v1v added the ci:benchmarks Enable the benchmarks label Jul 5, 2022

jackshirazi approved these changes Jul 6, 2022

View reviewed changes

jackshirazi merged commit 97b8aa1 into elastic:main Jul 6, 2022

Enforcing correct CI workers for benchmarks #2693

Enforcing correct CI workers for benchmarks #2693

Uh oh!

Conversation

eyalkoren commented Jul 4, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eyalkoren Jul 4, 2022

Choose a reason for hiding this comment

Uh oh!

eyalkoren commented Jul 4, 2022

Uh oh!

ghost commented Jul 4, 2022 • edited by ghost Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💚 Build Succeeded

Build stats

Test stats 🧪

💚 Flaky test report

🤖 GitHub comments

Uh oh!

felixbarny commented Jul 4, 2022

Uh oh!

eyalkoren commented Jul 4, 2022

Uh oh!

felixbarny commented Jul 4, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

felixbarny commented Jul 5, 2022

Uh oh!

kuisathaverat commented Jul 5, 2022

Uh oh!

v1v commented Jul 5, 2022

Uh oh!

v1v commented Jul 5, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

v1v commented Jul 5, 2022

Uh oh!

cachedout commented Jul 5, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

eyalkoren commented Jul 4, 2022 •

edited

Loading

ghost commented Jul 4, 2022 •

edited by ghost

Loading

felixbarny commented Jul 4, 2022 •

edited

Loading

v1v commented Jul 5, 2022 •

edited

Loading

cachedout commented Jul 5, 2022 •

edited

Loading