Setup parca-agent for ci-runs #187

SimonRichardson · 2024-05-22T09:17:21Z

Start collecting profiling data for ci-runs (specifically looking at longer gating test runs). We want the data for 2 reasons:

See if there are any potential bottlenecks in terms of performance that we can see when the jujud and jujud-controller is running.
We can use profile guided optimisations (PGO) from the runs to apply to the binaries.

This currently only applies to lxd deployments. For other providers we will need to install it via cloud-init or bake it into an image.

This is an experiment to see if it's useful or not.

SimonRichardson · 2024-05-22T09:18:47Z

jobs/ci-run/integration/common/test-runner.sh

+    # Hide the setting of the bearer token to the parca agent.
+    set +x
+    {
+        sudo snap set parca-agent remote-store-bearer-token="${{PARCA_BEARER_TOKEN}}"


The $PARCA_BEARER_TOKEN is setup in the credentials.

Do you need to set up an || true in case it fails? I also don't know if this sort of thing ends up in bash history, etc.

nvinuesa · 2024-05-22T10:04:20Z

jobs/ci-run/integration/common/test-runner.sh

+    fi
+
+    sudo snap install parca-agent --classic
+    sudo snap set parca-agent metadata-external-labels="machine=ci-run-${{BOOTSTRAP_PROVIDER}}"


Just a thought, we could add some labels like TEST_RUNNER_NAME to make it easier to correlate to issues we eventually find?

Maybe, I think we just care about the provider/substrate rather what test is running. Otherwise we might start gaming for a specific test and not juju as a whole.

AIUI, this actually sets it up on the host whenever we have an lxd run, is that correct? So this isn't setting up parca-agent in lxd, it is setting up parca-agent on hosts that run lxd.
I have a slight concern about side effects if you are trying to run the CI suite locally. In general, we might want to have a CI step for Jenkins ephemeral bots to install and configure parca-agent, but I'm kind of against doing it as a side effect of running a test.

nvinuesa

LGTM thanks!

Start collecting profiling data for ci-runs (specifically looking at longer gating test runs). We want the data for 2 reasons: 1. See if there are any potential bottlenecks in terms of performance that we can see when the jujud and jujud-controller is running. 2. We can use profile guided optimisations (PGO) from the runs to apply to the binaries. This currently only applies to lxd deployments. For other providers we will need to install it via cloud-init or bake it into an image. This is an experiment to see if it's useful or not.

manadart

I think this is going to be of limited use.

Test scenarios won't give enough quality signal to inform profile-guided optimisation.

The controllers are so short-lived and so narrow in focus, that we won't see the kind of issues (growth over time for example) that profiling in-theatre shows.

SimonRichardson · 2024-05-29T10:06:43Z

The controllers are so short-lived and so narrow in focus

The problem is, we don't as a Juju team run long-running controllers. Most of our work is driven by features and tearing down controllers is a natural practice. I would say that even some information is better than no information.

I will mention that the tests are inherently backwards in the way they run. The concept was to keep one singular controller runner per substrate for the lifetime of the gating tests. At the moment they're destroyed better each test. Making that change alone would ensure that we're recreating the lifetime of a controller with models added and removed, although in a condensed time frame. That would at least elevate the "narrow in focus" concern (which I do agree with).

I should have stated that this was intended to be a prototype for LXD to ensure that we are surfacing the parca-agent information to polar signals.

SimonRichardson self-assigned this May 22, 2024

SimonRichardson commented May 22, 2024

View reviewed changes

nvinuesa reviewed May 22, 2024

View reviewed changes

nvinuesa approved these changes May 22, 2024

View reviewed changes

SimonRichardson force-pushed the parca-agent branch from b357113 to 7ffa6bb Compare May 22, 2024 14:08

manadart reviewed May 29, 2024

View reviewed changes

SimonRichardson closed this May 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Setup parca-agent for ci-runs #187

Setup parca-agent for ci-runs #187

SimonRichardson commented May 22, 2024

SimonRichardson May 22, 2024

jameinel May 22, 2024

nvinuesa May 22, 2024

SimonRichardson May 22, 2024

jameinel May 22, 2024

nvinuesa left a comment

manadart left a comment

SimonRichardson commented May 29, 2024

Setup parca-agent for ci-runs #187

Setup parca-agent for ci-runs #187

Conversation

SimonRichardson commented May 22, 2024

SimonRichardson May 22, 2024

Choose a reason for hiding this comment

jameinel May 22, 2024

Choose a reason for hiding this comment

nvinuesa May 22, 2024

Choose a reason for hiding this comment

SimonRichardson May 22, 2024

Choose a reason for hiding this comment

jameinel May 22, 2024

Choose a reason for hiding this comment

nvinuesa left a comment

Choose a reason for hiding this comment

manadart left a comment

Choose a reason for hiding this comment

SimonRichardson commented May 29, 2024