Add benchmark-runner Prow CI step-registry and test jobs by arpsharm · Pull Request #77965 · openshift/release

arpsharm · 2026-04-17T12:48:04Z

Summary

Prow CI step-registry and test jobs for benchmark-runner workloads on a pre-provisioned bare-metal cluster.

- 31 test entries across sysbench, uperf, hammerdb, fio, vdbench, bootstorm, winmssql, and Windows scale
- Reusable step-ref `redhat-performance-benchmark-runner`
- Sequential "all" entry runs 29 workloads

Test plan

openshift-ci · 2026-04-17T12:48:23Z

Hi @arpsharm. Thanks for your PR.

I'm waiting for an openshift member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work.

Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

coderabbitai · 2026-04-17T12:48:39Z

Walkthrough

Adds a benchmark-runner step to the step-registry (image, command, env, resources, metadata, OWNERS), an entrypoint script to run workloads and collect artifacts, a test-step YAML matrix targeting OCP 4.22 nightly, and many presubmit Prow jobs on cluster build03 for the new scenarios.

Changes

Benchmark Runner CI Pipeline

Layer / File(s)	Summary
Step Registry Definition and Ownership `ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml`, `ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.metadata.json`, `ci-operator/step-registry/redhat-performance/benchmark-runner/OWNERS`	Defines the `redhat-performance-benchmark-runner` step with `ci/benchmark-runner:latest`, command script, credential mount at `/secret`, 30s grace, 24h timeout, configurable env vars (WORKLOAD, SCALE, WINDOWS_IMAGE, LSO_DISK_ID, WORKER_DISK_IDS, REDIS, THREADS_LIMIT, ENABLE_PROMETHEUS_SNAPSHOT, TIMEOUT, RUN_TYPE), resource requests, metadata, and owners.
Benchmark Runner Execution Script `ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`	Bash entrypoint that loads Vault kubeadmin/kubeconfig, optionally establishes SSH SOCKS proxy, exports Vault secrets into env vars, creates the `benchmark-runner` namespace, waits for KubeVirt when needed, selects Windows image URLs, runs `/benchmark_runner/main/main.py` in single or `all` modes with per-workload SCALE/workload mapping, monitors VM runs, and writes debug artifacts on failure.
Test Step Configuration and Scenarios `ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml`	Targets OCP `releases.latest.candidate` stream `nightly` v4.22, sets global workload resource requests (100m CPU, 200Mi), and defines a tests array of named scenarios (individual workloads with WORKLOAD env and optional SCALE/WINDOWS_IMAGE), plus an `all` scenario with 24h timeout and generated metadata.
Presubmit Job Definitions `ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml`	Adds ~30+ Prow presubmit jobs for the `test-step` variant on cluster `build03`, each running `ci-operator` with standardized args, env (HTTP_SERVER_IP from pod status), and secrets/volume mounts. Jobs cover benchmark-runner-all, bootstorm, fio, hammerdb (pod/vm and LSO variants), sysbench, uperf, vdbench, and Windows VM scale targets plus winmssql-vm, each with `/test ...` trigger regexes.

Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~25 minutes

suggested_labels: rehearsals-ack

🚥 Pre-merge checks | ✅ 12

✅ Passed checks (12 passed)

Check name	Status	Explanation
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Stable And Deterministic Test Names	✅ Passed	PR adds CI/CD infrastructure (YAML configs, Bash scripts) with no Ginkgo tests. Custom check for Ginkgo test name stability is not applicable.
Test Structure And Quality	✅ Passed	Check is not applicable: PR adds CI/CD configuration (YAML, Bash scripts, JSON metadata), not Ginkgo test code. No test files to review against Ginkgo quality requirements.
Microshift Test Compatibility	✅ Passed	This PR adds CI infrastructure (YAML configs, bash scripts) for benchmark-runner, not Ginkgo e2e tests, so MicroShift test compatibility check is not applicable.
Single Node Openshift (Sno) Test Compatibility	✅ Passed	PR adds benchmark-runner CI infrastructure (YAML configs, shell scripts, metadata) with no Ginkgo e2e tests, so SNO compatibility check does not apply.
Topology-Aware Scheduling Compatibility	✅ Passed	PR adds CI/Prow configuration files and test orchestration scripts only. No Kubernetes deployment manifests, operator code, or scheduling constraints are introduced.
Ote Binary Stdout Contract	✅ Passed	PR adds only CI/CD configuration files (YAML, JSON), OWNERS files, and Bash scripts. No Go test code or OTE binaries are added; check is not applicable.
Ipv6 And Disconnected Network Test Compatibility	✅ Passed	PR adds CI/CD configuration and bash helper script for benchmark-runner, not Ginkgo e2e tests. Check is not applicable as no It(), Describe(), Context(), or When() test patterns are introduced.
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The PR title accurately summarizes the main change: adding a reusable step-registry and associated test jobs for benchmark-runner to Prow CI.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (6)

ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml (1)
3-6: Using `tag: latest` makes runs non-reproducible.

With `latest`, each CI invocation may pull a different image, which can cause hard-to-diagnose flakes and makes failures difficult to reproduce. Consider using an ImageStreamTag pinned to a specific version or a digest, and bumping it deliberately.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml`
around lines 3 - 6, The from_image entry currently uses an unpinned tag ("tag:
latest") which makes CI runs non-reproducible; update the from_image block (the
from_image key and its nested tag field) to reference a pinned image (either a
specific ImageStreamTag like "name: benchmark-runner tag: v1.2.3" or an
immutable digest) and document the version bump process so the benchmark-runner
image is updated deliberately rather than using "latest".
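One way to keep a floating tag from creeping back in is a small lint check over the ref file. The sketch below is an assumption, not part of this PR: the `uses_floating_tag` helper name is hypothetical, and it only does a plain text match for a `tag: latest` line rather than parsing the YAML.

```shell
#!/usr/bin/env bash
# Hypothetical lint helper (not part of the PR): returns 0 when the given file
# contains a line that is exactly "tag: latest", ignoring surrounding whitespace.
uses_floating_tag() {
  local file="$1"
  grep -Eq '^[[:space:]]*tag:[[:space:]]*latest[[:space:]]*$' "$file"
}
```

A caller could run `uses_floating_tag path/to/ref.yaml && echo "pin the image tag" >&2` in a local pre-commit hook; a quoted value such as `tag: "latest"` would need a broader pattern.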
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh (3)
32-41: Trap function is invoked via trap; shellcheck SC2329 is a false positive.

benchmark_runner_debug is registered on line 41 via trap ... EXIT, so shellcheck's "never invoked" info is a false positive and safe to ignore. Optionally add # shellcheck disable=SC2329 above the function to silence it.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`
around lines 32 - 41, Add a shellcheck disable for SC2329 above the
benchmark_runner_debug function to silence the false-positive "never invoked"
warning because the function is registered by trap EXIT; specifically place a
comment like "# shellcheck disable=SC2329" immediately before the
benchmark_runner_debug() definition (the function referenced by trap
benchmark_runner_debug EXIT) so the linter knows the invocation via trap is
intentional.
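The pattern described above can be sketched as follows. This is a minimal illustration, not the script from the PR: the handler body and the `run_with_exit_trap` wrapper are made up so the example can run on its own, and only the function name `benchmark_runner_debug` and the `trap ... EXIT` registration come from the review comment.

```shell
#!/usr/bin/env bash
# Illustrative wrapper (hypothetical): the parentheses make the body a subshell,
# so the EXIT trap fires when the wrapper finishes rather than at script exit.
run_with_exit_trap() (
  # The handler is only reached through the trap below, which ShellCheck may
  # report as "never invoked"; the directive documents that this is intended.
  # shellcheck disable=SC2329
  benchmark_runner_debug() {
    echo "debug handler ran (status $?)"
  }
  trap benchmark_runner_debug EXIT

  echo "main body finished"
)
```

Calling `run_with_exit_trap` prints the main-body line first and the handler line afterwards, which shows that the function is invoked even though no line calls it by name.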
9-12: KUBEADMIN_PASSWORD is exported but unused by this script.

If main.py reads it from the environment, this is fine — otherwise it's dead code. Worth a brief inline comment noting which consumer needs it, to avoid accidental removal later.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`
around lines 9 - 12, The script exports KUBEADMIN_PASSWORD from SHARED_DIR if
present but doesn't use it itself; either add a brief inline comment stating
which consumer reads it (e.g., main.py reads KUBEADMIN_PASSWORD from the
environment) or remove the export if truly unused. Locate the export of
KUBEADMIN_PASSWORD and add the comment referencing main.py (or adjust the code
to pass the password directly to the consumer) so future maintainers know why
the environment variable is set.
44-44: Silencing all errors on namespace create also hides real failures.

2>/dev/null || true will mask genuine problems (RBAC denied, API server unreachable, quota, etc.) and the subsequent oc/python calls will just fail opaquely. Prefer an idempotent apply or explicit "AlreadyExists" filtering:
♻️ Proposed refactor
-oc create namespace benchmark-runner 2>/dev/null || true
+oc get namespace benchmark-runner >/dev/null 2>&1 || \
+  oc create namespace benchmark-runner
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`
at line 44, The current line "oc create namespace benchmark-runner 2>/dev/null
|| true" silences all errors and hides real failures; replace it with an
idempotent check/create pattern or an apply-based approach so genuine errors
surface — e.g., first check for existence with an "oc get namespace
benchmark-runner" and only call "oc create namespace benchmark-runner" if it
does not exist, or use "oc apply" with a namespace manifest; update the script
where that exact command appears so RBAC, connectivity, quota and API errors are
not suppressed.
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.metadata.json (1)
19-19: Add trailing newline.

File lacks a newline at EOF, which is a common lint nit in this repo.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.metadata.json`
at line 19, The file ends with a closing brace "}" but is missing a trailing
newline; add a single newline character at end-of-file in
redhat-performance-benchmark-runner-ref.metadata.json so the file terminates
with a newline (i.e., ensure the file ends with "\n" after the final "}" to
satisfy repo lint rules).
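A quick local check for a missing final newline could look like the sketch below. The `ends_with_newline` helper is hypothetical and relies on the fact that command substitution drops a trailing newline, so the captured last byte is empty exactly when that byte is a newline.

```shell
#!/usr/bin/env bash
# Hypothetical helper: returns 0 when the file is non-empty and its last byte
# is a newline, non-zero otherwise.
ends_with_newline() {
  local file="$1"
  # tail -c 1 prints the final byte; "$(...)" strips it if it is "\n".
  [ -s "$file" ] && [ -z "$(tail -c 1 "$file")" ]
}
```

For example, `ends_with_newline redhat-performance-benchmark-runner-ref.metadata.json || echo "missing newline at EOF"` would flag the file named in this comment if it still lacks the newline.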
ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml (1)
13-87: Heavy duplication across the five test entries.

The five test blocks differ only in as and WORKLOAD; every other field (cluster_profile, BASE_DOMAIN, COMPUTE_NODE_*, USER_TAGS, ref, workflow) is identical. ci-operator config doesn't support YAML anchors in a way that survives prowgen round-trips, so this is accepted repetition — just flagging that any future change (e.g. instance type bump or BASE_DOMAIN rename) must be applied in five places. Consider a follow-up to parameterize via the step registry chain if this grows further.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml`
around lines 13 - 87, The five nearly identical test steps (as:
benchmark-runner-step-stressng-pod, benchmark-runner-step-uperf-pod,
benchmark-runner-step-hammerdb-pod-mariadb-ephemeral,
benchmark-runner-step-hammerdb-pod-postgres-ephemeral,
benchmark-runner-step-hammerdb-pod-mssql-ephemeral) only differ by the as name
and the WORKLOAD env value; to fix, consolidate into a single reusable
parameterized step in the step registry (or a shared template) that defines
cluster_profile, BASE_DOMAIN, COMPUTE_NODE_REPLICAS/TYPE,
CONTROL_PLANE_INSTANCE_TYPE, USER_TAGS, test ref:
redhat-performance-benchmark-runner and workflow: ipi-aws, and then create
minimal step entries that reference that template while supplying the unique
WORKLOAD and as values; update references to use the template variable for
WORKLOAD so future changes (e.g., BASE_DOMAIN or instance types) are made in one
place.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`:
- Around line 48-56: Because the script runs under set -e, a non-zero exit from
the python invocation aborts the script before rc=$? and the ARTIFACT_DIR
collection run; change the python3.14 /benchmark_runner/main/main.py invocation
so its exit code is captured without triggering the exit (for example, initialize
rc=0 and append "|| rc=$?" to the command; appending "|| true" alone would always
leave rc at 0), then set SCRIPT_EXIT_CODE=$rc and run the existing
ARTIFACT_DIR/benchmark-runner-debug collection block; reference the python3.14
/benchmark_runner/main/main.py invocation, the rc and SCRIPT_EXIT_CODE
assignments, and ARTIFACT_DIR/benchmark-runner-debug in your change so the
artifact-gathering always executes even when the Python process fails.
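
The exit-code capture described in this comment can be sketched as a small wrapper. This is an illustration under stated assumptions, not the script's actual code: the `run_and_collect` name and the `status.txt` file are hypothetical, and the real collection block would replace the single `echo`.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper: run a command, keep its exit status without letting
# `set -e` abort the caller, always run the collection step, then return the
# original status.
run_and_collect() {
  local artifact_dir="$1"
  shift
  local rc=0
  # `|| rc=$?` records the failure; the list as a whole succeeds, so errexit
  # does not fire here.
  "$@" || rc=$?
  mkdir -p "${artifact_dir}/benchmark-runner-debug"
  echo "exit_code=${rc}" > "${artifact_dir}/benchmark-runner-debug/status.txt"
  return "${rc}"
}
```

In the step script this might be used as `rc=0; run_and_collect "${ARTIFACT_DIR}" python3.14 /benchmark_runner/main/main.py || rc=$?`, followed by `SCRIPT_EXIT_CODE=$rc`.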

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository: openshift/coderabbit/.coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: dec56fd0-467b-4576-9660-e5eba8ab30b8

📥 Commits

Reviewing files that changed from the base of the PR and between 0ce7c54 and b60069f.

📒 Files selected for processing (6)

ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml
ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml
ci-operator/step-registry/redhat-performance/benchmark-runner/OWNERS
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.metadata.json
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml

coderabbitai

🧹 Nitpick comments (2)

ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh (2)
43-44: Namespace creation runs even when KUBECONFIG wasn't set.

If /var/run/secrets/ci.openshift.io/multi-stage/kubeconfig isn't present (lines 15–18), KUBECONFIG is never exported and this oc create namespace silently fails against no cluster (swallowed by || true). The subsequent Python run will then also be misconfigured, and failures will be harder to diagnose. Consider failing fast when no kubeconfig is found, or at minimum logging a clear warning:
♻️ Proposed refactor
 if [[ -f /var/run/secrets/ci.openshift.io/multi-stage/kubeconfig ]]; then
   cp /var/run/secrets/ci.openshift.io/multi-stage/kubeconfig /tmp/kubeconfig
   export KUBECONFIG=/tmp/kubeconfig
+else
+  echo "ERROR: no multi-stage kubeconfig found; benchmark-runner needs a test cluster" >&2
+  exit 1
 fi
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`
around lines 43 - 44, The script currently runs oc create namespace
benchmark-runner even when KUBECONFIG was never exported (missing
/var/run/secrets/ci.openshift.io/multi-stage/kubeconfig), which hides failures;
update the bootstrap logic to check for the kubeconfig file and either export
KUBECONFIG or fail fast with a clear error before running oc create namespace
(or at minimum emit a clear warning referencing KUBECONFIG and
/var/run/secrets/ci.openshift.io/multi-stage/kubeconfig); ensure the check is
placed before the oc create namespace invocation so the script exits non‑zero or
logs an explicit message when the kubeconfig is absent.
24-27: Strip trailing newline when loading BASE_DOMAIN.

cat /secret/base_domain preserves any trailing newline in the secret file, which can silently break downstream consumers that concatenate BASE_DOMAIN into hostnames/URLs. Prefer a read that trims it:
♻️ Proposed refactor
-  if [[ -s /secret/base_domain ]]; then
-    BASE_DOMAIN=$(cat /secret/base_domain)
-    export BASE_DOMAIN
-  fi
+  if [[ -s /secret/base_domain ]]; then
+    BASE_DOMAIN=$(<"/secret/base_domain")
+    BASE_DOMAIN="${BASE_DOMAIN%$'\n'}"
+    export BASE_DOMAIN
+  fi
The same concern applies to KUBEADMIN_PASSWORD at line 10 if any consumer does literal string matching.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`
around lines 24 - 27, Replace the raw cat assignments that preserve trailing
newlines by reading the secrets with a newline-trimming read or equivalent so
variables don't include a trailing newline; specifically change the BASE_DOMAIN
assignment to use a newline-trimming read (e.g. read -r BASE_DOMAIN <
/secret/base_domain) and export BASE_DOMAIN, and apply the same pattern to
KUBEADMIN_PASSWORD (read -r KUBEADMIN_PASSWORD < /secret/kubeadmin_password) to
ensure neither variable contains a trailing newline before they are used.
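
When verifying this finding, it may help to check how bash treats trailing newlines in each style of read. The sketch below uses a throwaway temp file and the made-up value `example.com`, both assumptions for illustration. In bash, command substitution drops trailing newlines whether the file is read with `cat` or with `$(<file)`; other trailing whitespace, such as a carriage return, would be kept.

```shell
#!/usr/bin/env bash
# Write a sample value followed by two newlines to a temporary file.
secret_file="$(mktemp)"
printf 'example.com\n\n' > "$secret_file"

# Command substitution strips all trailing newlines in both forms.
via_cat="$(cat "$secret_file")"
via_redirect="$(<"$secret_file")"

# read -r takes the first line only. It returns non-zero when the last line has
# no newline, so `|| true` keeps a script running under `set -e` from aborting.
IFS= read -r via_read < "$secret_file" || true

printf 'cat=[%s] redirect=[%s] read=[%s]\n' "$via_cat" "$via_redirect" "$via_read"
rm -f "$secret_file"
```

All three variables hold `example.com` with no trailing newline, so the explicit `%$'\n'` trim is only needed if the value is obtained some other way.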

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository: openshift/coderabbit/.coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: 6e7f3c26-2231-4470-ae44-29da587b6edb

📥 Commits

Reviewing files that changed from the base of the PR and between b60069f and 4c31423.

📒 Files selected for processing (6)

ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml
ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml
ci-operator/step-registry/redhat-performance/benchmark-runner/OWNERS
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.metadata.json
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml

✅ Files skipped from review due to trivial changes (4)

ci-operator/step-registry/redhat-performance/benchmark-runner/OWNERS
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.metadata.json
ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml

🚧 Files skipped from review as they are similar to previous changes (1)

ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml

coderabbitai

🧹 Nitpick comments (2)

ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh (2)
23-33: /secret/base_domain silently overrides the step-level BASE_DOMAIN env.

The test step in redhat-performance-benchmark-runner-main__test-step.yaml already sets BASE_DOMAIN: rhdh-perfscale.devcluster.openshift.com, and the ref mounts the credentials secret at /secret. If the secret happens to contain a base_domain key, this block will transparently replace the intended value with no log line indicating the override, which can make misconfigurations hard to diagnose. Consider (a) only setting it when unset (: "${BASE_DOMAIN:=$(<"/secret/base_domain")}"), or (b) logging which source won.

Also note: line 29's BASE_DOMAIN="${BASE_DOMAIN%$'\n'}" is redundant, because the $(<file) command substitution on line 28 already strips trailing newlines.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`
around lines 23 - 33, The current block unconditionally replaces BASE_DOMAIN
from /secret/base_domain which can silently override a step-level value; change
the logic in the vault-secret handling so you only set BASE_DOMAIN if it is not
already set (use a conditional default assignment for BASE_DOMAIN when empty) or
explicitly log which source wins before overriding, and remove the redundant
newline-trimming operation (the BASE_DOMAIN="${BASE_DOMAIN%$'\n'}" line) because
command substitution / file redirection already strips trailing newlines; update
references to BASE_DOMAIN and the /secret/base_domain read to reflect this safe
behavior and add a clear log line when the secret provides the value.
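
Both options in this comment, the fallback and the log line, can be combined in one helper. The sketch below is a hypothetical illustration: the `resolve_base_domain` name is not in the PR, and it logs only which source won rather than the value, on the assumption that secret contents should stay out of CI logs.

```shell
#!/usr/bin/env bash
# Hypothetical helper: prefer a BASE_DOMAIN already set by the step env and
# fall back to the secret file only when it is unset or empty.
resolve_base_domain() {
  local secret_file="$1"
  if [[ -n "${BASE_DOMAIN:-}" ]]; then
    echo "BASE_DOMAIN: using value from step env" >&2
  elif [[ -s "$secret_file" ]]; then
    # $(<file) already strips trailing newlines.
    BASE_DOMAIN="$(<"$secret_file")"
    echo "BASE_DOMAIN: using value from ${secret_file}" >&2
  else
    echo "BASE_DOMAIN: not set and ${secret_file} is missing or empty" >&2
    return 1
  fi
  export BASE_DOMAIN
}
```

The script would call `resolve_base_domain /secret/base_domain` once, before anything reads `BASE_DOMAIN`.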
48-48: Namespace creation masks all errors, not just AlreadyExists.

oc create namespace benchmark-runner 2>/dev/null || true swallows legitimate failures (RBAC denial, API server unreachable, etc.), so the script will proceed to run Python even when it can't talk to the cluster. Prefer an idempotent apply, or at least only suppress the AlreadyExists case:
♻️ Suggested change
-# Ensure benchmark-runner namespace exists on the TEST cluster
-oc create namespace benchmark-runner 2>/dev/null || true
+# Ensure benchmark-runner namespace exists on the TEST cluster (idempotent)
+oc apply -f - <<EOF
+apiVersion: v1
+kind: Namespace
+metadata:
+  name: benchmark-runner
+EOF
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`
at line 48, The current line `oc create namespace benchmark-runner 2>/dev/null
|| true` hides all errors; replace it with an idempotent apply or only suppress
the AlreadyExists case. For example, replace the command with an apply using a
namespace manifest piped to oc (e.g., `cat <<EOF | oc apply -f -` with
apiVersion: v1, kind: Namespace, metadata.name: benchmark-runner EOF) or, if
keeping create, capture the command's output and exit status and treat only an
AlreadyExists message as success. Avoid piping through `grep -v 'AlreadyExists'
|| true`: that prints other errors but still discards the failing exit status,
so the script would continue after an RBAC or network error. Ensure the change
updates the existing `oc create namespace benchmark-runner` invocation.
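
If the script keeps `oc create`, suppressing only the AlreadyExists case might look like the sketch below. It is hypothetical: the `create_namespace_if_absent` name is made up, and it assumes the CLI's error text contains the string `AlreadyExists`, as the filter suggested in this comment also does.

```shell
#!/usr/bin/env bash
# Hypothetical helper: create a namespace and treat only an "AlreadyExists"
# response as success. Any other failure is printed and its status returned.
create_namespace_if_absent() {
  local ns="$1" out rc=0
  # Capture stdout and stderr together; `|| rc=$?` keeps the exit status.
  out="$(oc create namespace "$ns" 2>&1)" || rc=$?
  if [[ "$rc" -eq 0 || "$out" == *AlreadyExists* ]]; then
    return 0
  fi
  echo "$out" >&2
  return "$rc"
}
```

Unlike a `grep -v ... || true` pipeline, this returns non-zero on RBAC or connectivity errors, so a script running under `set -e` stops before the Python run.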

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository: openshift/coderabbit/.coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: 32b5c075-791f-4b07-92cc-ec7a880c0fde

📥 Commits

Reviewing files that changed from the base of the PR and between 4c31423 and 78fc331.

📒 Files selected for processing (6)

ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml
ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml
ci-operator/step-registry/redhat-performance/benchmark-runner/OWNERS
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.metadata.json
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml

✅ Files skipped from review due to trivial changes (3)

ci-operator/step-registry/redhat-performance/benchmark-runner/OWNERS
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.metadata.json
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml

🚧 Files skipped from review as they are similar to previous changes (2)

ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml
ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml

arpsharm · 2026-04-20T11:39:38Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-benchmark-runner-step-hammerdb-pod-mariadb-ephemeral

openshift-merge-bot · 2026-04-20T11:39:41Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

openshift-merge-bot · 2026-04-20T11:39:41Z

@arpsharm: needs-ok-to-test label found, no rehearsals will be run

ebattat · 2026-04-20T11:48:20Z

/ok-to-test

arpsharm · 2026-04-20T11:53:00Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-benchmark-runner-step-hammerdb-pod-mariadb-ephemeral

openshift-merge-bot · 2026-04-20T11:53:04Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

ebattat

/approved

openshift-merge-bot · 2026-04-20T13:28:28Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

openshift-merge-bot · 2026-04-20T14:42:02Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

arpsharm · 2026-04-20T14:48:27Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-benchmark-runner-step-hammerdb-pod-postgres-ephemeral

openshift-merge-bot · 2026-04-20T14:48:30Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh (1)
27-30: Avoid silently overriding the job’s BASE_DOMAIN.

Line 27 makes /secret/base_domain take precedence over the explicit BASE_DOMAIN set by the test-step config, so a stale secret can redirect the runner to the wrong cluster domain. Prefer using the secret only as a fallback, or fail on mismatch if it must be authoritative.
Proposed fallback behavior
-  if [[ -s /secret/base_domain ]]; then
+  if [[ -z "${BASE_DOMAIN:-}" && -s /secret/base_domain ]]; then
     BASE_DOMAIN=$(<"/secret/base_domain")
     BASE_DOMAIN="${BASE_DOMAIN%$'\n'}"
     export BASE_DOMAIN
   fi
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`
around lines 27 - 30, The script currently always overrides BASE_DOMAIN from
/secret/base_domain which can silently redirect jobs; change the logic in the
block that reads /secret/base_domain so the secret is used only as a fallback or
validated: if BASE_DOMAIN is already set and non-empty (the value from test-step
config), read /secret/base_domain and if it differs fail early with a clear
error (or log and exit), otherwise leave the existing BASE_DOMAIN intact; if
BASE_DOMAIN is unset/empty, read the secret and export it as before. Ensure this
behavior is applied where BASE_DOMAIN is referenced in the script (the existing
if [[ -s /secret/base_domain ]] block).
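The fallback-or-validate behavior described above could look roughly like the following. This is a sketch, not the PR's code: the function name `resolve_base_domain` and the secret-path parameter are hypothetical, and the error messages deliberately avoid printing either domain value.

```shell
#!/usr/bin/env bash
# Hypothetical helper: prefer an explicit BASE_DOMAIN, fall back to the secret,
# and fail when both are present but disagree. The secret path is a parameter
# so the logic can be exercised without a real /secret mount.
resolve_base_domain() {
  local secret_file="${1:-/secret/base_domain}" from_secret=""
  if [[ -s "${secret_file}" ]]; then
    # Command substitution already drops trailing newlines.
    from_secret="$(<"${secret_file}")"
  fi
  if [[ -n "${BASE_DOMAIN:-}" ]]; then
    if [[ -n "${from_secret}" && "${from_secret}" != "${BASE_DOMAIN}" ]]; then
      echo "ERROR: BASE_DOMAIN differs from the value in the mounted secret" >&2
      return 1
    fi
  elif [[ -n "${from_secret}" ]]; then
    BASE_DOMAIN="${from_secret}"
  else
    echo "ERROR: BASE_DOMAIN is not set and no secret value is available" >&2
    return 1
  fi
  export BASE_DOMAIN
}
```

Whether a mismatch should fail the job or only be logged is a policy choice for the job owners; the sketch takes the stricter option.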

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In
`@ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml`:
- Around line 103-141: The pod specs mount /secrets/gcs (volumeMount with
mountPath /secrets/gcs and name: gcs-credentials) but the volumes: list is
missing a corresponding volume entry; add a volumes entry named
"gcs-credentials" that references the correct Kubernetes secret (e.g.
secretName: gcs-credentials or the repo's GCS publisher secret) and, if needed,
include an items mapping to ensure service-account.json is present; apply the
same addition to every affected job's volumes block that has the /secrets/gcs
mount.

---

Nitpick comments:
In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`:
- Around line 27-30: The script currently always overrides BASE_DOMAIN from
/secret/base_domain which can silently redirect jobs; change the logic in the
block that reads /secret/base_domain so the secret is used only as a fallback or
validated: if BASE_DOMAIN is already set and non-empty (the value from test-step
config), read /secret/base_domain and if it differs fail early with a clear
error (or log and exit), otherwise leave the existing BASE_DOMAIN intact; if
BASE_DOMAIN is unset/empty, read the secret and export it as before. Ensure this
behavior is applied where BASE_DOMAIN is referenced in the script (the existing
if [[ -s /secret/base_domain ]] block).

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository: openshift/coderabbit/.coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: 5fc333b6-5f28-407f-8df5-b78dcc7e64cf

📥 Commits

Reviewing files that changed from the base of the PR and between 78fc331 and 7814328.

📒 Files selected for processing (6)

ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml
ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml
ci-operator/step-registry/redhat-performance/benchmark-runner/OWNERS
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.metadata.json
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml

✅ Files skipped from review due to trivial changes (3)

ci-operator/step-registry/redhat-performance/benchmark-runner/OWNERS
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.metadata.json
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml

🚧 Files skipped from review as they are similar to previous changes (1)

ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml

coderabbitai · 2026-04-21T12:30:15Z

+        volumeMounts:
+        - mountPath: /etc/boskos
+          name: boskos
+          readOnly: true
+        - mountPath: /secrets/ci-pull-credentials
+          name: ci-pull-credentials
+          readOnly: true
+        - mountPath: /secrets/gcs
+          name: gcs-credentials
+          readOnly: true
+        - mountPath: /secrets/manifest-tool
+          name: manifest-tool-local-pusher
+          readOnly: true
+        - mountPath: /etc/pull-secret
+          name: pull-secret
+          readOnly: true
+        - mountPath: /etc/report
+          name: result-aggregator
+          readOnly: true
+      serviceAccountName: ci-operator
+      volumes:
+      - name: boskos
+        secret:
+          items:
+          - key: credentials
+            path: credentials
+          secretName: boskos-credentials
+      - name: ci-pull-credentials
+        secret:
+          secretName: ci-pull-credentials
+      - name: manifest-tool-local-pusher
+        secret:
+          secretName: manifest-tool-local-pusher
+      - name: pull-secret
+        secret:
+          secretName: registry-pull-credentials
+      - name: result-aggregator
+        secret:
+          secretName: result-aggregator


⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
set -euo pipefail

file="ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml"

python - <<'PY'
from pathlib import Path

path = Path("ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml")
jobs = []
current = None
pending_mount = False
in_volumes = False

for line in path.read_text().splitlines():
    if line.startswith(" - agent: kubernetes"):
        if current:
            jobs.append(current)
        current = {"name": "<unknown>", "mounts": set(), "volumes": set()}
        pending_mount = False
        in_volumes = False
        continue
    if current is None:
        continue
    if line.startswith(" name: pull-ci-"):
        current["name"] = line.split(":", 1)[1].strip()
    if line.startswith(" volumes:"):
        in_volumes = True
        pending_mount = False
        continue
    if line.startswith(" - mountPath:"):
        pending_mount = True
        continue
    if pending_mount and line.startswith(" name:"):
        current["mounts"].add(line.split(":", 1)[1].strip().strip('"'))
        pending_mount = False
        continue
    if in_volumes:
        if line.startswith(" - name:"):
            current["volumes"].add(line.split(":", 1)[1].strip().strip('"'))
        elif line.startswith(" trigger:"):
            in_volumes = False

if current:
    jobs.append(current)

failed = False
for job in jobs:
    missing = sorted(job["mounts"] - job["volumes"])
    for name in missing:
        print(f"{job['name']}: missing volume for volumeMount {name!r}")
        failed = True

raise SystemExit(1 if failed else 0)
PY

Repository: openshift/release

Length of output: 1219

Add the missing gcs-credentials volume to all affected jobs.

Each of the eight new jobs mounts `name: gcs-credentials` and passes `--gcs-upload-secret=/secrets/gcs/service-account.json`, but the `volumes` list never defines `gcs-credentials`. Kubernetes will reject these pod specs before the jobs start.

Proposed fix pattern

Apply this to each affected job's volumes: block:

       volumes:
+      - name: gcs-credentials
+        secret:
+          secretName: gce-sa-credentials-gcs-publisher
       - name: boskos
         secret:
           items:

Verify the exact secret name against the repo's generated ci-operator jobs if this repo uses a different GCS publisher secret.

Also applies to: 188-226, 273-311, 358-396, 443-481, 528-566, 613-651

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml` around lines 103 - 141, The pod specs mount /secrets/gcs (volumeMount with mountPath /secrets/gcs and name: gcs-credentials) but the volumes: list is missing a corresponding volume entry; add a volumes entry named "gcs-credentials" that references the correct Kubernetes secret (e.g. secretName: gcs-credentials or the repo's GCS publisher secret) and, if needed, include an items mapping to ensure service-account.json is present; apply the same addition to every affected job's volumes block that has the /secrets/gcs mount.

arpsharm · 2026-04-21T12:37:12Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-benchmark-runner-step-stressng-vm

openshift-merge-bot · 2026-04-21T12:37:15Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

openshift-merge-bot · 2026-05-21T13:34:05Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

arpsharm · 2026-05-21T16:50:55Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-benchmark-runner-all

openshift-merge-bot · 2026-05-21T16:50:58Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

arpsharm · 2026-05-24T03:17:15Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-benchmark-runner-uperf-vm

openshift-merge-bot · 2026-05-24T03:17:18Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

arpsharm · 2026-05-24T04:10:39Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-benchmark-runner-uperf-vm

openshift-merge-bot · 2026-05-24T04:10:41Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

arpsharm · 2026-05-24T06:41:33Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-benchmark-runner-sysbench-pod

openshift-merge-bot · 2026-05-24T06:41:35Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

arpsharm · 2026-05-24T07:29:56Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-sysbench-pod

openshift-merge-bot · 2026-05-24T07:29:58Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In
`@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh`:
- Line 27: The log prints the bastion endpoint (${BASTION_ADDRESS}) which
exposes sensitive infrastructure details; update the echo at the SOCKS proxy
startup (the line that currently says "API not directly reachable, starting
SOCKS proxy through ${BASTION_ADDRESS}") to a generic message without the
variable (e.g., "API not directly reachable, starting SOCKS proxy") so no
bastion address or other secrets are output to logs; ensure any other uses of
BASTION_ADDRESS in this script are not echoed or logged either.
- Around line 30-32: Guard reading the secret file "elasticsearch_port" before
using it in the SOCKS fallback path: check that the secret file exists and is
readable (e.g., test -r "/secret/elasticsearch_port") and only then read into
ES_PORT and strip the newline; if the file is missing or unreadable, skip the
ssh -D/ -L proxy setup that uses ES_PORT (or fall back to a sensible
default/explicit error), ensuring the ssh invocation with SSH_ARGS,
BASTION_ADDRESS and SOCKS_PORT is only executed when ES_PORT is successfully
set.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository YAML (base), Central YAML (inherited)

Review profile: CHILL

Plan: Enterprise

Run ID: a6f644b0-f855-4a44-9cc0-a46c15c83dc5

📥 Commits

Reviewing files that changed from the base of the PR and between 591e992 and 660e69d.

⛔ Files ignored due to path filters (1)

ci-operator/jobs/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main-presubmits.yaml is excluded by !ci-operator/jobs/**

📒 Files selected for processing (3)

ci-operator/config/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-main__test-step.yaml
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh
ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-ref.yaml

coderabbitai · 2026-05-24T07:31:13Z

+    cp /secret/jh_priv_ssh_key /tmp/bastion_key
+    chmod 600 /tmp/bastion_key
+    if ! curl -sk --max-time 5 "${CLUSTER_SERVER}/version" &>/dev/null; then
+      echo "API not directly reachable, starting SOCKS proxy through ${BASTION_ADDRESS}"


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Avoid printing bastion endpoint details in logs.

Line 27 logs ${BASTION_ADDRESS}. Please keep the message generic to avoid exposing infrastructure endpoint details.

🔧 Proposed fix

-      echo "API not directly reachable, starting SOCKS proxy through ${BASTION_ADDRESS}"
+      echo "API not directly reachable, starting SOCKS proxy through bastion host"

As per coding guidelines, "Protect sensitive information in step registry scripts - never echo or print passwords, tokens, API keys, cluster URLs, or kubeconfig contents".

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

-      echo "API not directly reachable, starting SOCKS proxy through ${BASTION_ADDRESS}"
+      echo "API not directly reachable, starting SOCKS proxy through bastion host"

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh` at line 27, The log prints the bastion endpoint (${BASTION_ADDRESS}) which exposes sensitive infrastructure details; update the echo at the SOCKS proxy startup (the line that currently says "API not directly reachable, starting SOCKS proxy through ${BASTION_ADDRESS}") to a generic message without the variable (e.g., "API not directly reachable, starting SOCKS proxy") so no bastion address or other secrets are output to logs; ensure any other uses of BASTION_ADDRESS in this script are not echoed or logged either.

coderabbitai · 2026-05-24T07:31:14Z

+      ES_PORT=$(<"/secret/elasticsearch_port")
+      ES_PORT="${ES_PORT%$'\n'}"
+      ssh ${SSH_ARGS} root@"${BASTION_ADDRESS}" -fNT -D "${SOCKS_PORT}" -L "127.0.0.1:${ES_PORT}:localhost:${ES_PORT}"


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Guard elasticsearch_port before reading it in the SOCKS fallback path.

At Line 30, the file is read unconditionally; with set -e, a missing secret exits immediately and obscures why proxy setup failed.

🔧 Proposed fix

-      ES_PORT=$(<"/secret/elasticsearch_port")
-      ES_PORT="${ES_PORT%$'\n'}"
+      if [[ ! -s /secret/elasticsearch_port ]]; then
+        echo "ERROR: missing /secret/elasticsearch_port for SOCKS proxy setup" >&2
+        exit 1
+      fi
+      ES_PORT=$(<"/secret/elasticsearch_port")
+      ES_PORT="${ES_PORT%$'\n'}"
       ssh ${SSH_ARGS} root@"${BASTION_ADDRESS}" -fNT -D "${SOCKS_PORT}" -L "127.0.0.1:${ES_PORT}:localhost:${ES_PORT}"

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

-      ES_PORT=$(<"/secret/elasticsearch_port")
-      ES_PORT="${ES_PORT%$'\n'}"
-      ssh ${SSH_ARGS} root@"${BASTION_ADDRESS}" -fNT -D "${SOCKS_PORT}" -L "127.0.0.1:${ES_PORT}:localhost:${ES_PORT}"
+      if [[ ! -s /secret/elasticsearch_port ]]; then
+        echo "ERROR: missing /secret/elasticsearch_port for SOCKS proxy setup" >&2
+        exit 1
+      fi
+      ES_PORT=$(<"/secret/elasticsearch_port")
+      ES_PORT="${ES_PORT%$'\n'}"
+      ssh ${SSH_ARGS} root@"${BASTION_ADDRESS}" -fNT -D "${SOCKS_PORT}" -L "127.0.0.1:${ES_PORT}:localhost:${ES_PORT}"

🧰 Tools

🪛 Shellcheck (0.11.0)

[info] 32-32: Double quote to prevent globbing and word splitting.

(SC2086)
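SC2086 is usually resolved by holding the options in a bash array and expanding it quoted, rather than quoting a single string (which would pass all options as one argument). The sketch below assumes a hypothetical option set; the real value of `SSH_ARGS` is not shown in this thread, and `build_ssh_args` is an illustrative name.

```shell
#!/usr/bin/env bash
# Hypothetical option set; the actual SSH_ARGS content is not known here.
# Fills the global array ssh_args so each option stays a separate word,
# even when the key path contains spaces.
build_ssh_args() {
  local key_file="$1"
  ssh_args=(
    -i "${key_file}"
    -o StrictHostKeyChecking=no
    -o UserKnownHostsFile=/dev/null
  )
}
```

The call site would then read `ssh "${ssh_args[@]}" root@"${BASTION_ADDRESS}" -fNT -D "${SOCKS_PORT}" ...`, which Shellcheck accepts without a warning.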

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@ci-operator/step-registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh` around lines 30 - 32, Guard reading the secret file "elasticsearch_port" before using it in the SOCKS fallback path: check that the secret file exists and is readable (e.g., test -r "/secret/elasticsearch_port") and only then read into ES_PORT and strip the newline; if the file is missing or unreadable, skip the ssh -D/ -L proxy setup that uses ES_PORT (or fall back to a sensible default/explicit error), ensuring the ssh invocation with SSH_ARGS, BASTION_ADDRESS and SOCKS_PORT is only executed when ES_PORT is successfully set.
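The guard described above can be factored into a small reader so every secret file gets the same check. This is a sketch under assumptions: `read_secret` and `is_valid_port` are hypothetical names, and the port range check is an extra safeguard not requested in the review.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the content of a mounted secret file, or fail with
# a message that names the file (never its content).
read_secret() {
  local path="$1"
  if [[ ! -s "${path}" ]]; then
    echo "ERROR: missing or empty secret file: ${path}" >&2
    return 1
  fi
  # Command substitution strips trailing newlines; printf adds none back.
  printf '%s' "$(<"${path}")"
}

# Hypothetical validation: a forwarded port must be an integer in 1..65535.
is_valid_port() {
  [[ "$1" =~ ^[0-9]+$ ]] && (( 10#$1 >= 1 && 10#$1 <= 65535 ))
}
```

A call such as `ES_PORT="$(read_secret /secret/elasticsearch_port)" || exit 1`, followed by `is_valid_port "${ES_PORT}" || exit 1`, would run before the `ssh` invocation so the proxy is only started with a usable port.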

arpsharm · 2026-05-24T08:03:41Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-uperf-vm

openshift-merge-bot · 2026-05-24T08:03:43Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

openshift-merge-bot · 2026-05-25T05:27:34Z

[REHEARSALNOTIFIER]
@arpsharm: the pj-rehearse plugin accommodates running rehearsal tests for the changes in this PR. Expand 'Interacting with pj-rehearse' for usage details. The following rehearsable tests have been affected by this change:

Test name	Repo	Type	Reason
pull-ci-redhat-performance-benchmark-runner-main-test-step-all	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-bootstorm-vm	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-bootstorm-vm-scale	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-fio-pod	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-fio-pod-scale	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-fio-vm	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-fio-vm-scale	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-pod-mariadb	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-pod-mariadb-lso	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-pod-mssql	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-pod-mssql-lso	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-pod-postgres	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-pod-postgres-lso	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-vm-mariadb	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-vm-mariadb-lso	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-vm-mssql	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-vm-mssql-lso	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-vm-postgres	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-hammerdb-vm-postgres-lso	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-sysbench-pod	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-sysbench-vm	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-uperf-pod	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-uperf-vm	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-vdbench-pod	redhat-performance/benchmark-runner	presubmit	Presubmit changed
pull-ci-redhat-performance-benchmark-runner-main-test-step-vdbench-pod-scale	redhat-performance/benchmark-runner	presubmit	Presubmit changed

A total of 31 jobs have been affected by this change. The above listing is non-exhaustive and limited to 25 jobs.

A full list of affected jobs can be found here

Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse list to get an up-to-date list of affected jobs
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

arpsharm · 2026-05-25T05:27:52Z

/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-sysbench-pod

openshift-merge-bot · 2026-05-25T05:27:54Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

openshift-ci · 2026-05-25T05:58:33Z

@arpsharm: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
ci/rehearse/redhat-performance/benchmark-runner/main/test-step-benchmark-runner-step-stressng-vm	`87f4f9d`	link	unknown	`/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-benchmark-runner-step-stressng-vm`
ci/rehearse/redhat-performance/benchmark-runner/main/test-step-benchmark-runner-sysbench-pod	`591e992`	link	unknown	`/pj-rehearse pull-ci-redhat-performance-benchmark-runner-main-test-step-benchmark-runner-sysbench-pod`

Full PR test history. Your PR dashboard.

ebattat

/approve

ebattat

/lgtm

ebattat

/rehearsals-ack

openshift-merge-bot · 2026-05-25T11:18:55Z

@arpsharm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

jhutar · 2026-05-25T19:05:04Z

/lgtm
/approved

openshift-ci · 2026-05-25T19:05:36Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: arpsharm, ebattat, jhutar

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~ci-operator/config/redhat-performance/benchmark-runner/OWNERS~~ [ebattat]
~~ci-operator/jobs/redhat-performance/benchmark-runner/OWNERS~~ [ebattat]
~~ci-operator/step-registry/redhat-performance/OWNERS~~ [jhutar]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci Bot added the needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. label Apr 17, 2026

openshift-ci Bot requested review from ebattat and pmacik April 17, 2026 12:48

coderabbitai Bot reviewed Apr 17, 2026

Comment thread ...registry/redhat-performance/benchmark-runner/redhat-performance-benchmark-runner-commands.sh Outdated

arpsharm force-pushed the benchmark-runner-prow-ci branch from b60069f to 4c31423 Compare April 17, 2026 18:30

coderabbitai Bot reviewed Apr 17, 2026

arpsharm force-pushed the benchmark-runner-prow-ci branch from 4c31423 to 78fc331 Compare April 17, 2026 19:30

coderabbitai Bot reviewed Apr 17, 2026

openshift-ci Bot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Apr 20, 2026

ebattat approved these changes Apr 20, 2026

openshift-ci Bot assigned ebattat Apr 20, 2026

openshift-ci Bot added the lgtm Indicates that a PR is ready to be merged. label Apr 20, 2026

arpsharm force-pushed the benchmark-runner-prow-ci branch from 78fc331 to 7814328 Compare April 21, 2026 12:23

openshift-ci Bot removed the lgtm Indicates that a PR is ready to be merged. label Apr 21, 2026

coderabbitai Bot reviewed Apr 21, 2026

arpsharm force-pushed the benchmark-runner-prow-ci branch 2 times, most recently from 540c244 to 72214bc Compare May 21, 2026 16:50

arpsharm force-pushed the benchmark-runner-prow-ci branch from 72214bc to 591e992 Compare May 23, 2026 06:34

coderabbitai Bot reviewed May 24, 2026

Add benchmark-runner Prow CI step-registry and test jobs

7f4447f

ebattat approved these changes May 25, 2026

arpsharm mentioned this pull request May 25, 2026

Test: trigger Prow CI redhat-performance/benchmark-runner#1233

Open

coderabbitai Bot mentioned this pull request May 26, 2026

Add PerfCI e2e pipeline for benchmark-runner (stages 01-05) #79707

Open
