[Feature] Add OpenShift monitoring required RBAC in OLM bundle#366
Merged
sajmera-pensando merged 1 commit intoROCm:release-v1.4.1from Oct 24, 2025
Merged
Conversation
Signed-off-by: yansun1996 <Yan.Sun3@amd.com>
leslie-qiwa
pushed a commit
to leslie-qiwa/gpu-operator
that referenced
this pull request
Feb 6, 2026
* Create a docker/shell build container * Bump golang version to 1.23.0 for operator repo * Upgrade golint to 1.63.4 and fix linter errors - Upgrade linter to 1.63.4 as it supports go 1.23 - Fix new linter errors reported by golint 1.63.4 * Build openshift helm charts using kmm main branch - Checkout to the commit in kmm openshift main branch that adds support for go 1.23 * Changes to operator build container * Move to go1.21.13 and KMM release 2.3
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Motivation
Collecting metrics with Prometheus documented the requirements to configure OpenShift cluster monitoring to scrape metrics from other namespace;
label the namespace with
openshift.io/cluster-monitoring: trueCreate the RBAC resources that this PR is trying to add.
Create ServiceMonitor/PodMonitor
Item 1 needs to be documented
Item 2 needs to be added into OLM bundle
Item 3 could already got created by the released GPU Operator
Technical Details
this PR is adding documentation for Item 1 and adding RBAC resources into OLM bundle for Item 2.
Test Plan
Bring up a fresh new OpenShift cluster, follow the updated documentation to install NFD, KMM, AMD GPU Operator. Create a
DeviceConfigto install amdgpu kmod, enable metrics exporter and serviceMonitor. Label the namespace to enable the cluster monitoring on target namespace.Test Result
On the OpenShift web console, the Observe page shows active metrics exporter target and users could query AMD GPU metrics on Observe --- Metrics webpage.
Submission Checklist