🐛 Fix silent error handling in ClusterRole update and sync endpoint by clubanderson · Pull Request #4129 · kubestellar/console

clubanderson · 2026-04-01T13:01:10Z

Summary

ClusterRole update error ignored (ClusterRole Update Error Ignored #4113): The InstallGPUHealthCronJob function discarded the error from ClusterRoles().Update() with _, _ =. If the update failed, the system continued with outdated RBAC rules, which could cause subsequent GPU health job failures. Now both the Get and Update errors are checked and returned.
Sync endpoint reports false success (Sync Endpoint Reports Success Without Implementation #4114): The SyncNow handler returned "synced": true with a 200 status even though the sync logic was never implemented (just a TODO comment). Now returns 501 with "synced": false and an error message. The frontend already handles non-ok responses gracefully.

Test plan

go build ./pkg/... compiles cleanly
go test ./pkg/api/handlers/... passes
go test ./pkg/k8s/... passes
Frontend usePersistence.ts already handles non-ok response from sync endpoint (falls through to return false)

- Check ClusterRole update errors instead of discarding them with `_, _`, ensuring RBAC failures are surfaced before dependent GPU health jobs run (fixes #4113) - Return 501 with synced:false from the sync endpoint instead of falsely reporting success when no sync logic is implemented (fixes #4114) Signed-off-by: Andrew Anderson <andy@clubanderson.com>

kubestellar-prow · 2026-04-01T13:01:16Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign clubanderson for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

netlify · 2026-04-01T13:01:17Z

✅ Deploy Preview for kubestellarconsole ready!

Name	Link
🔨 Latest commit	`97c62d1`
🔍 Latest deploy log	https://app.netlify.com/projects/kubestellarconsole/deploys/69cd171a1a2f2f0008007f25
😎 Deploy Preview	https://deploy-preview-4129.console-deploy-preview.kubestellar.io
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

Copilot

Pull request overview

Fixes two cases where the system reported success despite failing/being unimplemented: ClusterRole updates now propagate errors, and the persistence sync endpoint now returns an honest “not implemented” response instead of a false success.

Changes:

Return errors from ClusterRole Get/Update when Create hits AlreadyExists, preventing silent RBAC drift.
Update SyncNow to return HTTP 501 with synced: false and an error message (instead of 200/true).

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

File	Description
pkg/k8s/client.go	Stops discarding ClusterRole update errors when the role already exists.
pkg/api/handlers/console_persistence.go	Makes `/sync` return 501 + `synced: false` until real sync logic exists.

Copilot · 2026-04-01T13:07:42Z

pkg/k8s/client.go

+			if getErr != nil {
+				return fmt.Errorf("getting existing ClusterRole for update: %w", getErr)
+			}
+			existing.Rules = rules
+			if _, updateErr := client.RbacV1().ClusterRoles().Update(ctx, existing, metav1.UpdateOptions{}); updateErr != nil {
+				return fmt.Errorf("updating ClusterRole: %w", updateErr)


ClusterRole updates can fail with a conflict if the resource was modified between Get and Update. Consider wrapping the update in retry.RetryOnConflict(...) (re-Get, set Rules, then Update) so transient conflicts don’t cause the install to fail unnecessarily.

Copilot · 2026-04-01T13:07:42Z

pkg/k8s/client.go

+				return fmt.Errorf("getting existing ClusterRole for update: %w", getErr)
+			}
+			existing.Rules = rules
+			if _, updateErr := client.RbacV1().ClusterRoles().Update(ctx, existing, metav1.UpdateOptions{}); updateErr != nil {
+				return fmt.Errorf("updating ClusterRole: %w", updateErr)


The wrapped error messages don’t include which ClusterRole is being updated. Including the ClusterRole name (e.g., gpuHealthClusterRole) in both messages will make operational debugging and log triage significantly easier.

Suggested change

return fmt.Errorf("getting existing ClusterRole for update: %w", getErr)

}

existing.Rules = rules

if _, updateErr := client.RbacV1().ClusterRoles().Update(ctx, existing, metav1.UpdateOptions{}); updateErr != nil {

return fmt.Errorf("updating ClusterRole: %w", updateErr)

return fmt.Errorf("getting existing ClusterRole %q for update: %w", gpuHealthClusterRole, getErr)

}

existing.Rules = rules

if _, updateErr := client.RbacV1().ClusterRoles().Update(ctx, existing, metav1.UpdateOptions{}); updateErr != nil {

return fmt.Errorf("updating ClusterRole %q: %w", gpuHealthClusterRole, updateErr)

Copilot · 2026-04-01T13:07:43Z

pkg/k8s/client.go

+				return fmt.Errorf("getting existing ClusterRole for update: %w", getErr)
+			}
+			existing.Rules = rules
+			if _, updateErr := client.RbacV1().ClusterRoles().Update(ctx, existing, metav1.UpdateOptions{}); updateErr != nil {
+				return fmt.Errorf("updating ClusterRole: %w", updateErr)


The wrapped error messages don’t include which ClusterRole is being updated. Including the ClusterRole name (e.g., gpuHealthClusterRole) in both messages will make operational debugging and log triage significantly easier.

Suggested change

return fmt.Errorf("getting existing ClusterRole for update: %w", getErr)

}

existing.Rules = rules

if _, updateErr := client.RbacV1().ClusterRoles().Update(ctx, existing, metav1.UpdateOptions{}); updateErr != nil {

return fmt.Errorf("updating ClusterRole: %w", updateErr)

return fmt.Errorf("getting existing ClusterRole %q for update: %w", gpuHealthClusterRole, getErr)

}

existing.Rules = rules

if _, updateErr := client.RbacV1().ClusterRoles().Update(ctx, existing, metav1.UpdateOptions{}); updateErr != nil {

return fmt.Errorf("updating ClusterRole %q: %w", gpuHealthClusterRole, updateErr)

Copilot · 2026-04-01T13:07:43Z

pkg/api/handlers/console_persistence.go

+	// Sync logic is not yet implemented — return an honest status
+	return c.Status(501).JSON(fiber.Map{
+		"synced":    false,
+		"error":     "sync not implemented",


The error text is quite generic. Consider making it more actionable/consistent with other API errors (e.g., include what to do next or a stable error code like SYNC_NOT_IMPLEMENTED) so clients can handle it predictably without string matching.

Suggested change

// Sync logic is not yet implemented — return an honest status

return c.Status(501).JSON(fiber.Map{

"synced": false,

"error": "sync not implemented",

// Sync logic is not yet implemented — return a clear, machine-readable status

return c.Status(501).JSON(fiber.Map{

"synced": false,

"error": "Sync operation is not implemented for this API endpoint. Please upgrade the console backend to a version that supports /api/persistence/sync.",

"errorCode": "SYNC_NOT_IMPLEMENTED",

github-actions · 2026-04-01T13:12:47Z

👋 Hey @clubanderson — thanks for opening this PR!

🤖 This project is developed exclusively using AI coding assistants.

Please do not attempt to code anything for this project manually.
All contributions should be authored using an AI coding tool such as:

Claude Code (Opus 4.5 / 4.6) — recommended

GitHub Copilot

Cursor

Other AI coding assistants

This ensures consistency in code style, architecture patterns, test coverage,
and commit quality across the entire codebase.

This is an automated message.

github-actions · 2026-04-01T13:14:11Z

Thank you for your contribution! Your PR has been merged.

Check out what's new:

KubeStellar Console — Live multi-cluster dashboard
Marketplace — Community extensions
Knowledge Base — Troubleshooting and how-tos

Stay connected: Slack #kubestellar-dev | Multi-Cluster Survey

clubanderson · 2026-04-01T13:23:07Z

🔄 Auto-Applying Copilot Code Review

Copilot code review found 3 code suggestion(s) and 1 general comment(s).

@copilot Please apply all of the following code review suggestions:

pkg/k8s/client.go (line 2603): return fmt.Errorf("getting existing ClusterRole %q for update: %w", gpuHealthClu...
pkg/k8s/client.go (line 2603): return fmt.Errorf("getting existing ClusterRole %q for update: %w", gpuHealthClu...
pkg/api/handlers/console_persistence.go (line 791): // Sync logic is not yet implemented — return a clear, machine-readable status ...

Also address these general comments:

pkg/k8s/client.go (line 2603): ClusterRole updates can fail with a conflict if the resource was modified between Get and Update. Consider wrapping

Push all fixes in a single commit. Run cd web && npm run build && npm run lint before committing.

Auto-generated by copilot-review-apply workflow.

- Include ClusterRole name in error messages for easier debugging (client.go) - Add machine-readable errorCode to sync endpoint 501 response (console_persistence.go) - Fix test hygiene: check json.Marshal/io.ReadAll errors, close response bodies, explicitly discard hash.Write returns (feedback_test.go) - Add onFocus/onBlur and aria-describedby to InfoTooltip for keyboard and screen-reader accessibility (CardWrapper.tsx) - Reuse h.httpClient with context.WithTimeout instead of creating a new http.Client per screenshot upload (feedback.go) - Scope elevated 20 MB body limit to feedback creation route only via per-route middleware guard (server.go) Signed-off-by: Andrew Anderson <andy@clubanderson.com>

Copilot AI review requested due to automatic review settings April 1, 2026 13:01

kubestellar-prow bot added the dco-signoff: yes Indicates the PR's author has signed the DCO. label Apr 1, 2026

clubanderson merged commit aa66cbe into main Apr 1, 2026
18 of 19 checks passed

kubestellar-prow bot added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label Apr 1, 2026

kubestellar-prow bot deleted the fix/api-error-handling branch April 1, 2026 13:01

Copilot AI reviewed Apr 1, 2026

View reviewed changes

clubanderson assigned Copilot Apr 1, 2026

clubanderson mentioned this pull request Apr 1, 2026

🐛 Address Copilot review comments from PRs #4129, #4130, #4134 #4147

Merged

3 tasks

Copilot started reviewing on behalf of clubanderson April 1, 2026 13:33 View session

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🐛 Fix silent error handling in ClusterRole update and sync endpoint#4129

🐛 Fix silent error handling in ClusterRole update and sync endpoint#4129
clubanderson merged 1 commit intomainfrom
fix/api-error-handling

clubanderson commented Apr 1, 2026

Uh oh!

kubestellar-prow bot commented Apr 1, 2026

Uh oh!

netlify bot commented Apr 1, 2026 •

edited

Loading

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

github-actions bot commented Apr 1, 2026

Uh oh!

github-actions bot commented Apr 1, 2026

Uh oh!

clubanderson commented Apr 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

clubanderson commented Apr 1, 2026

Summary

Test plan

Uh oh!

kubestellar-prow bot commented Apr 1, 2026

Uh oh!

netlify bot commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for kubestellarconsole ready!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Apr 1, 2026

Uh oh!

github-actions bot commented Apr 1, 2026

Uh oh!

clubanderson commented Apr 1, 2026

🔄 Auto-Applying Copilot Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

netlify bot commented Apr 1, 2026 •

edited

Loading