Add tests for horizontal scaling operator behaviors by yrobla · Pull Request #4388 · stacklok/toolhive

yrobla · 2026-03-26T14:26:32Z

Summary

Adds test coverage for the horizontal scaling operator behaviors introduced in
the companion PRs (THV-0047). The existing unit tests were added incrementally
with each implementation task; this PR closes gaps at three levels:

Unit tests (cmd/thv-operator/controllers/):

TestConfigMapContent_SessionStorage: verifies ensureVmcpConfigConfigMap
produces a ConfigMap YAML with sessionStorage populated when
provider=redis, and absent when nil or memory.
TestDeploymentForVirtualMCPServer_WithRedisPassword: verifies
deploymentForVirtualMCPServer injects THV_SESSION_REDIS_PASSWORD as a
SecretKeyRef env var and never as a plaintext value.

Integration tests (envtest — cmd/thv-operator/test-integration/):

MCPServer: ScalingConfig (backend_replicas + session_redis) is written
to the RunConfig ConfigMap when backendReplicas and Redis sessionStorage
are set; omitted when absent.
VirtualMCPServer: spec.replicas=3 is reflected in Deployment.Spec.Replicas;
nil spec.replicas preserves an externally-set replica count across
reconciliations (HPA-compatible contract, gated on status.observedGeneration).

E2E tests (real Kind cluster — test/e2e/thv-operator/virtualmcp/):

virtualmcpserver_scaling_test.go: two ordered contexts covering the full
cluster-level lifecycle:
- Static: replicas=2 at creation → 2 ready pods, SessionStorageWarning
  condition set.
- Scale lifecycle (1→2→1): Deployment replicas update in-place, 2 pods come
  up, SessionStorageWarning fires on scale-up and clears on scale-down.

Fixes #4220

Type of change

Test plan

Unit tests (task test)
Integration tests (task operator-test-integration)
E2E tests (task thv-operator-e2e-test-run)
Linting (task lint-fix)

Does this introduce a user-facing change?

No.

codecov · 2026-03-26T14:32:46Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 69.49%. Comparing base (75bb444) to head (2d549d7).

Additional details and impacted files

@@              Coverage Diff               @@
##           issue-4214    #4388      +/-   ##
==============================================
+ Coverage       69.46%   69.49%   +0.03%     
==============================================
  Files             485      485              
  Lines           49814    49585     -229     
==============================================
- Hits            34604    34460     -144     
+ Misses          12530    12457      -73     
+ Partials         2680     2668      -12

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Copilot

Pull request overview

Adds automated coverage around horizontal scaling behaviors in the ToolHive operator, focusing on VirtualMCPServer session storage configuration and Redis password injection, plus integration/e2e coverage for replica behavior and RunConfig scaling fields.

Changes:

Added VirtualMCPServer e2e scenario coverage for scaling (replicas 1↔2) and SessionStorageWarning condition behavior.
Added envtest integration coverage for VirtualMCPServer replica handling (explicit replicas + nil/HPA-compat behavior) and MCPServer RunConfig scaling config population.
Added controller-level unit tests verifying vMCP ConfigMap sessionStorage YAML content and Redis password SecretKeyRef env var injection.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
test/e2e/thv-operator/virtualmcp/virtualmcpserver_scaling_test.go	New real-cluster e2e coverage for VMCP replica scaling and SessionStorageWarning behavior
cmd/thv-operator/test-integration/virtualmcp/virtualmcpserver_replicas_integration_test.go	New envtest integration checks for VMCP replica passthrough + HPA-compat preservation
cmd/thv-operator/test-integration/mcp-server/mcpserver_runconfig_integration_test.go	Adds integration assertions for RunConfig `ScalingConfig` presence/absence based on spec
cmd/thv-operator/controllers/virtualmcpserver_vmcpconfig_test.go	Adds unit test for ConfigMap YAML `sessionStorage` section population/omission
cmd/thv-operator/controllers/virtualmcpserver_deployment_test.go	Adds unit test ensuring Redis password is injected via `SecretKeyRef` env var (no plaintext)

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

cmd/thv-operator/test-integration/virtualmcp/virtualmcpserver_replicas_integration_test.go

test/e2e/thv-operator/virtualmcp/virtualmcpserver_scaling_test.go

cmd/thv-operator/controllers/virtualmcpserver_deployment_test.go

cmd/thv-operator/controllers/virtualmcpserver_vmcpconfig_test.go

Unit tests (controllers package): - TestConfigMapContent_SessionStorage: verifies that ensureVmcpConfigConfigMap produces a ConfigMap YAML with the sessionStorage section populated when provider=redis, and absent when nil or memory. - TestDeploymentForVirtualMCPServer_WithRedisPassword: verifies that deploymentForVirtualMCPServer injects THV_SESSION_REDIS_PASSWORD as a SecretKeyRef env var and never as a plaintext value. Integration tests (envtest with real CRDs + controller loop): - MCPServer: ScalingConfig (backend_replicas + session_redis) is written to the RunConfig ConfigMap when backendReplicas and Redis sessionStorage are set; omitted when absent. - VirtualMCPServer: spec.replicas is reflected in Deployment.Spec.Replicas; nil replicas produces a nil Deployment replicas field for HPA compatibility. E2E tests (Kind cluster via thv-operator-e2e-test-run): - VirtualMCPServer created with replicas=2 runs 2 ready pods and sets SessionStorageWarning condition when no Redis is configured. - Scale lifecycle (1→2→1): verifies Deployment replicas update in-place, 2 pods come up, SessionStorageWarning fires on scale-up and clears on scale-down. Closes: #4220 Fix ignored AddToScheme errors in unit tests Replace `_ = *.AddToScheme(scheme)` with `require.NoError(t, *.AddToScheme(scheme))` in virtualmcpserver_deployment_test.go and virtualmcpserver_vmcpconfig_test.go so that scheme registration failures cause an immediate, clear test failure rather than a confusing error later with a partially configured scheme. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ation Annotation changes do not increment metadata.generation on Kubernetes resources. The previous trigger used `vmcp.Annotations["test/trigger-reconcile"] = "true"`, which left generation unchanged, so `triggerGeneration = vmcp.Generation + 1` was always one ahead of ObservedGeneration and the Eventually timed out. Fix: trigger via `vmcp.Spec.ServiceType = "ClusterIP"` (the default value, semantically a no-op) which IS a spec change and DOES increment Generation. After the Update, controller-runtime populates the object in-place with the server response, so `vmcp.Generation` is already the post-increment value. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

jerm-dro

question: The tests verify that the infrastructure scales correctly (replica counts, pod readiness, SessionStorageWarning conditions), but they don't send any MCP traffic through the scaled-out vMCP. Is this an intentional gap, or is cross-pod request handling planned for a follow-up?

Without a request hitting a different pod than the one that created the session, we're testing that pods exist but not that horizontal scaling actually works end-to-end. A few approaches that would demonstrate "any pod can handle any request":

Scale to 2 with Redis, create a session via tools/list, delete the pod that handled it, then send tools/call on the same session and verify it succeeds on the surviving pod
Scale to 0 and back to 1, then send a request — proves a completely fresh pod with no in-memory state can serve traffic
Send N requests through the Service and verify responses come back successfully (doesn't prove session affinity, but proves all pods are functional)

No preference on which approach — just looking for something that closes the loop beyond infrastructure assertions.

yrobla requested review from ChrisJBurns, JAORMX, jerm-dro and jhrozek as code owners March 26, 2026 14:26

github-actions bot added the size/M Medium PR: 300-599 lines changed label Mar 26, 2026

yrobla force-pushed the issue-4220 branch from e955cd4 to d14b8f6 Compare March 26, 2026 14:46

github-actions bot added size/L Large PR: 600-999 lines changed and removed size/M Medium PR: 300-599 lines changed labels Mar 26, 2026

yrobla force-pushed the issue-4220 branch from d14b8f6 to 7191d75 Compare March 26, 2026 14:48

github-actions bot added size/L Large PR: 600-999 lines changed and removed size/L Large PR: 600-999 lines changed labels Mar 26, 2026

yrobla force-pushed the issue-4220 branch from 7191d75 to 4e76442 Compare March 26, 2026 16:06

yrobla requested review from Copilot March 26, 2026 16:06

github-actions bot added size/L Large PR: 600-999 lines changed and removed size/L Large PR: 600-999 lines changed labels Mar 26, 2026

Copilot started reviewing on behalf of yrobla March 26, 2026 16:06 View session

Copilot AI reviewed Mar 26, 2026

View reviewed changes

yrobla changed the title ~~Add controller-level unit tests for horizontal scaling behaviors~~ Add tests for horizontal scaling operator behaviors Mar 26, 2026

yrobla force-pushed the issue-4220 branch from 4e76442 to 3cd1c09 Compare March 26, 2026 17:42

github-actions bot added size/L Large PR: 600-999 lines changed and removed size/L Large PR: 600-999 lines changed labels Mar 26, 2026

yrobla force-pushed the issue-4220 branch from 3cd1c09 to 5f411f8 Compare March 26, 2026 17:45

github-actions bot added size/L Large PR: 600-999 lines changed and removed size/L Large PR: 600-999 lines changed labels Mar 26, 2026

jerm-dro approved these changes Mar 26, 2026

View reviewed changes

jerm-dro self-requested a review March 26, 2026 20:46

jerm-dro reviewed Mar 26, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add tests for horizontal scaling operator behaviors#4388

Add tests for horizontal scaling operator behaviors#4388
yrobla wants to merge 2 commits intoissue-4214from
issue-4220

yrobla commented Mar 26, 2026 •

edited

Loading

Uh oh!

codecov bot commented Mar 26, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jerm-dro left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

yrobla commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Type of change

Test plan

Does this introduce a user-facing change?

Uh oh!

codecov bot commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jerm-dro left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

yrobla commented Mar 26, 2026 •

edited

Loading

codecov bot commented Mar 26, 2026 •

edited

Loading