Skip to content

[E2E] Add production-stack profile for E2E testing framework #657

@Xunzhuo

Description

@Xunzhuo

Description

Add a new production-stack profile to the E2E testing framework to test vLLM Production Stack configurations with Semantic Router.

Background

The E2E testing framework introduced in #655 provides an extensible profile-based architecture. We need to add a production-stack profile to test Semantic Router deployment and functionality in production-grade vLLM stack environments.

Tasks

  • Create e2e/profiles/production-stack/ directory structure
  • Implement Profile interface for production-stack
    • Setup: Deploy vLLM production stack components
    • Setup: Deploy Semantic Router with production configurations
    • Setup: Configure high availability and load balancing
    • Setup: Configure monitoring and observability
    • Teardown: Clean up production stack resources
  • Implement test cases:
    • Multi-replica deployment health check
    • Load balancing verification
    • High availability failover testing
    • Performance and throughput testing
    • Resource utilization monitoring
  • Add documentation for production-stack profile usage
  • Update CI workflow to run production-stack tests

Implementation Details

Profile Structure

type Profile struct {
    verbose bool
}

func (p *Profile) Setup(ctx context.Context, opts *framework.SetupOptions) error {
    // 1. Deploy vLLM production stack (multiple replicas, load balancer)
    // 2. Deploy Semantic Router with production settings
    // 3. Configure monitoring (Prometheus, Grafana)
    // 4. Configure high availability settings
}

Test Cases

  1. Multi-Replica Health Check: Verify all replicas are healthy and serving
  2. Load Balancing: Test request distribution across replicas
  3. Failover: Verify graceful failover when a replica fails
  4. Performance: Measure throughput and latency under load
  5. Resource Monitoring: Check CPU, memory, and GPU utilization

Acceptance Criteria

  • Production-stack profile can be run with make e2e-test PROFILE=production-stack
  • All test cases pass successfully
  • Documentation is complete and clear
  • CI integration works correctly
  • Performance benchmarks are documented

References

Related Issues

Part of the E2E testing framework expansion effort.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions