[RayService] Add e2e tests #1167

zcin · 2023-06-14T21:19:49Z

Why are these changes needed?

4 e2e tests for RayService:

deploy 2 applications
deploy then execute in place update
deploy then execute zero downtime rollout
autoscaling

The e2e tests should be run with the following command:

RAY_IMAGE=rayproject/ray:2.5.0 OPERATOR_IMAGE=controller:latest pytest -vs tests/test_sample_rayservice_yamls.py --log-cli-level=INFO

To run a specific test in test_sample_rayservice_yamls.py, specify the test name with -k, e.g:

RAY_IMAGE=rayproject/ray:2.5.0 OPERATOR_IMAGE=controller:latest pytest -vs tests/test_sample_rayservice_yamls.py -k test_service_autoscaling --log-cli-level=INFO

Related issue number

Checks

I've made sure the tests are passing.
Testing Strategy
- Unit tests
- Manual tests
- This PR is not tested :(

Signed-off-by: cindyz <cindyz@anyscale.com>

ray-operator/config/samples/ray-service.autoscaler.yaml

tests/framework/prototype.py

tests/test_sample_rayservice_yamls.py

Signed-off-by: cindyz <cindyz@anyscale.com>

shrekris-anyscale

Thanks for addressing all my comments! The change looks good to me.

kevin85421

Leave some comments; the rest looks good to me! Having these tests would be very helpful!

tests/framework/utils.py

ray-operator/config/samples/ray-service.autoscaler.yaml

kevin85421 · 2023-06-16T18:59:51Z

ray-operator/config/samples/ray-service.autoscaler.yaml

-        routePrefix: "/"
-        rayActorOptions:
-          numCpus: 0.1
+  serveConfigV2: |


How many CPUs will these applications utilize?

0.5 per replica, it should scale to 14 replicas in the test so that would require 7 CPUs.

kevin85421 · 2023-06-16T19:02:02Z

ray-operator/config/samples/ray-service.autoscaler.yaml

    ######################headGroupSpecs#################################
    headGroupSpec:
      # The `rayStartParams` are used to configure the `ray start` command.
      # See https://github.com/ray-project/kuberay/blob/master/docs/guidance/rayStartParams.md for the default settings of `rayStartParams` in KubeRay.
      # See https://docs.ray.io/en/latest/cluster/cli.html#ray-start for all available options in `rayStartParams`.
-      rayStartParams: {"num-cpus": "0"}
+      rayStartParams:


What is the reason for this change?

I didn't make the changes from the original autoscaler file, instead I added autoscaling configurations to the original config/samples/ray_v1alpha1_rayservice.yaml file.

ray-operator/config/samples/ray-service.autoscaler.yaml

tests/framework/prototype.py

tests/framework/utils.py

Signed-off-by: cindyz <cindyz@anyscale.com>

kevin85421 · 2023-06-16T23:32:09Z

tests/test_sample_rayservice_yamls.py

+
+        scale_up_rule = AutoscaleRule(
+            query={"path": "/", "json_args": {}},
+            num_repeat=20,


Does it mean sending 20 requests? How do we know it will scale up to 14 replicas (7 CPUs) instead of another number, such as 5 replicas?

I do not understand when will the Serving autoscaling be triggered.

I use the deployment from https://github.com/ray-project/serve_workloads/blob/main/autoscaling_test/blocked.py. If you send requests to this deployment, the request will block indefinitely until https://github.com/ray-project/serve_workloads/blob/main/autoscaling_test/signaling.py releases the "lock". The target number of ongoing requests per replica is 1, so serve will try to add enough replicas to serve all 20 requests (since all requests are blocked).

Thank you for the explanation! We may need to add comments to both this test and the YAML file.

Good point. I've added comments to both the test and yaml.

kevin85421

LGTM

kevin85421 · 2023-06-16T23:40:11Z

tests/test_sample_rayservice_yamls.py

+
+        scale_up_rule = AutoscaleRule(
+            query={"path": "/", "json_args": {}},
+            num_repeat=20,


Thank you for the explanation! We may need to add comments to both this test and the YAML file.

Signed-off-by: cindyz <cindyz@anyscale.com>

Add e2e tests

wip

d2e2033

Signed-off-by: cindyz <cindyz@anyscale.com>

zcin requested a review from kevin85421 June 14, 2023 21:19

cindyz added 3 commits June 14, 2023 23:34

wip

d23cff8

Signed-off-by: cindyz <cindyz@anyscale.com>

wip

807b3ac

Signed-off-by: cindyz <cindyz@anyscale.com>

wip

d296463

Signed-off-by: cindyz <cindyz@anyscale.com>

zcin marked this pull request as ready for review June 15, 2023 16:19

zcin changed the title ~~[WIP][RayService] Add e2e tests~~ [RayService] Add e2e tests Jun 15, 2023

zcin requested a review from sihanwang41 June 15, 2023 17:13

autoscaler e2e test

8e5a18f

Signed-off-by: cindyz <cindyz@anyscale.com>

zcin requested a review from shrekris-anyscale June 16, 2023 15:48

shrekris-anyscale reviewed Jun 16, 2023

View reviewed changes

cindyz added 2 commits June 16, 2023 18:27

address comments

9300c62

Signed-off-by: cindyz <cindyz@anyscale.com>

show number of head pods and worker pods in show cluster info

f736ea6

Signed-off-by: cindyz <cindyz@anyscale.com>

shrekris-anyscale approved these changes Jun 16, 2023

View reviewed changes

kevin85421 approved these changes Jun 16, 2023

View reviewed changes

sihanwang41 approved these changes Jun 16, 2023

View reviewed changes

fix yaml

05e5526

Signed-off-by: cindyz <cindyz@anyscale.com>

zcin force-pushed the services-e2e-tests branch from 22f5aa6 to 05e5526 Compare June 16, 2023 20:30

cindyz added 2 commits June 16, 2023 20:41

address comments

1fb1454

Signed-off-by: cindyz <cindyz@anyscale.com>

update timeout

7e00e73

Signed-off-by: cindyz <cindyz@anyscale.com>

kevin85421 reviewed Jun 16, 2023

View reviewed changes

kevin85421 approved these changes Jun 16, 2023

View reviewed changes

add comments

520a6a1

Signed-off-by: cindyz <cindyz@anyscale.com>

kevin85421 merged commit 8db4f6d into ray-project:master Jun 17, 2023
20 checks passed

kevin85421 mentioned this pull request Jul 26, 2023

[Feature] E2E Test plan for RayService #762

Closed

2 tasks

zcin deleted the services-e2e-tests branch August 25, 2023 17:35

lowang-bh pushed a commit to lowang-bh/kuberay that referenced this pull request Sep 24, 2023

[RayService] Add e2e tests (ray-project#1167)

b79ce36

Add e2e tests

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RayService] Add e2e tests #1167

[RayService] Add e2e tests #1167

zcin commented Jun 14, 2023 •

edited

Loading

shrekris-anyscale left a comment

kevin85421 left a comment

kevin85421 Jun 16, 2023

zcin Jun 16, 2023

kevin85421 Jun 16, 2023

zcin Jun 16, 2023

kevin85421 Jun 16, 2023

kevin85421 Jun 16, 2023

zcin Jun 16, 2023 •

edited

Loading

kevin85421 Jun 16, 2023

zcin Jun 17, 2023

kevin85421 left a comment

kevin85421 Jun 16, 2023

[RayService] Add e2e tests #1167

[RayService] Add e2e tests #1167

Conversation

zcin commented Jun 14, 2023 • edited Loading

Why are these changes needed?

Related issue number

Checks

shrekris-anyscale left a comment

Choose a reason for hiding this comment

kevin85421 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

zcin Jun 16, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kevin85421 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

zcin commented Jun 14, 2023 •

edited

Loading

zcin Jun 16, 2023 •

edited

Loading