[serve] Application-level batching initial commit #14610
Conversation
```diff
@@ -58,6 +58,198 @@ def __call__(self, requests):
     assert ray.get(handle.remote(temp=1))


 def test_app_level_batching(serve_instance):
```
This and the next test are duplicates of the old batching tests.
@architkulkarni @simon-mo this ended up being pretty gnarly to get right, both for the asyncio code and for the decorator (to handle both methods and functions). Please leave any comments where the code is confusing so I can clarify with comments.
@simon-mo I added type hints based on the examples here: Could you double-check these? I'm not too familiar with mypy.
```diff
@@ -185,8 +121,8 @@ def __init__(self, _callable: Callable, backend_config: BackendConfig,
         self.is_function = is_function

         self.config = backend_config
-        self.batch_queue = BatchQueue(self.config.max_batch_size or 1,
-                                      self.config.batch_wait_timeout)
+        self.batch_queue = _BatchQueue(self.config.max_batch_size or 1,
+                                       self.config.batch_wait_timeout)
```
is this still used?
Yep, the existing codepath is untouched. I don't see a benefit to refactoring it to use this new version given that this will just be deleted.
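For readers following the discussion, the core idea behind a batch queue like this can be sketched roughly as follows. This is a hypothetical simplification written for illustration, not the PR's actual `_BatchQueue`; the class name, method names, and parameters here are made up:

```python
import asyncio
from typing import Any, List


class SimpleBatchQueue:
    """Hypothetical sketch of a batch queue with a size cap and a wait
    timeout; NOT the actual _BatchQueue implementation from this PR."""

    def __init__(self, max_batch_size: int, timeout_s: float) -> None:
        self.max_batch_size = max_batch_size
        self.timeout_s = timeout_s
        self.queue: "asyncio.Queue[Any]" = asyncio.Queue()

    def put(self, request: Any) -> None:
        # Enqueue without blocking; a worker loop consumes batches.
        self.queue.put_nowait(request)

    async def wait_for_batch(self) -> List[Any]:
        # Block until at least one request is available.
        batch = [await self.queue.get()]
        # Then wait up to timeout_s for more requests, stopping early
        # once the batch is full.
        loop = asyncio.get_running_loop()
        deadline = loop.time() + self.timeout_s
        while len(batch) < self.max_batch_size:
            remaining = deadline - loop.time()
            if remaining <= 0:
                break
            try:
                batch.append(
                    await asyncio.wait_for(self.queue.get(), remaining))
            except asyncio.TimeoutError:
                break
        return batch
```

In this sketch, `put` enqueues without blocking while a single worker coroutine awaits `wait_for_batch` and executes one batch at a time.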
@simon-mo all of your comments are addressed, please have another look!
```python
t1 = asyncio.get_event_loop().create_task(call("hi1"))
await asyncio.sleep(0.5)
t2 = asyncio.get_event_loop().create_task(call("hi2"))
t3 = asyncio.get_event_loop().create_task(call("raise"))
```
I'm a bit confused here, why are these executed in a size-two batch if timeout = 0? Doesn't timeout=0 mean that the max batch size is effectively 1?
The second two are executed together because they are both waiting once the first one finishes (only one batch is executed at a time).
Oh right, thanks
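The scheduling behavior discussed above can be reproduced with a small self-contained sketch (hypothetical; it only mimics the one-batch-at-a-time behavior, not the PR's actual code). While the worker is busy running the first batch, the second and third requests both queue up and are drained together as the next batch:

```python
import asyncio
from typing import Any, List


async def batch_worker(queue: "asyncio.Queue[Any]",
                       batches: List[List[Any]]) -> None:
    # Drain everything currently queued into one batch, then "run" that
    # batch to completion before looking at the queue again.
    while True:
        batch = [await queue.get()]
        while not queue.empty():
            batch.append(queue.get_nowait())
        batches.append(batch)
        await asyncio.sleep(0.2)  # simulate executing the batch


async def main() -> List[List[Any]]:
    queue: "asyncio.Queue[Any]" = asyncio.Queue()
    batches: List[List[Any]] = []
    worker = asyncio.ensure_future(batch_worker(queue, batches))
    queue.put_nowait("hi1")    # picked up alone: nothing else is queued yet
    await asyncio.sleep(0.05)  # worker is now busy executing ["hi1"]
    queue.put_nowait("hi2")    # arrives while the first batch is running
    queue.put_nowait("raise")  # arrives while the first batch is running
    await asyncio.sleep(0.5)   # let both batches finish
    worker.cancel()
    return batches


print(asyncio.run(main()))  # [['hi1'], ['hi2', 'raise']]
```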
Why are these changes needed?
Adds a `@serve.batch` decorator that allows for application-level batching rather than making it part of the `backend_worker.py` implementation. This should allow for more flexibility and enable batching with our future ingress plans that are more "HTTP-native." This does not yet modify other tests, examples, docs, etc.

TODOs before making this the default in the docs:

- Fix `Task was destroyed but it is pending!` errors happening at shutdown.

Related issue number
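The idea behind such a decorator can be sketched as follows. This is a hypothetical simplification written for illustration, not the PR's implementation: the real `@serve.batch` also has to handle bound methods vs. free functions and configurable batch size and wait timeout, none of which appear here.

```python
import asyncio
from typing import Any, Awaitable, Callable, List, Tuple


def batch(func: Callable[[List[Any]], Awaitable[List[Any]]]):
    """Hypothetical sketch of an application-level batching decorator.

    The wrapped coroutine takes a list of requests and must return a
    list of results of the same length. NOT the PR's actual code:
    no size cap, no wait timeout, no method support.
    """
    pending: List[Tuple[Any, asyncio.Future]] = []
    flush_scheduled = False

    async def _flush() -> None:
        nonlocal flush_scheduled
        await asyncio.sleep(0)  # let other concurrent callers enqueue first
        current, pending[:] = pending[:], []
        flush_scheduled = False
        try:
            results = await func([arg for arg, _ in current])
            for (_, fut), result in zip(current, results):
                fut.set_result(result)
        except Exception as e:
            # Fail every request in the batch together.
            for _, fut in current:
                fut.set_exception(e)

    async def wrapper(arg: Any) -> Any:
        nonlocal flush_scheduled
        fut = asyncio.get_running_loop().create_future()
        pending.append((arg, fut))
        if not flush_scheduled:
            flush_scheduled = True
            asyncio.ensure_future(_flush())
        return await fut

    return wrapper


@batch
async def handle(requests: List[str]) -> List[str]:
    # Receives the whole batch at once, like a batched handler would.
    return [r.upper() for r in requests]


async def main() -> List[str]:
    # Two concurrent calls end up in the same batch.
    return await asyncio.gather(handle("a"), handle("b"))


print(asyncio.run(main()))  # ['A', 'B']
```

Each caller awaits a per-request future; the first caller to enqueue schedules a flush task, which yields once so that other concurrent callers can join the batch before it runs.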
Checks
- I've run `scripts/format.sh` to lint the changes in this PR.