[Serve][Doc] Update docs about input schema, and json_request adapter #24191

simon-mo · 2022-04-25T23:45:52Z

Why are these changes needed?

This PR dives into more detail about the input schema type. In particular:

Restructured DAGDriver's section in deployment graph doc to highlight built in one.
In order to do that, I have to fix [Feature] DAGDriver should support partial access of InputNode #23484 by adding a JSON adapters.
Clarified multiple different usage of http adapters: Ray AIR, DAGDriver, and FastAPI directly.

Related issue number

Closes #23484

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

…schema-doc

simon-mo · 2022-04-25T23:46:22Z

doc/source/serve/deployment-graph.md

@@ -16,15 +16,15 @@ kernelspec:

 # Deployment Graph

-```{note} 


trailing whitespace, hard to revert but good for code quality, so i kept these changes.

simon-mo · 2022-04-25T23:46:52Z

doc/source/serve/deployment-graph.md

@@ -394,31 +420,21 @@ more info on these options.

 Now we're done! The full example below covers the full example for you to try out.

-```{code-cell} ipython3
-:tags: [remove-cell]


we should show some of these so when people copy the full example from the web page, the code works end to end

simon-mo · 2022-04-25T23:47:25Z

doc/source/serve/deployment-graph.md

    # Each serve dag has a driver deployment as ingress that can be user provided.
-    serve_dag = DAGDriver.options(route_prefix="/my-dag").bind(dag)
+    serve_dag = DAGDriver.options(route_prefix="/my-dag", num_replicas=2).bind(


verified locally running this example works end to end. we should convert this to a Python file and run in CI (but in a separate PR)

doc/source/serve/deployment-graph.md

jiaodong · 2022-04-26T00:03:55Z

doc/source/serve/deployment-graph.md

-```{note}
-We expect each DAG has a driver class implementation as root, similar to the example below. This is where HTTP ingress are configured and implemented. We provide a default `DAGDriver` to handle simple HTTP parsing, but in this example we put up a custom implementation. 
+Serve provides a default DAGDriver implementation that accepts HTTP request and orchestrate the deployment graph execution. You can import it from `from ray.serve.drivers import DAGDriver`.
+


I think we should also mention user has all the freedom they want to provide their own DAGDriver implementation as ingress just as if what they would do today, and what we did here is to facilitate holding dag instance by providing a default template.

Realistically i think there'll be decent # of users who start with our default implementation then add their own code to it rather than our DAGDriver covers everything they need ?

it's in two paragraph below.

jiaodong · 2022-04-26T00:05:15Z

does our CI provide a default link to the documentation based on latest commit to review the whole webpage ? i remember seeing it from my last iteration of the documentation

simon-mo · 2022-04-26T00:12:13Z

https://ray--24191.org.readthedocs.build/en/24191/

yes just click the docs/readthedocs:ray" details page

jiaodong · 2022-04-26T00:36:29Z

content looks good to me, do you want to change input_schema to http_adapter in this PR then ? It covers code, tests and documentation changes already

jiaodong · 2022-04-26T00:40:57Z

python/ray/serve/tests/test_pipeline_driver.py

+            input_schema=json_request,
+        )
+    handle = serve.run(dag)
+    assert ray.get(handle.predict.remote([1, 2])) == [1, 2]


can you try it with an example that passes multiple objects to dag.execute(1, 2, [3,4]) ? I remember default DAGDriver will throw due to mismatched signature that lead to different behaviors between ray dag and serve dag regarding InputNode handling

jiaodong · 2022-04-27T17:55:10Z

python/ray/serve/tests/test_pipeline_driver.py

+            input_schema=json_request,
+        )
+    handle = serve.run(dag)
+    assert ray.get(handle.predict.remote([1, 2, [3, 4]])) == [1, 2, [3, 4]]


sorry i actually meant handle.predict.remote(1, 2, [3, 4]) so it's multiple args that will spawn InputAttributeNode rather than dealing with one python object. You can see them in https://sourcegraph.com/github.com/ray-project/ray/-/blob/python/ray/experimental/dag/tests/test_input_node.py?L225

This doesn't make sense for typical DAGDriver here. The input is still a single item because predict and http request has a single value representing the input. As you can see in the dag users can still write input[0] to index into input node.

how about we use the same internal data wrapper as ray dag's execute ? https://sourcegraph.com/github.com/ray-project/ray/-/blob/python/ray/experimental/dag/input_node.py?L255

I understand eventually InputNode translate to one python object, but it's better if we can minimize the churn from user's dev flow if they find out they need to change input syntax / structure from ray core to serve dag. I'm not suggesting what we have now is already optimal, but they should be the same class.

jiaodong

synced offline and agreed on doing *args **kwargs unpacking on default DAGDriver

simon-mo added 3 commits April 25, 2022 16:32

update input schema doc

c86b6c8

Merge branch 'master' of github.com:ray-project/ray into serve/input-…

cbc64c0

…schema-doc

remove appendix

df19d91

simon-mo assigned jiaodong and edoakes Apr 25, 2022

simon-mo commented Apr 25, 2022

View reviewed changes

add test for json requests and partial input

a0cf7c6

jiaodong reviewed Apr 26, 2022

View reviewed changes

jiaodong added the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Apr 27, 2022

update tests

362ea7f

simon-mo removed the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Apr 27, 2022

jiaodong reviewed Apr 27, 2022

View reviewed changes

Allow unpacking

1d52d18

jiaodong approved these changes Apr 27, 2022

View reviewed changes

simon-mo merged commit ee52895 into ray-project:master Apr 27, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Serve][Doc] Update docs about input schema, and json_request adapter #24191

[Serve][Doc] Update docs about input schema, and json_request adapter #24191

simon-mo commented Apr 25, 2022

simon-mo Apr 25, 2022

simon-mo Apr 25, 2022

simon-mo Apr 25, 2022

jiaodong Apr 26, 2022

simon-mo Apr 26, 2022

jiaodong commented Apr 26, 2022

simon-mo commented Apr 26, 2022

jiaodong commented Apr 26, 2022

jiaodong Apr 26, 2022

jiaodong Apr 27, 2022

simon-mo Apr 27, 2022

jiaodong Apr 27, 2022

jiaodong left a comment

[Serve][Doc] Update docs about input schema, and json_request adapter #24191

[Serve][Doc] Update docs about input schema, and json_request adapter #24191

Conversation

simon-mo commented Apr 25, 2022

Why are these changes needed?

Related issue number

Checks

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jiaodong commented Apr 26, 2022

simon-mo commented Apr 26, 2022

jiaodong commented Apr 26, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jiaodong left a comment

Choose a reason for hiding this comment