Various fixes to enable tempo to pack environment for single models #189

sakoush · 2021-09-02T16:41:49Z

This PR adds various fixes that we needed to get an mlflow model(e.g based on pytorch) saved and deployed using Tempo.

The main change is to pack the conda environment if save_env is True even if there is no custom predict function (i.e. BaseModel._user_func is False) defined, which is the case with individual models.

Other changes that are required as well:

Pack the runtime mlserver dependency as well, for example in the case of mlfow runtime we need to pack mlserver-mlflow. Models have platform which we use to specify with platform we use. (ModelFramework.MLFlow -> mlserver-mlflow) and that is defined in MLServerRuntimeEnvDeps.
MLServer supports a numpy codec for request input when the whole input in the (case of an image) is being sent in one go as in the first input. Setting content_type=np in the inference request will trigger that. check: Add request codec for numpy and strings which returns the first element MLServer#286 .

There is an example to showcase the pytorch model serving as a notebook.

TODO in this PR still:

Make sure that the other notebooks are still functional.

ukclivecox · 2021-09-03T07:48:50Z

tempo/kfserving/protocol.py

@@ -6,6 +6,8 @@
 from tempo.serve.metadata import ModelDataArgs, ModelDetails
 from tempo.serve.protocol import Protocol

+_REQUEST_NUMPY_CONTENT_TYPE = {"content_type": "np"}


Is this a MLServer specific parameter. The KFServing V2 could be for any compliant V2 server like Triton. Would need to check if this will be ignored by them?

yes I think this is specific to MLServer. Doing a quick search on "content_type" in the triton repo doesnt result in anything that suggests that it is being used. We need to check then that it will be ignored by triton as you suggested.

this bit is a little bit convoluted I admit, so we might want to think how to do it properly.

The parameters field of the v2 payload should accept any arbitrary field. So unless Triton already uses this field for something else, I think that it should be alright.

ukclivecox · 2021-09-03T07:50:14Z

tempo/serve/constants.py

@@ -8,7 +10,13 @@
 DefaultRemoteFilename = "remote.pickle"
 DefaultEnvFilename = "environment.tar.gz"

-MLServerEnvDeps = ["mlserver==0.4.0"]
+MLServerEnvDeps = ["mlserver==0.4.1.dev1"]


What are the changes in MLServer to require the dev1 release?

SeldonIO/MLServer#273 and SeldonIO/MLServer#286

These are specific changes required for being able to serve pytorch models (via mlflow) and it needed numpy fixes.

Now that the parallel inference fix for the outlier example is also in, should we build a 0.4.2 release and use that instead?

ukclivecox · 2021-09-03T07:51:09Z

tempo/serve/loader/env.py



-def _get_env(conda_env_file_path: str = None, env_name: str = None) -> dict:
+def _get_env(conda_env_file_path: str = None, env_name: str = None, platform: ModelFramework = None) -> dict:


Can you explain the changes in this file?

if we pack the conda environment, we need to add mlserver dependencies if not defined in conda.yaml

In the case of mlflow this is the case as the model conda.yaml generated will not have anything related to mlserver as not required for training.

There is logic to add mlserver package in _add_required_deps however we also need the corresponding runtime to be packaged (e.g. mlserver-mlflow in this case) and it is dependant on the type of model used. A reference to the platform is able to let us do that.

There is logic that

ukclivecox · 2021-09-03T07:53:08Z

Should we add an associated example with MLFlow?

review-notebook-app · 2021-09-03T16:16:49Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

sakoush added 10 commits September 2, 2021 10:00

add mlflow as runtime for mlserver

6853752

add pytorch mlflow model artifact

86ed093

add mlserver deps if not present in conda.yaml

ee3113b

reshuffle logic to save individual model is save env is true

4db415f

raise exception in the model cannot be load with tempo runtime

893dff2

tidy up code

cf716f6

use latest mlserver

bd28e20

pass np codec

95438b1

add a test

b81d5fc

add more tests

295a8c7

sakoush requested review from adriangonz and ukclivecox September 2, 2021 16:41

sakoush changed the title ~~Various fixes to enable temp to pack environment for single models~~ Various fixes to enable tempo to pack environment for single models Sep 2, 2021

ukclivecox reviewed Sep 3, 2021

View reviewed changes

sakoush added 3 commits September 3, 2021 11:35

add model saving

4034faf

use dev mlserver docker image

3f64a2f

initial mlfow example

35c3704

sakoush added 9 commits September 3, 2021 21:27

fix notebook

75471c0

tidy up code

2a1272a

add tests

0a7a1a4

tidy up notebook and create README.md

daff3a8

link to README in the main doc

20ca075

fixes to notebook and readme

0abeada

markdown fixes

15d9be1

reshuffle cells in notebook

a5d4394

notebooks fixes

fcd62fa

sakoush added 9 commits September 7, 2021 11:42

use mlserver dev in examples

2e3722f

refresh README.md and use production namespace

4cda2bc

fix broken link in doc

b961b54

upgrade seldon-core version in ansible

83a1785

add a note about python version

31795e7

update doc

712570d

reorder cell

6d2b2cb

clear output of notebook

ea40422

Merge branch 'master' into SherifAkoush/72/mlflow_example

32c4823

ukclivecox merged commit 0a0be3b into SeldonIO:master Sep 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Various fixes to enable tempo to pack environment for single models #189

Various fixes to enable tempo to pack environment for single models #189

sakoush commented Sep 2, 2021 •

edited

Loading

ukclivecox Sep 3, 2021

sakoush Sep 3, 2021

adriangonz Sep 8, 2021

ukclivecox Sep 3, 2021

sakoush Sep 3, 2021

adriangonz Sep 8, 2021

ukclivecox Sep 3, 2021 •

edited

Loading

sakoush Sep 3, 2021

ukclivecox commented Sep 3, 2021

review-notebook-app bot commented Sep 3, 2021



		def _get_env(conda_env_file_path: str = None, env_name: str = None) -> dict:
		def _get_env(conda_env_file_path: str = None, env_name: str = None, platform: ModelFramework = None) -> dict:

Various fixes to enable tempo to pack environment for single models #189

Various fixes to enable tempo to pack environment for single models #189

Conversation

sakoush commented Sep 2, 2021 • edited Loading

ukclivecox Sep 3, 2021

Choose a reason for hiding this comment

sakoush Sep 3, 2021

Choose a reason for hiding this comment

adriangonz Sep 8, 2021

Choose a reason for hiding this comment

ukclivecox Sep 3, 2021

Choose a reason for hiding this comment

sakoush Sep 3, 2021

Choose a reason for hiding this comment

adriangonz Sep 8, 2021

Choose a reason for hiding this comment

ukclivecox Sep 3, 2021 • edited Loading

Choose a reason for hiding this comment

sakoush Sep 3, 2021

Choose a reason for hiding this comment

ukclivecox commented Sep 3, 2021

review-notebook-app bot commented Sep 3, 2021

sakoush commented Sep 2, 2021 •

edited

Loading

ukclivecox Sep 3, 2021 •

edited

Loading