Iris Example with Model Signature #46

shrinath-suresh · 2021-03-24T19:18:55Z

Signed-off-by: Shrinath Suresh shrinath@ideas2it.com

What changes are proposed in this pull request?

Logging model signature and validating model signature for iris classification example.

Reused the model signature enforcements from pyfunc.

How is this patch tested?

Existing Unit Tests

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

(Details in 1-2 sentences. You can just refer to another PR with a description if this PR is part of a larger change.)

What component(s), interfaces, languages, and integrations does this PR affect?

Components

Interface

area/uiux: Front-end, user experience, JavaScript, plotting
area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
area/windows: Windows support

Language

language/r: R APIs and clients
language/java: Java APIs and clients
language/new: Proposals for new client languages

Integrations

integrations/azure: Azure and Azure ML integrations
integrations/sagemaker: SageMaker integrations
integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

harupy · 2021-04-02T10:39:04Z

mlflow/pytorch/__init__.py

+        if self.mlmodel_file_path:
+            mlmodel = Model.load(self.mlmodel_file_path)
+            if not hasattr(mlmodel, "signature"):
+                raise Exception("Model Signature not found")
+
+            input_schema = mlmodel.get_input_schema()
+
+            from mlflow.pyfunc import _enforce_schema
+            _enforce_schema(data, input_schema)
+


@shrinath-suresh
Do we really need this change? Have you tried using mlflow.pyfunc.load_model("<model_uri>")?

@shrinath-suresh
Do we really need this change? Have you tried using mlflow.pyfunc.load_model("<model_uri>")?

As of now, mlflow.pytorch library only allows to log the signature. But, it doesnt have the mechanism to validate the signature.

Is it the case, that pytorch users can log the model signature using mlflow.pytorch and validate it using mlflow.pyfunc ? If that is the case, we don't need this change.

that pytorch users can log the model signature using mlflow.pytorch and validate it using mlflow.pyfunc

I think yes because that's the only way to enforce the schema now.

harupy · 2021-04-02T10:40:01Z

examples/pytorch/signature/iris_classification.py

+import pandas as pd
+
+
+class IrisClassification(pl.LightningModule):


I think we have a similar class in other pytorch examples. Can we reuse it?

harupy · 2021-04-02T10:56:39Z

examples/pytorch/signature/iris_classification.py

+    # Uncomment this block to check invalid data type enforcement
+    # for column in df.columns:
+    #     df[column] = df[column].astype("str")
+    #
+    # print("Result with invalid datatype: ", model.predict(df))


Can we uncomment this block and wrap it with try-catch and print out the error message in the except clause?

try: model.predict(df) except Exception as e: print(e)

harupy · 2021-04-02T11:03:56Z

examples/pytorch/signature/iris_classification.py

+        mlflow.pytorch.save_model(trainer.get_model(), "model", signature=signature)
+
+    model = _load_pyfunc(path="model/data", validate_signature=True)
+    df = pd.read_json("sample.json")


Do we need really sample.json? Can we just hard-code its content in pd.DataFrame?

df = pd.DataFrame({"sepal length (cm)": ...})

harupy · 2021-04-02T11:05:58Z

examples/pytorch/signature/iris_classification.py

+    input_schema = Schema(
+        [
+            ColSpec("double", "sepal length (cm)"),
+            ColSpec("double", "sepal width (cm)"),
+            ColSpec("double", "petal length (cm)"),
+            ColSpec("double", "petal width (cm)"),
+        ]
+    )
+    output_schema = Schema([ColSpec("long")])


Can we infer the schema from the iris dataframe?

Iris Example with Model Signature

dd7a745

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

shrinath-suresh requested a review from chauhang March 24, 2021 19:18

shrinath-suresh added 2 commits March 25, 2021 22:50

Adding MLProject and conda.yaml files

502be65

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

Adding README.md

0e37d8d

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

shrinath-suresh changed the title ~~[WIP] Iris Example with Model Signature~~ Iris Example with Model Signature Mar 25, 2021

harupy reviewed Apr 2, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Iris Example with Model Signature #46

Iris Example with Model Signature #46

shrinath-suresh commented Mar 24, 2021

harupy Apr 2, 2021 •

edited

Loading

shrinath-suresh Apr 5, 2021 •

edited

Loading

harupy Apr 6, 2021

harupy Apr 2, 2021 •

edited

Loading

harupy Apr 2, 2021 •

edited

Loading

harupy Apr 2, 2021 •

edited

Loading

harupy Apr 2, 2021 •

edited

Loading

		import pandas as pd


		class IrisClassification(pl.LightningModule):

Iris Example with Model Signature #46

Are you sure you want to change the base?

Iris Example with Model Signature #46

Conversation

shrinath-suresh commented Mar 24, 2021

What changes are proposed in this pull request?

How is this patch tested?

Release Notes

Is this a user-facing change?

What component(s), interfaces, languages, and integrations does this PR affect?

How should the PR be classified in the release notes? Choose one:

harupy Apr 2, 2021 • edited Loading

Choose a reason for hiding this comment

shrinath-suresh Apr 5, 2021 • edited Loading

Choose a reason for hiding this comment

harupy Apr 6, 2021

Choose a reason for hiding this comment

harupy Apr 2, 2021 • edited Loading

Choose a reason for hiding this comment

harupy Apr 2, 2021 • edited Loading

Choose a reason for hiding this comment

harupy Apr 2, 2021 • edited Loading

Choose a reason for hiding this comment

harupy Apr 2, 2021 • edited Loading

Choose a reason for hiding this comment

harupy Apr 2, 2021 •

edited

Loading

shrinath-suresh Apr 5, 2021 •

edited

Loading

harupy Apr 2, 2021 •

edited

Loading

harupy Apr 2, 2021 •

edited

Loading

harupy Apr 2, 2021 •

edited

Loading

harupy Apr 2, 2021 •

edited

Loading