Add warning in MLflow pytorch docs to include signature #5347

mehtayogita · 2022-02-04T18:52:53Z

What changes are proposed in this pull request?

Updates the MLflow PyTorch documentation to add a warning recommending that users include a signature when logging a model, to avoid float precision errors.

How is this patch tested?

Building docs locally and verifying the change.

Does this PR change the documentation?

No. You can skip the rest of this section.
Yes. Make sure the changed pages / sections render correctly by following the steps below.

Check the status of the ci/circleci: build_doc check. If it's successful, proceed to the
next step, otherwise fix it.
Click Details on the right to open the job page of CircleCI.
Click the Artifacts tab.
Click docs/build/html/index.html.
Find the changed pages / sections and make sure they render correctly.

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

Updates the MLflow PyTorch documentation.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

Interface

area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
area/windows: Windows support

Language

language/r: R APIs and clients
language/java: Java APIs and clients
language/new: Proposals for new client languages

Integrations

integrations/azure: Azure and Azure ML integrations
integrations/sagemaker: SageMaker integrations
integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

mehtayogita · 2022-02-04T20:23:13Z

autoformat

mehtayogita · 2022-02-04T20:47:12Z

autoformat

mehtayogita · 2022-02-04T21:22:30Z

autoformat

ankit-db

Good start - a few notes

ankit-db · 2022-02-05T00:44:34Z

mlflow/pytorch/__init__.py

+        .. warning::
+
+            Log the model with signature to avoid inference errors. Pytorch float precision default
+            is float32, while numpy float precision default is float64. Adding the signature will


I would maybe re-frame this a bit and say:

For models without signatures, the MLflow Model Server relies on the default inferred data type from NumPy. However, PyTorch often expects different defaults, particularly when parsing floats.

Updated as suggested.
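The default mismatch discussed in this thread can be reproduced with a short sketch. The toy `torch.nn.Linear` model and the sample values are assumptions for illustration only.

```python
import numpy as np
import torch

# NumPy infers float64 for Python floats, while PyTorch infers float32.
numpy_input = np.array([[1.0, 2.0, 3.0, 4.0]])
torch_input = torch.tensor([[1.0, 2.0, 3.0, 4.0]])
print(numpy_input.dtype)  # float64
print(torch_input.dtype)  # torch.float32

# Hypothetical toy model; its weights are float32 by default.
model = torch.nn.Linear(4, 1)

# Feeding the float64 array straight into the float32 model raises an error.
try:
    model(torch.from_numpy(numpy_input))
    mismatch_error = None
except RuntimeError as exc:
    mismatch_error = exc

# Casting the input to float32 first makes the call succeed.
output = model(torch.from_numpy(numpy_input.astype(np.float32)))
```

The exact error message differs between PyTorch versions, so the sketch only relies on a `RuntimeError` being raised.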

ankit-db

One minor nit, but looks great otherwise!

andreakress · 2022-02-07T19:40:23Z

mlflow/pytorch/__init__.py

+            For models without signatures, the MLflow Model Server relies on the default inferred
+            data type from NumPy. However, PyTorch often expects different defaults, particularly
+            when parsing floats. Include the signature to ensure that the model is logged with the
+            correct data type so that the MLflow model server can correctly provide valid input


Suggested change

-            correct data type so that the MLflow model server can correctly provide valid input
+            correct data type so that the MLflow model server correctly provides valid input

Just to clarify @andreakress - one thing I think would be good to emphasize is that by logging a signature, the user is making it possible for the model server to provide valid input. Correctly inferring the correct data types without the signature is an impossible problem. Maybe just me, but, in your suggested phrasing, it kind of feels like we're saying that there's a bug where it won't provide it correctly.

WDYT of something like this:

If the model is logged without a signature, the MLflow Model Server relies on the default inferred data type from NumPy. However, PyTorch often expects different defaults, particularly when parsing floats. You must include the signature to ensure that the model is logged with the correct data type so that the MLflow model server can correctly provide valid input.

Sounds good!

Thanks for helping iterate on this one!

andreakress

Approved with one suggestion.

mehtayogita added 4 commits February 2, 2022 15:29

Make it easy to find how to log model signature.

a987f8f

Add one line in the model signature introduction section and add link to detailed section in the introduction. Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Fix the link to how to log models with signatures section.

a888e38

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Fix the link to how to log models with signatures. Verified by runnin…

17ad2a0

…g sphinx build locally. Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Add warning in MLFlow Pytorch log model documentation to add signatur…

6e44a25

…e while logging model to avoid float precision errors. Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

github-actions bot added the area/docs and rn/none labels Feb 4, 2022

mehtayogita and others added 3 commits February 4, 2022 11:19

Merge branch 'mehtayogita-ML-19507'

f40c216

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Bring the two branches in sync.

041a242

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Merge branch 'master' into ML-19507

bb4622c

mehtayogita added the autoformat label Feb 4, 2022

mehtayogita added 2 commits February 4, 2022 12:20

Add 'the' before models in documentation.

cff3d53

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Merge branch 'master' into ML-19507

71c7f25

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

mlflow-automation and others added 3 commits February 4, 2022 20:24

Autoformat: https://github.com/mlflow/mlflow/actions/runs/1796974651

f3fbd78

Signed-off-by: mlflow-automation <mlflow-automation@users.noreply.github.com>

Format the file.

e5746fe

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Merge branch 'ML-19507' of github.com:mehtayogita/mlflow into ML-19507

d2e9bf2

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Format the file to pass lint.

01592dc

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Autoformat: https://github.com/mlflow/mlflow/actions/runs/1797191551

affc93e

Signed-off-by: mlflow-automation <mlflow-automation@users.noreply.github.com>

mehtayogita requested review from ankit-db and andreakress February 4, 2022 21:29

andreakress reviewed Feb 5, 2022

mlflow/pytorch/__init__.py (outdated)

andreakress reviewed Feb 5, 2022

mlflow/pytorch/__init__.py (outdated)

andreakress reviewed Feb 5, 2022

mlflow/pytorch/__init__.py

andreakress reviewed Feb 5, 2022

mlflow/pytorch/__init__.py

ankit-db reviewed Feb 5, 2022

mehtayogita added 3 commits February 7, 2022 09:25

Address review comments.

3ca7772

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Fix lint errors.

9efd37e

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

Merge branch 'ML-19507' of github.com:mehtayogita/mlflow into ML-19507

2097b37

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

ankit-db approved these changes Feb 7, 2022

mlflow/pytorch/__init__.py (outdated)

Address review comment.

6c277cf

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

mehtayogita requested a review from andreakress February 7, 2022 19:21

andreakress reviewed Feb 7, 2022

andreakress approved these changes Feb 7, 2022

Address review comments.

af0d344

Signed-off-by: Yogita Mehta <yogita.mehta@databricks.com>

mehtayogita merged commit 1389683 into mlflow:master Feb 7, 2022
