Saving and Loading pytorch model as state dict #3705

shrinath-suresh · 2020-11-17T10:04:34Z

Signed-off-by: Shrinath Suresh shrinath@ideas2it.com

What changes are proposed in this pull request?

The current implementation of mlflow.pytorch only supports saving the entire model to MLflow. This PR adds support for saving and loading the model using its state dict.

Instead of storing the entire model in MLflow, saving only the model's state dict reduces the size of the saved model to a great extent, which would be helpful during deployment of the model.

#3408 - Please read through the discussion points on that PR. They will be helpful for the future use cases mentioned above.

Implementation Details:

This PR adds two new methods to mlflow.pytorch, load_state_dict and save_state_dict, for loading and saving PyTorch models as state dicts. It also adds a state_dict key under the pytorch flavor. By default (when the entire model is saved), the key is set to false. The key is set to true only when the model is saved or logged as a state dict.
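For context, the round trip that these two methods are meant to wrap looks roughly like this in plain PyTorch. This is a minimal sketch that uses only torch.save, torch.load, and Module.load_state_dict; it is not the mlflow.pytorch implementation, and make_model and the file name are placeholders.

```python
import os
import tempfile

import torch
from torch import nn


def make_model():
    # Placeholder architecture; any nn.Module works the same way.
    return nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))


model = make_model()

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "state_dict.pth")

    # Save only the parameters and buffers, not the pickled module object.
    torch.save(model.state_dict(), path)

    # Loading requires re-creating the architecture first.
    restored = make_model()
    restored.load_state_dict(torch.load(path))

# Both models now hold identical weights.
weights_match = all(
    torch.equal(a, b)
    for a, b in zip(model.state_dict().values(), restored.state_dict().values())
)
```

Note that the loading side has to know how to build the model, which is the main trade-off compared with pickling the whole module.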

Sample screenshot given below

How is this patch tested?

Tested by saving and loading the model both as a state dict and as an entire model. Unit tests are in progress.

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

(Details in 1-2 sentences. You can just refer to another PR with a description if this PR is part of a larger change.)

What component(s), interfaces, languages, and integrations does this PR affect?

Components

Interface

area/uiux: Front-end, user experience, JavaScript, plotting
area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
area/windows: Windows support

Language

language/r: R APIs and clients
language/java: Java APIs and clients
language/new: Proposals for new client languages

Integrations

integrations/azure: Azure and Azure ML integrations
integrations/sagemaker: SageMaker integrations
integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes


harupy · 2020-11-29T13:32:08Z

@shrinath-suresh Thanks for the updates! By the way, I have a question about what you mentioned regarding the saved model size.

Instead of storing the entire model into mlflow, when the model state dicts are saved, the size of the model is reduced to a greater extent - which would be helpful during the deployment of the model.

I wrote a simple script to verify this behavior.

import os
import torch
from torchvision import models
import shutil
import subprocess


SAVE_DIR = "foo"

if os.path.exists(SAVE_DIR):
    shutil.rmtree(SAVE_DIR)

os.makedirs(SAVE_DIR)


model = models.resnet50(pretrained=True)
torch.save(model, f"{SAVE_DIR}/model.pt")
torch.save(model.state_dict(), f"{SAVE_DIR}/state_dict.pt")

print(subprocess.check_output(["ls", "-lh", SAVE_DIR]).decode("utf-8"))

output

total 400624
-rw-r--r--  1 harutakakawamura  staff    98M Nov 29 21:56 model.pt
-rw-r--r--  1 harutakakawamura  staff    98M Nov 29 21:56 state_dict.pt
                                         ^^^
                                         almost equal

The difference between torch.save(model.state_dict(), ...) and torch.save(model, ...) is very small. Am I missing something?
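The same comparison can be reproduced without downloading pretrained weights. The sketch below serializes a small plain nn.Module into memory both ways; the layer sizes are arbitrary and chosen only to give about 4 MB of float32 parameters.

```python
import io

import torch
from torch import nn


def serialized_size(obj):
    # Serialize into memory and report the number of bytes written.
    buffer = io.BytesIO()
    torch.save(obj, buffer)
    return buffer.getbuffer().nbytes


# A plain nn.Module with about one million float32 parameters.
model = nn.Sequential(nn.Linear(1000, 1000), nn.ReLU(), nn.Linear(1000, 10))

full_size = serialized_size(model)
state_dict_size = serialized_size(model.state_dict())

print(f"full model: {full_size} bytes")
print(f"state dict: {state_dict_size} bytes")
print(f"ratio:      {full_size / state_dict_size:.3f}")
```

For a module like this, the pickled object adds only a small amount of structural metadata on top of the tensors, so the two sizes should be close.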

harupy · 2020-11-29T15:43:20Z

WIP

(I'm writing this to consider the full design space in order to make the right decision on the API.)

How should we support state dicts?

Option 1:

log_state_dict logs a state dict in the specified artifact path (similar to log_artifact).

log_model(model, "model")  # -> saves model.pt
log_state_dict(model.state_dict(), "model")  # -> saves state_dict.pt (doesn't create or update an MLmodel file)

# --- output ---
# - model
#   - model.pt
#   - state_dict.pt
#   - MLmodel

Workflow to load the model in the TorchServe plugin:

path = _download_artifact_from_uri(model_uri)

if contains_state_dict(path):
    serialized_file = os.path.join(path, mlflow.pytorch.STATE_DICT_FILENAME)
else:
    serialized_file = os.path.join(path, mlflow.pytorch.MODEL_FILENAME)
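The existence check used in this workflow is not defined in the thread. One way it could be written, assuming the state dict is stored under a fixed file name next to the pickled model (the constants and helper names below are hypothetical, not part of mlflow.pytorch):

```python
import os
import tempfile

# Hypothetical file names, matching the Option 1 listing above.
MODEL_FILENAME = "model.pt"
STATE_DICT_FILENAME = "state_dict.pt"


def contains_state_dict(path):
    # True when the downloaded artifact directory holds a state dict file.
    return os.path.isfile(os.path.join(path, STATE_DICT_FILENAME))


def pick_serialized_file(path):
    # Prefer the state dict when present, otherwise fall back to the pickled model.
    filename = STATE_DICT_FILENAME if contains_state_dict(path) else MODEL_FILENAME
    return os.path.join(path, filename)


with tempfile.TemporaryDirectory() as artifact_dir:
    # Case 2: log_state_dict was not called, only model.pt exists.
    open(os.path.join(artifact_dir, MODEL_FILENAME), "wb").close()
    only_model = os.path.basename(pick_serialized_file(artifact_dir))

    # Case 1: log_state_dict was called, so state_dict.pt exists as well.
    open(os.path.join(artifact_dir, STATE_DICT_FILENAME), "wb").close()
    with_state_dict = os.path.basename(pick_serialized_file(artifact_dir))
```

This makes the two cases listed under the cons explicit: the loader has to branch on what happens to be in the directory.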

Pros:

log_state_dict doesn't need to create an MLmodel file.
Simpler implementation of log_state_dict

Cons:

Need to call both log_model and log_state_dict, which seems weird.
We need to handle two cases:
1. case where log_state_dict is called
2. case where log_state_dict is not called

Option 2:

Add a new flag argument save_state_dict (default: False) to mlflow.pytorch.log_model. If this value is set to True, log the state dict along with the pickled model.

Pros

???

Cons

Can't log only a state dict
Introduces a flag argument (the FlagArgument anti-pattern)

Option 3 (preferred):

log_state_dict(model.state_dict(), "model") logs a state dict and creates an MLmodel file with a new pytorch_state_dict flavor.

Workflow to load the model in the TorchServe plugin:

path = _download_artifact_from_uri(model_uri)
config = Model.load(os.path.join(path, "MLmodel"))

if "pytorch_state_dict" in config.flavors:
    serialized_file = os.path.join(path, mlflow.pytorch.STATE_DICT_FILENAME)
else:
    serialized_file = os.path.join(path, mlflow.pytorch.MODEL_FILENAME)

Pros:

Can log only a state dict (without the pickled model).

Cons:

To allow serving the model, we need to log the model class and constructor parameters along with the state dict.
More maintenance burden for us (we need to maintain both log_model and log_state_dict)

Questions:

Should we allow serving? -> Ideally yes
Can we start without serving support? -> yes

APPENDIX

What do we need to reconstruct a state dict model?

state dict
model class (= a Python file that defines the class)
constructor parameters (if the model class requires them)
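As an illustration of how these three pieces fit together, here is a minimal sketch in plain PyTorch. TinyClassifier and its parameters are made up for the example; in practice the class would live in its own Python file that is shipped alongside the state dict.

```python
import torch
from torch import nn


class TinyClassifier(nn.Module):
    # Hypothetical model class used only for this example.
    def __init__(self, in_features, hidden, num_classes):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_features, hidden), nn.ReLU(), nn.Linear(hidden, num_classes)
        )

    def forward(self, x):
        return self.net(x)


# The three ingredients: constructor parameters, model class, state dict.
constructor_params = {"in_features": 4, "hidden": 8, "num_classes": 3}
trained = TinyClassifier(**constructor_params)
state_dict = trained.state_dict()

# Reconstruction: instantiate the class, then load the weights into it.
rebuilt = TinyClassifier(**constructor_params)
rebuilt.load_state_dict(state_dict)
rebuilt.eval()

x = torch.randn(2, 4)
with torch.no_grad():
    same_output = torch.equal(trained(x), rebuilt(x))

# With different constructor parameters the tensor shapes no longer match.
try:
    TinyClassifier(in_features=4, hidden=16, num_classes=3).load_state_dict(state_dict)
    mismatch_detected = False
except RuntimeError:
    mismatch_detected = True
```

The last part shows why the constructor parameters matter: a state dict alone does not tell the loader how to build a module of the right shape.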

How does TorchServe reconstruct a state dict model from `model_file`?

https://github.com/pytorch/serve/blob/eedddb1d19d4aef24a81838cb670d17a49eec6d0/ts/torch_handler/base_handler.py#L84
It just calls model = model_class() (note that no arguments are specified) and model.load_state_dict(state_dict)

Does the torchserve plugin require an MLmodel file?

yes

What is MLflow Model?

Each MLflow Model is a directory containing arbitrary files, together with an MLmodel file in the root of the directory that can define multiple flavors that the model can be viewed in.

What is flavor?

Flavors are the key concept that makes MLflow Models powerful: they are a convention that deployment tools can use to understand the model, which makes it possible to write tools that work with models from any ML library without having to integrate each tool with each library.

https://www.mlflow.org/docs/latest/models.html#storage-format

Should we create a new flavor for `log_state_dict` rather than using the existing `pytorch` flavor?

Yes, to make it easier for downstream tools (e.g. the TorchServe plugin) to understand what they can do with the model.

@shrinath-suresh Just feel free to add comments

shrinath-suresh · 2020-11-30T07:52:25Z

The difference between torch.save(model.state_dict(), ...) and torch.save(model, ...) is very small. Am I missing something?

My observation is from the MNIST example. I ran 10 epochs, and here are the sizes of the full model and the state dict:

-rw-rw-r--  1 ubuntu ubuntu 100M Nov 30 12:37 full_model.pth
-rw-rw-r--  1 ubuntu ubuntu 534K Nov 30 12:37 state_dict.pth

harupy · 2020-11-30T09:24:06Z

@shrinath-suresh This is probably because mlflow.pytorch.autolog saves trainer.model (which is a pl.LightningModule object). pl.LightningModule has many attributes that torch.nn.Module doesn't have. These attributes increase the saved model size.
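The effect can be illustrated without Lightning. In the sketch below, a module carries a large plain attribute that is neither a parameter nor a registered buffer; cached_batch is a made-up stand-in and says nothing about what pl.LightningModule actually stores.

```python
import io

import torch
from torch import nn


def serialized_size(obj):
    # Serialize into memory and report the number of bytes written.
    buffer = io.BytesIO()
    torch.save(obj, buffer)
    return buffer.getbuffer().nbytes


class ModelWithExtras(nn.Module):
    def __init__(self):
        super().__init__()
        self.layer = nn.Linear(10, 10)
        # A plain attribute (not a parameter or registered buffer), standing in
        # for whatever extra state a richer module object might carry.
        self.cached_batch = torch.zeros(1000, 1000)

    def forward(self, x):
        return self.layer(x)


model = ModelWithExtras()

# Pickling the module includes every attribute; the state dict does not.
full_size = serialized_size(model)
state_dict_size = serialized_size(model.state_dict())

print(f"full model: {full_size} bytes")
print(f"state dict: {state_dict_size} bytes")
```

Here the pickled module is dominated by the extra attribute, while the state dict contains only the weights and bias of the linear layer.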

shrinath-suresh · 2020-12-01T10:20:33Z

You are right. The same MNIST example in plain PyTorch produces the same size for both the state dict and the entire model. We can take this discussion to a separate thread, as this PR does not depend on mlflow.pytorch.autolog.

@harupy Do you have any more comments on the code?


harupy · 2020-12-16T16:22:26Z

mlflow/pytorch/__init__.py

+    with open(pickle_module_path, "w") as f:
+        f.write(pickle_module.__name__)
+
+    model_path = os.path.join(model_data_path, _SERIALIZED_TORCH_MODEL_FILE_NAME)


A user might log a state_dict that represents a checkpoint for inference and/or resuming training (this use case). In this case _SERIALIZED_TORCH_MODEL_FILE_NAME (= "model.pth") doesn't seem to be the right name because it's not a model.

Maybe state_dict.pth is better?

Pro: easier to tell it's a state dict.
Con: harder to tell what the state dict represents.

renamed it to state_dict.pth
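To make the checkpoint use case concrete, here is a minimal sketch of a nested state dict that holds both model and optimizer state, written with plain PyTorch rather than the mlflow.pytorch functions. The key names ("model", "optimizer", "epoch") are a common convention, not something this PR prescribes.

```python
import io

import torch
from torch import nn

model = nn.Linear(4, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)

# Take one training step so the optimizer has momentum state to save.
loss = model(torch.randn(8, 4)).sum()
loss.backward()
optimizer.step()

# A checkpoint is a plain dict that nests several state dicts.
checkpoint = {
    "model": model.state_dict(),
    "optimizer": optimizer.state_dict(),
    "epoch": 1,
}

# Round-trip through an in-memory buffer instead of a file on disk.
buffer = io.BytesIO()
torch.save(checkpoint, buffer)
buffer.seek(0)
loaded = torch.load(buffer)

# Resume: rebuild the objects, then restore each nested state dict.
new_model = nn.Linear(4, 2)
new_optimizer = torch.optim.SGD(new_model.parameters(), lr=0.1, momentum=0.9)
new_model.load_state_dict(loaded["model"])
new_optimizer.load_state_dict(loaded["optimizer"])

resumed_epoch = loaded["epoch"]
weights_restored = torch.equal(new_model.weight, model.weight)
```

Since such a dict is more than a model, a neutral file name like state_dict.pth describes it better than model.pth.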


harupy · 2021-01-10T15:13:16Z

@shrinath-suresh I have pushed some commits to clean up the code :)


shrinath-suresh · 2021-01-12T02:38:21Z

@harupy Thank you very much. The changes LGTM. Do you have any other comments on this PR? If not, can we merge it?

harupy

@shrinath-suresh LGTM! Thanks for all the hard work 👍


* Adding save_state_dict and load_state_dict method to mlflow.pytorch library Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing unwanted changes Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Resetting empty lines Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Adding Unit tests for save_state_dict and load_state_dict model Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Adding log_state_dict method and refactored load_model to reuse most of the code in load_state_dict method Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing unused argument Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Applying black Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* save_state_dict, log_state_dict and load_state_dict with pytorch flavor Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing MLModel file for state dict and adding appropriate conditions to load the state dict Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Updating doc strings Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Setting experimental annotation and saving state dict as state_dict.pth Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Fixing doc strings Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing state_dict key from save_model Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Addressing review comments Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Applying black Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing doc string Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* swapping arguments Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Using get_artifact_uri to derive model path Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing pickle_module from save and log state dict Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* rephrasing doc strings Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Renaming tests Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Comparing state dicts in test Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Disabling reimport error Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing blank line between params in doc string Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing model Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Replacing _get_model_artifact_path with _download_artifact_from_uri Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* creating get_sequential_model utility and renamving model_class to model Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing pd.DataFrame type conversion Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Adding compare state dicts utility Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing Ordered Dictionary from doc string Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Fixing Docstring Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing unused variable Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing unused import Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Addressing review comments Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing unrelated change Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Addressing review comments Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Removing data folder Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* Addressing review comments Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>
* revert changes on load_model Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* remove redundant folder generation Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* Set exist_ok to True Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* Assert state_dict is dict Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* wording fix Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* kwargs Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* remove redundant model.eval Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* fix Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* Prevent false positive Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* test for nested_state_dict Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* blank line Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* move tests Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* put state dict functions in one place Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* remove unused variable Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* comment on test_save_state_dict_can_save_nested_state_dict Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* Fix Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* ensure model and optim can load state dict Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* enhance comment Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* comment Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* dot Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* remove useless comma Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* use pos args Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* rename Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* nit Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* article Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* example Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* Add checkpoint example Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* remove ... Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>
* warning Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Co-authored-by: harupy <17039389+harupy@users.noreply.github.com>
Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Adding save_state_dict and load_state_dict method to mlflow.pytorch l…

af84333

…ibrary Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

github-actions bot added the area/models and rn/feature labels Nov 17, 2020

shrinath-suresh added 3 commits November 17, 2020 15:36

Removing unwanted changes

ea2c961

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

Resetting empty lines

5842c59

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

Adding Unit tests for save_state_dict and load_state_dict model

749c99f

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

harupy reviewed Nov 18, 2020

mlflow/pytorch/__init__.py (outdated)

shrinath-suresh mentioned this pull request Nov 19, 2020

MNIST - Save/Load model as state dict mlflow/mlflow-torchserve#47

Merged

11 tasks

harupy reviewed Nov 25, 2020

mlflow/pytorch/__init__.py (outdated)

harupy reviewed Nov 25, 2020

mlflow/pytorch/__init__.py (outdated)

Adding log_state_dict method and refactored load_model to reuse most …

d1d389c

…of the code in load_state_dict method Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

shrinath-suresh commented Nov 26, 2020

mlflow/pytorch/__init__.py (outdated)

shrinath-suresh commented Nov 26, 2020

mlflow/pytorch/__init__.py (outdated)

shrinath-suresh commented Nov 26, 2020

mlflow/pytorch/__init__.py (outdated)

shrinath-suresh added 2 commits November 26, 2020 16:12

Removing unused argument

4a61a3e

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

Applying black

0f1a130

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

shrinath-suresh added 2 commits December 11, 2020 14:13

save_state_dict, log_state_dict and load_state_dict with pytorch flavor

65c090b

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

Pulling latest changes from upstream master and resolving conflicts

db42830

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

shrinath-suresh mentioned this pull request Dec 11, 2020

[WIP] Pytorch - load_state_dict/save_state_dict/log_state_dict implementation chauhang/mlflow#39

Open

27 tasks

harupy reviewed Dec 14, 2020

mlflow/pytorch/__init__.py (outdated)

shrinath-suresh added 2 commits December 16, 2020 15:50

Removing MLModel file for state dict and adding appropriate condition…

b89b1a7

…s to load the state dict Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

Updating doc strings

b7f0bb9

Signed-off-by: Shrinath Suresh <shrinath@ideas2it.com>

harupy reviewed Dec 16, 2020

mlflow/pytorch/__init__.py (outdated)

harupy reviewed Dec 16, 2020

mlflow/pytorch/__init__.py (outdated)

harupy added 12 commits January 10, 2021 00:28

remove redundant model.eval

b5695c3

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

fix

dfe4328

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Prevent false positive

f2aa709

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

test for nested_state_dict

0dc8f33

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

blank line

1b4fa3e

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

move tests

34106b1

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

put state dict functions in one place

07f742f

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

remove unused variable

f584815

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

comment on test_save_state_dict_can_save_nested_state_dict

44557f5

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Fix

ef9f227

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

ensure model and optim can load state dict

8f0bc98

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

enhance comment

ef3a9ee

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy added 10 commits January 11, 2021 01:38

comment

2dd69e5

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

dot

0c58321

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

remove useless comma

ce205c2

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

use pos args

ad70744

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

rename

cf20475

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

nit

d026c0b

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

article

5228a40

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

example

5e56e8f

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Add checkpoint example

912f3e2

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

remove ...

df84282

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy approved these changes Jan 12, 2021

harupy reviewed Jan 12, 2021

mlflow/pytorch/__init__.py

warning

f016620

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy merged commit fcf8b90 into mlflow:master Jan 12, 2021
