Export transform and model for torchscript, quantization and deployment #36
Conversation
from torch import Tensor

class CharTransform(nn.Module):
Does this build the vocabulary of the input data by loading it all into memory?
Yes, the dataset is only 1M for this example. We will use iterable datasets/datapipes for the vision tutorial.
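For context, a minimal sketch of what such a character-level transform might look like, assuming the vocabulary is built eagerly from the whole corpus held in memory (the constructor argument and helper names are assumptions, not the actual implementation):

import torch
from torch import Tensor, nn

class CharTransform(nn.Module):
    # Sketch only: builds the vocabulary eagerly from the full text, which is
    # fine for a ~1M-character dataset but would not scale to corpora that
    # don't fit in memory.
    def __init__(self, text: str) -> None:
        super().__init__()
        chars = sorted(set(text))                 # whole corpus in memory
        self.stoi = {ch: i for i, ch in enumerate(chars)}
        self.itos = {i: ch for i, ch in enumerate(chars)}
        self.vocab_size = len(chars)

    def forward(self, text: str) -> Tensor:
        # encode a string as a 1-D LongTensor of character ids
        return torch.tensor([self.stoi[ch] for ch in text], dtype=torch.long)

    def decode(self, ids: Tensor) -> str:
        # map character ids back to a string
        return "".join(self.itos[int(i)] for i in ids)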
# load the pickled model onto CPU from a local or remote (e.g. S3) path
fs, input_path = fsspec.core.url_to_fs(args.input_path)
with fs.open(input_path, "rb") as f:
    model = torch.load(f, map_location="cpu")
Shall we use snapshot to save/load the trained model as well?
I tried, but I feel it causes more trouble than benefit. Snapshot saves/loads state_dicts of the full training state, including model, optimizer, progress, etc. 1) We only need the model here, so it's a waste to pass the larger snapshot file through S3. 2) To load the state_dict, we would have to initialize the GPT model object here with the same config as in training, which makes this script less generic.
For "Snapshot save/load state_dicts from training states including model, optimizer, progress, etc.", we should be able to create a separate snapshot that only save model params? but anyway, it sounds reasonable to avoid the overhead. maybe lets check with @yifuwang about the appropriate use case of snapshot
loss = None
if targets is not None:
    loss = F.cross_entropy(logits.view(-1, logits.size(-1)), targets.view(-1))
Makes sense to move the loss out of the main module!
This is also required to make the model torchscriptable.
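As a sketch of that refactor: the module's forward returns only logits (a single, TorchScript-friendly return type), and cross-entropy is computed outside in the training step (the shapes and the compute_loss helper below are assumptions for illustration):

import torch.nn.functional as F
from torch import Tensor, nn

def compute_loss(model: nn.Module, inputs: Tensor, targets: Tensor) -> Tensor:
    # forward now returns only logits of shape (batch, seq_len, vocab_size);
    # the loss lives in the training loop, so the exported module keeps a
    # single, simple return type.
    logits = model(inputs)
    return F.cross_entropy(
        logits.view(-1, logits.size(-1)),  # (batch * seq_len, vocab_size)
        targets.view(-1),                  # (batch * seq_len,)
    )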
@hudeven has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
It's troublesome to deal with the transform and the model separately during torchscript, quantization, and deployment with torchserve. So I added a CombinedModule wrapper to hold them both, such that it can consume raw input (text) and produce the prediction (generated text) end to end.
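A minimal sketch of what such a wrapper might look like (the generate-method name, the decode helper, and the signatures below are assumptions, not the actual CombinedModule in this PR):

from torch import nn

class CombinedModule(nn.Module):
    # Wraps the text transform and the model into one module so that
    # torchscript, quantization, and torchserve see a single end-to-end unit:
    # raw text in, generated text out.
    def __init__(self, transform: nn.Module, model: nn.Module) -> None:
        super().__init__()
        self.transform = transform
        self.model = model

    def forward(self, text: str, max_new_chars: int = 100) -> str:
        ids = self.transform(text).unsqueeze(0)        # (1, seq_len)
        out = self.model.generate(ids, max_new_chars)  # assumed model API
        return self.transform.decode(out.squeeze(0))   # back to a string

With a wrapper along these lines, torch.jit.script(CombinedModule(transform, model)) yields one artifact that a torchserve handler can call directly on raw strings.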