Add an example with torchdata and torchserve #1940
Conversation
```python
testset = datasets.MNIST('./MNIST_dataset', download=True, train=False, transform=image_transform)

# Creating the dataloader.
inference_dataset = torch.utils.data.DataLoader(testset, batch_size=BATCH_SIZE, shuffle=True)
```
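For readers who want to try the loader step in isolation, here is a minimal, self-contained sketch of the same `DataLoader` pattern, substituting a toy in-memory `TensorDataset` for the downloaded MNIST test set (the tensor sizes and batch size here are illustrative, not from the PR):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Toy stand-in for the MNIST test set: 32 fake 1x28x28 grayscale images with labels.
images = torch.zeros(32, 1, 28, 28)
labels = torch.zeros(32, dtype=torch.long)
testset = TensorDataset(images, labels)

# Same DataLoader pattern as in the example, with an illustrative batch size of 8.
inference_dataset = DataLoader(testset, batch_size=8, shuffle=True)

batches = list(inference_dataset)
# 32 samples / batch size 8 -> 4 batches, each image batch shaped (8, 1, 28, 28)
```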
Overall I think the example looks good; it integrates a torchvision dataset, but it isn't quite clearly a torchdata integration.
Specifically, I was hoping we could create some toy torchdata dataset directly, without leveraging torchvision. I believe this change would be minor to your code, but if it isn't, I'm happy to merge this as-is if you have the bandwidth to work on the more vanilla torchdata integration later.
I agree. @NivekT, do you have an existing pipeline for vision benchmarking that they could take as a reference?
For DataPipe reference:
- Here is the torchvision implementation of loading MNIST - it might be too complicated. One option is to import and directly use that here (similar to how `datasets.MNIST` is used).
- A standalone, common example is something like this:
```python
dp = FileLister(str(root), masks=[f"archive_{args.archive_size}*.tar"])
dp = dp.shuffle(buffer_size=10000)
dp = FileOpener(dp, mode="b")
dp = TarArchiveLoader(dp, mode="r:")
dp = dp.shuffle(buffer_size=archive_size)
dp = dp.sharding_filter()
dp = dp.map(pil_loader).map(pil_transformation)
# dp = dp.map(tensor_loader).map(tensor_transformation)  # Alternate - convert image to tensor, then transform
```
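As a small runnable illustration of the same DataPipe style, without any tar archives on disk, the sketch below builds a toy pipeline from the datapipes that ship inside torch itself (`torch.utils.data.datapipes`); the values and buffer sizes are made up for the example:

```python
from torch.utils.data.datapipes.iter import IterableWrapper

# Stand-in for the FileLister/FileOpener stages: just the integers 0..15.
dp = IterableWrapper(range(16))
dp = dp.shuffle(buffer_size=16)   # buffered shuffle, as in the snippet above
dp = dp.sharding_filter()         # no-op here; shards the stream across DataLoader workers
dp = dp.map(lambda x: x * x)      # stand-in for pil_loader / pil_transformation

values = sorted(dp)               # iteration order varies because of the shuffle
```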
Separately, I think we should use `DataLoader2` instead of the old version in the example. @ejguan WDYT?
Thank you @msaroufim, @ejguan, @agunapal, and @NivekT for your comments and guidance. I have incorporated the required changes. Looking forward to your feedback. Thank you once again.
Thanks for the quick turnaround @PratsBhatt. Looks good; I am approving it. Minor feedback: please link the example from https://github.com/pytorch/serve/blob/master/examples/README.md, since it's a few levels deep and might be missed by others.
@PratsBhatt Thanks for taking this up. Overall it looks good, but in this example we would want to explicitly make use of TorchData features (e.g., DataPipes). You could take a look at this example in TorchData and see if you can modify your current example with it: https://github.com/pytorch/data/blob/main/examples/vision/imagefolder.py
Amazing, thank you for the quick turnaround.
Codecov Report

```
@@           Coverage Diff           @@
##           master    #1940   +/-   ##
=======================================
  Coverage   44.95%   44.95%
=======================================
  Files          63       63
  Lines        2609     2609
  Branches       56       56
=======================================
  Hits         1173     1173
  Misses       1436     1436
```
> Looks good.
> Minor feedback: Please link the example here since it's a few levels deep and might be missed by others.
> https://github.com/pytorch/serve/blob/master/examples/README.md
Thank you @agunapal and @msaroufim, I have implemented the code changes. Looking forward to merging the PR.
* Add an example with torchdata
* Update comment.
* Incorporate code review comments.
* Remove unused imports.
* Apply code review comments.
Description
The pull request provides a simple example of using torchdata with torchserve.
It uses MNIST as the dataset and task to be solved.
The current example builds on top of the already provided example of MNIST.
Type of change
The current pull request adds an example using torchdata and torchserve for the MNIST model.

It adds an `inference.py` script that takes care of loading the MNIST dataset and making REST calls to TorchServe. It also adds a new `mnist_handler.py` script, which adds a preprocessing step to convert the payload of the REST request to a tensor, as well as to output a class number once the inference request is finished.

The output of `inference.py` looks like the following.

Checklist:
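The handler's two responsibilities described above (payload-to-tensor preprocessing and class-number postprocessing) can be sketched as standalone functions. The names and the assumption that the payload is raw 28x28 grayscale bytes are hypothetical; a real TorchServe handler would wire such steps into its handler class rather than use free functions:

```python
import torch

def preprocess(payload: bytes) -> torch.Tensor:
    """Convert a raw REST payload (28*28 grayscale bytes, assumed) into a model input."""
    t = torch.frombuffer(bytearray(payload), dtype=torch.uint8)
    return t.float().div(255.0).reshape(1, 1, 28, 28)

def postprocess(logits: torch.Tensor) -> int:
    """Turn the model's (1, 10) logits into a single MNIST class number."""
    return int(logits.argmax(dim=1).item())

x = preprocess(bytes(28 * 28))        # all-zero dummy image
fake_logits = torch.zeros(1, 10)
fake_logits[0, 3] = 1.0               # pretend the model favored class 3
predicted = postprocess(fake_logits)
```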