
Adding mps support to base handler and regression test #3048

Merged: 20 commits into master, Apr 9, 2024
Conversation

@udaij12 (Collaborator) commented Mar 27, 2024

Description

Adding MPS as a device type for Mac M1. #3022

When model-config.yaml sets deviceType: "gpu", or deviceType is not specified (the default is GPU):
MPS is enabled.

When model-config.yaml sets deviceType: "cpu":
MPS is not enabled.

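For illustration, the two cases above correspond to the following model-config.yaml settings (only deviceType is shown; other keys are unaffected by this PR):

    # MPS enabled: deviceType set to gpu, or simply omitted (GPU is the default)
    deviceType: gpu

    # MPS not enabled: deviceType explicitly set to cpu
    deviceType: cpu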
Test cases

Added 4 test cases.
Frontend:

  1. Test that the number of GPUs is greater than 0 when on a Mac M1

Backend:

  1. Set deviceType to GPU and test that device == "mps"
  2. Set deviceType to CPU and test that device != "mps"
  3. Don't set deviceType (it remains at the default) and test that device == "mps"
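
A minimal pytest sketch of the three backend cases, assuming a hypothetical helper make_handler_with_config() that initializes a handler from the given model-config.yaml contents on an MPS-capable Mac; the actual tests added in this PR may be structured differently:

    # Sketch only. make_handler_with_config is a hypothetical helper that builds
    # and initializes a handler from the given model-config.yaml contents; it is
    # not part of this PR. torch.backends.mps.is_available() is the standard
    # PyTorch check for Apple MPS support.
    import pytest
    import torch

    requires_mps = pytest.mark.skipif(
        not torch.backends.mps.is_available(), reason="requires an Apple MPS device"
    )

    @requires_mps
    def test_device_type_gpu_uses_mps():
        handler = make_handler_with_config({"deviceType": "gpu"})
        assert handler.device.type == "mps"

    @requires_mps
    def test_device_type_cpu_does_not_use_mps():
        handler = make_handler_with_config({"deviceType": "cpu"})
        assert handler.device.type != "mps"

    @requires_mps
    def test_default_device_type_uses_mps():
        handler = make_handler_with_config({})  # deviceType left unset (default)
        assert handler.device.type == "mps"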

@@ -149,6 +149,8 @@ def initialize(self, context):
            self.device = torch.device(
                self.map_location + ":" + str(properties.get("gpu_id"))
            )
        elif hasattr(self, "model_yaml_config") and "mps" in self.model_yaml_config and self.model_yaml_config["mps"] == "enable":
Collaborator

Can we add mps here?

deviceType: cpu # cpu, gpu, neuron

cc @lxning

Collaborator

In terms of design, having one config for all deviceType probably makes sense?

@lxning (Collaborator) commented Mar 27, 2024

  1. CX can still specify deviceType (cpu or gpu) on Mac.
  2. The PR should support getting the number of GPUs on Mac.
  3. The GPU device ID is decided by the frontend, the same as on Linux.
  4. The backend handler should auto-detect Mac and apply MPS plus the deviceId. Essentially, there are no new config parameters in model-config.yaml.

@udaij12 (Collaborator, Author) commented Mar 29, 2024

MPS is auto-detected in the base handler, and getAvailableGpu() now supports Mac M1 GPU cores.
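
A minimal sketch of what this auto-detection might look like in the handler's device-selection path, assuming torch.backends.mps.is_available() and the gpu_id supplied by the frontend; the merged base_handler.py may differ in detail:

    import torch

    def select_device(properties, device_type="gpu"):
        # Sketch of the selection order: CUDA first, then Apple MPS (auto-detected),
        # then CPU. device_type comes from model-config.yaml; gpu_id is assigned by
        # the frontend, the same as on Linux.
        gpu_id = properties.get("gpu_id")
        if device_type == "cpu":
            return torch.device("cpu")
        if torch.cuda.is_available() and gpu_id is not None:
            return torch.device("cuda:" + str(gpu_id))
        if torch.backends.mps.is_available():
            return torch.device("mps")
        return torch.device("cpu")

On an M1 Mac with deviceType left at its default, this sketch returns torch.device("mps"); with deviceType: cpu it falls back to the CPU.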

@agunapal (Collaborator) left a comment

Please add the user experience for both cases to the PR or a README, and add a pytest for MPS on/off for a model.

@agunapal (Collaborator) commented Apr 1, 2024

cc: @msaroufim

@udaij12 udaij12 marked this pull request as ready for review April 2, 2024 17:14
Inline review threads on ts/torch_handler/base_handler.py were resolved (two marked outdated).
@lxning (Collaborator) left a comment

Can you add test cases?

  1. define deviceType in model-config.yaml on Mac:
     • cpu
     • gpu
       • deviceId
  2. define deviceType in model-config.yaml on a GPU host

@lxning (Collaborator) commented Apr 8, 2024

@udaij12 In general, LGTM. I just left a minor comment: #3048 (comment). Thanks.

@agunapal (Collaborator) commented Apr 8, 2024

@udaij12 Can you please add a document called apple_silicon_support.md to the docs folder where you mention these details? Also, please include what is currently working.

@agunapal (Collaborator) left a comment

LGTM

@msaroufim msaroufim self-requested a review April 8, 2024 22:51
@udaij12 udaij12 enabled auto-merge April 8, 2024 23:39
@udaij12 udaij12 added this pull request to the merge queue Apr 9, 2024
Merged via the queue into master with commit 89c5389 Apr 9, 2024
13 checks passed