Add support for JSON encoding of torch.Tensor
to keep it consistent with sagemaker-inference-toolkit
#84
Diff in encoding logic:
Sagemaker base toolkit: https://github.com/aws/sagemaker-inference-toolkit/blob/e602335fd9a4db08216d1f58ded2861cccb64f7d/src/sagemaker_inference/encoder.py#L25_L44
HF inference toolkit: https://github.com/aws/sagemaker-huggingface-inference-toolkit/blob/27275f40a2bbff85bb507646e6a3ef866d0599af/src/sagemaker_huggingface_inference_toolkit/decoder_encoder.py#L80_L114
This is similar to JSON encoding of numpy data.
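A minimal sketch of the idea (hypothetical helper names; the actual toolkit code is at the links above): add a `default` hook to `json.dumps` that converts tensor-like objects via their `.tolist()` method, mirroring how numpy arrays are handled. Duck-typing on `.tolist()` covers both `torch.Tensor` and `numpy.ndarray` without importing either here.

```python
import json


def _default(obj):
    # torch.Tensor and numpy.ndarray both expose .tolist(),
    # which returns plain Python lists/scalars that json can handle.
    if hasattr(obj, "tolist"):
        return obj.tolist()
    raise TypeError(
        f"Object of type {type(obj).__name__} is not JSON serializable"
    )


def encode_json(content):
    # Serialize a response payload, converting any tensor-like values.
    return json.dumps(content, default=_default)
```

The same `default=` hook approach is what lets the base toolkit serialize numpy data; extending the check to anything with `.tolist()` picks up torch tensors as well.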
Issue #, if available: N/A
Description of changes:
Inference script:

With `HuggingFaceModel`: `mms.service.PredictionException: Object of type Tensor is not JSON serializable : 400`

With `PyTorchModel`: works fine.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.