
Conversation

nikhil-sk (Contributor) commented Sep 27, 2021

Issue #, if available:
NA

Description of changes:

  1. Add a feature to specify the following per-model properties that help with batch inference:
     • batchSize
     • maxBatchDelay
     • minWorkers
     • maxWorkers
     • responseTimeout
  2. These model properties are exposed as the following environment variables in the toolkit:
     • SAGEMAKER_TS_BATCH_SIZE
     • SAGEMAKER_TS_MAX_BATCH_DELAY
     • SAGEMAKER_TS_MIN_WORKERS
     • SAGEMAKER_TS_MAX_WORKERS
     • SAGEMAKER_TS_RESPONSE_TIMEOUT
     These variables need to be supplied as a dictionary to the 'env' option when configuring a model with the SageMaker Python SDK.
  3. Note: These properties apply only to single-model inference on SageMaker. For a multi-model endpoint, a user still needs to bake the config.properties file into the container and list the models in that file; see the illustrative snippet below.
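For reference, a baked-in config.properties for the multi-model case might look like the following sketch. This is illustrative only: the model names, .mar files, property values, and the model_store path are assumptions, and only the JSON structure of the models entry mirrors the "Model config" line in the logs below.

# Hypothetical config.properties baked into a multi-model container
model_store=/opt/ml/model
load_models=modelA.mar,modelB.mar
models={"modelA": {"1.0": {"defaultVersion": true, "marName": "modelA.mar", "batchSize": 4, "maxBatchDelay": 100}}, "modelB": {"1.0": {"defaultVersion": true, "marName": "modelB.mar", "batchSize": 1, "maxBatchDelay": 50}}}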

Logs

When run in SageMaker, the model config is correctly picked up from the environment when specified as follows:

Input:

from sagemaker.pytorch.model import PyTorchModel

# TorchServe model properties, passed to the container as environment variables
env_variables_dict = {
    "SAGEMAKER_TS_BATCH_SIZE": "3",
    "SAGEMAKER_TS_MAX_BATCH_DELAY": "100000"
}

# model_artifact, role, and image_uri are defined earlier in the notebook
pytorch_model = PyTorchModel(
    model_data=model_artifact,
    role=role,
    image_uri=image_uri,
    source_dir="code",
    framework_version='1.9',
    entry_point="inference.py",
    env=env_variables_dict
)
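
For completeness, deploying this model would follow the usual SDK pattern (a hedged sketch; the instance type and count below are assumptions, not taken from this PR):

predictor = pytorch_model.deploy(
    initial_instance_count=1,
    instance_type="ml.m5.xlarge",
)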

Output:

2n5r6rur8a-algo-1-33bni | WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv
2n5r6rur8a-algo-1-33bni | ['torchserve', '--start', '--model-store', '/.sagemaker/ts/models', '--ts-config', '/etc/sagemaker-ts.properties', '--log-config', '/sagemaker-pytorch-inference-toolkit/src/sagemaker_pytorch_serving_container/etc/log4j.properties', '--models', 'model.mar']
2n5r6rur8a-algo-1-33bni | 2021-09-27 19:06:42,737 [INFO ] main org.pytorch.serve.servingsdk.impl.PluginsManager - Initializing plugins manager...
2n5r6rur8a-algo-1-33bni | 2021-09-27 19:06:42,927 [INFO ] main org.pytorch.serve.ModelServer - 
2n5r6rur8a-algo-1-33bni | Torchserve version: 0.4.2
2n5r6rur8a-algo-1-33bni | TS Home: /usr/local/lib/python3.6/dist-packages
2n5r6rur8a-algo-1-33bni | Current directory: /
2n5r6rur8a-algo-1-33bni | Temp directory: /tmp
2n5r6rur8a-algo-1-33bni | Number of GPUs: 0
2n5r6rur8a-algo-1-33bni | Number of CPUs: 32
2n5r6rur8a-algo-1-33bni | Max heap size: 30688 M
2n5r6rur8a-algo-1-33bni | Python executable: /usr/bin/python3
2n5r6rur8a-algo-1-33bni | Config file: /etc/sagemaker-ts.properties
2n5r6rur8a-algo-1-33bni | Inference address: http://0.0.0.0:8080
2n5r6rur8a-algo-1-33bni | Management address: http://0.0.0.0:8080
2n5r6rur8a-algo-1-33bni | Metrics address: http://127.0.0.1:8082
2n5r6rur8a-algo-1-33bni | Model Store: /.sagemaker/ts/models
2n5r6rur8a-algo-1-33bni | Initial Models: model.mar
2n5r6rur8a-algo-1-33bni | Log dir: /logs
2n5r6rur8a-algo-1-33bni | Metrics dir: /logs
2n5r6rur8a-algo-1-33bni | Netty threads: 0
2n5r6rur8a-algo-1-33bni | Netty client threads: 0
2n5r6rur8a-algo-1-33bni | Default workers per model: 32
2n5r6rur8a-algo-1-33bni | Blacklist Regex: N/A
2n5r6rur8a-algo-1-33bni | Maximum Response Size: 6553500
2n5r6rur8a-algo-1-33bni | Maximum Request Size: 6553500
2n5r6rur8a-algo-1-33bni | Prefer direct buffer: false
2n5r6rur8a-algo-1-33bni | Allowed Urls: [file://.*|http(s)?://.*]
2n5r6rur8a-algo-1-33bni | Custom python dependency for model allowed: false
2n5r6rur8a-algo-1-33bni | Metrics report format: prometheus
2n5r6rur8a-algo-1-33bni | Enable metrics API: true
2n5r6rur8a-algo-1-33bni | Workflow Store: /.sagemaker/ts/models
2n5r6rur8a-algo-1-33bni | Model config: {"model": {"1.0": {"defaultVersion": true, "marName": "model.mar", "minWorkers": 1, "maxWorkers": 4, "batchSize": 3, "maxBatchDelay": 100000, "responseTimeout": 120}}}
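
The "Model config" line above shows the SAGEMAKER_TS_BATCH_SIZE and SAGEMAKER_TS_MAX_BATCH_DELAY values (3 and 100000) landing in TorchServe's per-model config. As a minimal sketch of the translation this PR performs, assuming hypothetical names (this is not the toolkit's actual code), the env vars could be folded into the 'models' JSON written to /etc/sagemaker-ts.properties like so:

import json
import os

# Map of toolkit env vars to TorchServe per-model properties (names from this PR).
ENV_TO_PROPERTY = {
    "SAGEMAKER_TS_BATCH_SIZE": "batchSize",
    "SAGEMAKER_TS_MAX_BATCH_DELAY": "maxBatchDelay",
    "SAGEMAKER_TS_MIN_WORKERS": "minWorkers",
    "SAGEMAKER_TS_MAX_WORKERS": "maxWorkers",
    "SAGEMAKER_TS_RESPONSE_TIMEOUT": "responseTimeout",
}

def build_models_property(mar_name="model.mar"):
    # build_models_property is a hypothetical helper, not the toolkit's API.
    version_config = {"defaultVersion": True, "marName": mar_name}
    for env_var, prop in ENV_TO_PROPERTY.items():
        if env_var in os.environ:
            version_config[prop] = int(os.environ[env_var])
    model_name = mar_name.rsplit(".", 1)[0]
    # Produces a line like: models={"model": {"1.0": {...}}}
    return "models=" + json.dumps({model_name: {"1.0": version_config}})

With only the two variables from the Input set, the remaining values (minWorkers, maxWorkers, responseTimeout) would come from defaults, which matches the log line above.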

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

sagemaker-bot (Collaborator)

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-pytorch-inference-toolkit-pr
  • Commit ID: 82c0919
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

lxning previously approved these changes Sep 27, 2021
sagemaker-bot (Collaborator)

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-pytorch-inference-toolkit-pr
  • Commit ID: bfe053d
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)


sagemaker-bot (Collaborator)

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-pytorch-inference-toolkit-pr
  • Commit ID: 9b36af5
  • Result: FAILED
  • Build Logs (available for 30 days)


sagemaker-bot (Collaborator)

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-pytorch-inference-toolkit-pr
  • Commit ID: 28a1b6b
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)


sagemaker-bot (Collaborator)

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-pytorch-inference-toolkit-pr
  • Commit ID: f093170
  • Result: FAILED
  • Build Logs (available for 30 days)


sagemaker-bot (Collaborator)

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-pytorch-inference-toolkit-pr
  • Commit ID: 61a7d0b
  • Result: FAILED
  • Build Logs (available for 30 days)


sagemaker-bot (Collaborator)

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-pytorch-inference-toolkit-pr
  • Commit ID: 2ccf3b6
  • Result: FAILED
  • Build Logs (available for 30 days)


sagemaker-bot (Collaborator)

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-pytorch-inference-toolkit-pr
  • Commit ID: a14faf1
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)


maaquib merged commit 27b667f into aws:master on Oct 2, 2021