JVM detect the CPU count as 1 when more CPUs are available for the container. #82

amaharek · 2021-03-15T11:14:23Z

Describe the bug
This issue is related to the issue aws/sagemaker-python-sdk#1275

JVM detect the CPU count as 1 when more CPUs are available for the container.

To reproduce

Clone the SaeMaker example
Deploy the model using the same endpoint.
Check CloudWatch logs and the number of CPU cores detected will be like Number of CPUs: 1

Expected behavior
The CPU count from CloudWatch should match the CPU count for the used instance. For example, 4 if the instance is ml.m4.xlarge

System information
Container: pytorch-inference:1.5-gpu-py3
SageMaker inference v1.1.2

The text was updated successfully, but these errors were encountered:

daniel-hanmoi-choi · 2021-03-18T19:37:19Z

@amaharek We had the same issue and fixed with this

TOOLKIT_PATH=python -c "import sagemaker_inference;print(sagemaker_inference.__path__[0])"

Add the following to the Dockerfile and build a new image based on the above

Single-model

RUN echo "vmargs=-XX:-UseContainerSupport" >> $TOOLKIT_PATH/etc/default-mms.properties

Multi-model

RUN echo "vmargs=-XX:-UseContainerSupport" >> $TOOLKIT_PATH/etc/mme-mms.properties
Correspondence

About `UserContainerSupport`

https://www.eclipse.org/openj9/docs/xxusecontainersupport/
https://blog.softwaremill.com/docker-support-in-new-java-8-finally-fd595df0ca54

Fixes #82

amaharek · 2021-07-09T22:05:49Z

PR 83 has been merged

amaharek mentioned this issue Mar 15, 2021

fix: Fixing issue #82 #83

Merged

zoran-hristov mentioned this issue Apr 23, 2021

JVM detect the CPU count as 1 when more CPUs are available for the container. aws/sagemaker-pytorch-inference-toolkit#99

Open

la-cruche mentioned this issue Jun 24, 2021

Num CPU incorrect aws/sagemaker-huggingface-inference-toolkit#3

Open

vdantu added a commit that referenced this issue Jul 7, 2021

Add VMARGS to disable the container support.

60641e0

Fixes #82

vdantu mentioned this issue Jul 7, 2021

Add VMARGS to disable the container support. #87

Closed

6 tasks

ahsan-z-khan pushed a commit that referenced this issue Jul 9, 2021

fix: Fixing issue #82 (#83)

7fcb805

amaharek closed this as completed Jul 9, 2021

davidthomas426 mentioned this issue Jan 22, 2023

add vmargs=-XX:-UseContainerSupport in config aws/sagemaker-pytorch-inference-toolkit#136

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JVM detect the CPU count as 1 when more CPUs are available for the container. #82

JVM detect the CPU count as 1 when more CPUs are available for the container. #82

amaharek commented Mar 15, 2021

daniel-hanmoi-choi commented Mar 18, 2021 •

edited

amaharek commented Jul 9, 2021 •

edited

JVM detect the CPU count as 1 when more CPUs are available for the container. #82

JVM detect the CPU count as 1 when more CPUs are available for the container. #82

Comments

amaharek commented Mar 15, 2021

daniel-hanmoi-choi commented Mar 18, 2021 • edited

Single-model

Multi-model

About UserContainerSupport

amaharek commented Jul 9, 2021 • edited

daniel-hanmoi-choi commented Mar 18, 2021 •

edited

About `UserContainerSupport`

amaharek commented Jul 9, 2021 •

edited