Re-throw certain specific OOM errors as 507 for SageMaker MME + repo path enhancement #4392

nskool · 2022-05-17T16:25:44Z

This change is an addition to the MME changes in this PR: Enable Triton for SageMaker MME mode #4181.
This change adds the ability for the SM endpoint to throw a 507 to the SM platform as per - https://docs.aws.amazon.com/sagemaker/latest/dg/mms-container-apis.html#multi-model-api-load-model.
Can be updated to throw capture more specific errors for different backends.
It also adds the ability for Triton on SM to handle the model artifact within two locations i.e. both /opt/ml/models/<hash>/model as well as /opt/ml/models/<hash>/model/<model_subdir>. The second one is useful if the customer is re-using a model after trying it in SME mode.

src/sagemaker_server.cc

GuanLuo · 2022-05-19T23:34:42Z

src/sagemaker_server.cc

+      return;
+    } else {
+      /* Return a 400*/
+      evhtp_send_reply(req, EVHTP_RES_BADREQ);


Actually this should be done outside the for loop, so it checks if any messages are matched before returning 400 early

@nskool let me know when it is ready for a CI run

thanks, fixed - request you to run the CI

GuanLuo reviewed May 18, 2022

View reviewed changes

src/sagemaker_server.cc Outdated Show resolved Hide resolved

nskool changed the title ~~Re-throw certain specific OOM errors as 507 for SageMaker MME~~ Re-throw certain specific OOM errors as 507 for SageMaker MME + repo path enhancement May 19, 2022

nskool added 6 commits May 19, 2022 22:16

Re-throw certain specific OOM errors as 507 for SageMaker MME

65698a0

Handle subdir case within another model/

d6b41c4

Typo fix

24adc03

Handle nested subdir within repo

5e9b947

Add text about hidden folders

7896767

Add more generic OOM

35ce601

nskool force-pushed the sagemaker_507 branch from b6265da to 35ce601 Compare May 19, 2022 22:16

GuanLuo reviewed May 19, 2022

View reviewed changes

Address comment

c4b3d33

GuanLuo approved these changes May 23, 2022

View reviewed changes

GuanLuo merged commit 2965840 into triton-inference-server:main May 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Re-throw certain specific OOM errors as 507 for SageMaker MME + repo path enhancement #4392

Re-throw certain specific OOM errors as 507 for SageMaker MME + repo path enhancement #4392

nskool commented May 17, 2022 •

edited

Loading

GuanLuo May 19, 2022

GuanLuo May 20, 2022

nskool May 20, 2022

Re-throw certain specific OOM errors as 507 for SageMaker MME + repo path enhancement #4392

Re-throw certain specific OOM errors as 507 for SageMaker MME + repo path enhancement #4392

Conversation

nskool commented May 17, 2022 • edited Loading

GuanLuo May 19, 2022

Choose a reason for hiding this comment

GuanLuo May 20, 2022

Choose a reason for hiding this comment

nskool May 20, 2022

Choose a reason for hiding this comment

nskool commented May 17, 2022 •

edited

Loading