Update DJLModel class for latest container releases #4754
Conversation
fastertransformer_predictor = fastertransformer_model.deploy("ml.g5.12xlarge",
                                                             initial_instance_count=1)

Regardless of which way you choose to create your model, a ``Predictor`` object is returned. You can use this ``Predictor``
should we still leave in some docs explaining that a ``Predictor`` is returned on ``deploy()``?
yea i can do that, I think i deleted this line by mistake!
Each ``Predictor`` provides a ``predict`` method, which can do inference with json data, numpy arrays, or Python lists.
Inference data are serialized and sent to the DJL Serving model server by an ``InvokeEndpoint`` SageMaker operation. The
``predict`` method returns the result of inference against your model.

By default, the inference data is serialized to a json string, and the inference result is a Python dictionary.
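As a hedged sketch of the default behavior described above (the payload shape and the model server response here are hypothetical, not from a real endpoint), the request/response handling looks roughly like this:

```python
import json

def predict_sketch(data):
    """Sketch of the default (de)serialization around a predict call."""
    # Inference data is serialized to a json string before being sent to the
    # DJL Serving model server via an InvokeEndpoint SageMaker operation.
    request_body = json.dumps(data)

    # A real call would invoke the endpoint with request_body; here we fake a
    # model server response purely for illustration.
    fake_response_body = '{"generated_text": "hello world"}'

    # The json response is deserialized into a Python dictionary by default.
    return json.loads(fake_response_body)

result = predict_sketch({"inputs": "hello", "parameters": {"max_new_tokens": 16}})
print(result["generated_text"])
```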
Model Directory Structure
is all of this removed information being moved to the LMI docs on the AWS documentation site?
Yes, we have this all on our LMI docs site.
We are not publishing on AWS docs anymore, but have aligned on all our docs living here https://docs.djl.ai/docs/serving/serving/docs/lmi/index.html. The model directory structure specifically is here https://docs.djl.ai/docs/serving/serving/docs/lmi/deployment_guide/model-artifacts.html
def _set_serve_properties(hf_model_config: dict, schema_builder: SchemaBuilder) -> tuple:
def _get_default_djl_configurations(
    model_id: str, hf_model_config: dict, schema_builder: SchemaBuilder
) -> tuple:
nit: tuple[dict, int]
src/sagemaker/djl_inference/model.py
Outdated
logger.info("Using provided engine %s", self.engine)
return self.engine

if self.task is not None:
nit: unnecessary, None == "text-embedding" is not an error
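The reviewer's point can be shown directly: comparing None to a string is a valid Python expression that simply evaluates to False, so the None guard before the task comparison is redundant.

```python
# Comparing None against a string does not raise; it just evaluates to False,
# so a preceding "if self.task is not None" check is unnecessary.
task = None
print(task == "text-embedding")  # → False
```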
@akrishna1995 - can you review this PR? I have two approvals, one from my team (owning DJLModel class), and one from the team that owns the ModelBuilder class.
Issue #, if available:
Description of changes:
The DJLModel class has not been updated in multiple container releases, and as such its functionality is completely broken.
This change updates the DJLModel class for the latest DJL container releases. Furthermore, it simplifies the interface and functionality so that it is more future-proofed.
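As a hedged sketch of the environment-variable configuration style this change moves toward: the specific variable names below (HF_MODEL_ID and the OPTION_* prefix) follow LMI container conventions but are assumptions here and should be checked against the LMI docs; the model id is hypothetical.

```python
# Illustrative environment-variable configuration replacing a
# serving.properties file. Variable names are assumed from LMI container
# conventions, not taken from this PR.
env = {
    "HF_MODEL_ID": "mistralai/Mistral-7B-v0.1",   # hypothetical model id
    "OPTION_TENSOR_PARALLEL_DEGREE": "4",
    "OPTION_MAX_ROLLING_BATCH_SIZE": "64",
}

# A real deployment would pass this dict to DJLModel, e.g.:
# model = DJLModel("mistralai/Mistral-7B-v0.1", role=role, env=env)
# predictor = model.deploy("ml.g5.12xlarge", initial_instance_count=1)

print(sorted(env))
```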
Specifically:

- Removes the serving.properties requirement in favor of environment variables. A serving.properties file can still be provided and used, but the DJLModel class will not read/parse this configuration.

Testing done:
Merge Checklist
Put an `x` in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your pull request.

General
Tests
unique_name_from_base to create resource names in integ tests (if appropriate)

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.