sagemaker-endpoints-blog

Description

This repository walks you through the end-to-end process of deploying a single custom model on Amazon SageMaker using the Prithvi model, a temporal Vision Transformer developed by IBM and NASA and pre-trained on the Harmonized Landsat Sentinel-2 data collection. With its unique dependencies and architecture, the Prithvi model is an effective example of how to deploy complex custom models to SageMaker.

Prerequisites

You need the following prerequisites before you can proceed. For this post, we use the us-east-1 (Northern Virginia) Region:

Have access to a POSIX-based (Mac/Linux) system or SageMaker Notebooks
Ensure you have IAM permissions for SageMaker access; S3 bucket create, read, and PutObject access; CodeBuild access; Amazon Elastic Container Registry (ECR) repository access; and the ability to create IAM roles
Download the Prithvi model artifact files and the Burn Scar fine-tuning files

Solution Overview

To run a custom model that needs unique packages as an Amazon SageMaker endpoint, follow these steps:

If your model requires additional packages or package versions unavailable from SageMaker’s managed container images, you will need to extend one of the container images.
- For this blog, the image 763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-training:1.13.1-gpu-py39-cu117-ubuntu20.04-sagemaker was used
- You will need to create a Dockerfile for your new container like the one in the repository
Write a Python model definition using SageMaker’s inference.py file format. See the inference.py file in this repository for reference.
Define your model artifacts and inference file within a specific file structure, archive your model files as a tar.gz, and upload the archive to Amazon Simple Storage Service (S3).

File Structure:

./
- code
-- inference.py
-- requirements.txt
model.config
weights.pth
other_model_data...
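To make the role of code/inference.py in the tree above more concrete, here is a minimal sketch of the four handler functions (model_fn, input_fn, predict_fn, output_fn) that SageMaker's PyTorch serving stack looks for. It is not the repository's actual inference.py. Treating model.config as JSON, the "scale" key, the JSON request format, and the stand-in model are all assumptions made for illustration. A real handler would build the Prithvi network and load weights.pth with torch, and would decode the NumPy-serialized payload used in the query example instead of JSON.

```python
import json
import os


def model_fn(model_dir):
    """Load the model once when the container starts.

    Stand-in only: reads model.config as JSON (assumed format) instead of
    building the real network and loading weights.pth.
    """
    config_path = os.path.join(model_dir, "model.config")
    with open(config_path) as f:
        config = json.load(f)
    # Hypothetical stand-in "model": multiplies every input value by a factor.
    scale = config.get("scale", 1.0)
    return lambda values: [v * scale for v in values]


def input_fn(request_body, request_content_type):
    """Deserialize the request payload into model input."""
    if request_content_type != "application/json":
        raise ValueError(f"Unsupported content type: {request_content_type}")
    return json.loads(request_body)["inputs"]


def predict_fn(input_data, model):
    """Run the loaded model on the deserialized input."""
    return model(input_data)


def output_fn(prediction, accept):
    """Serialize the prediction for the HTTP response."""
    if accept != "application/json":
        raise ValueError(f"Unsupported accept type: {accept}")
    return json.dumps({"predictions": prediction})
```

SageMaker calls model_fn once at startup with the directory where model.tar.gz was unpacked, then runs input_fn, predict_fn, and output_fn in that order for each request.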

Tar Command:

tar -czvf model.tar.gz ./
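If you prefer to build the archive from Python (for example inside a notebook), the tar command above can be approximated with the standard library. This is a sketch, not part of the repository; build_model_archive and list_archive are hypothetical helper names. It places the contents of the source directory at the archive root and skips the archive file itself when it is written into the same directory.

```python
import os
import tarfile


def build_model_archive(source_dir, archive_path):
    """Pack the contents of source_dir at the archive root as a .tar.gz."""
    archive_abs = os.path.abspath(archive_path)
    with tarfile.open(archive_abs, "w:gz") as tar:
        for name in sorted(os.listdir(source_dir)):
            full = os.path.join(source_dir, name)
            # Skip the archive itself if it lives inside source_dir.
            if os.path.abspath(full) == archive_abs:
                continue
            # Directories such as code/ are added recursively.
            tar.add(full, arcname=name)
    return archive_abs


def list_archive(archive_path):
    """Return the sorted member names, useful for checking the layout."""
    with tarfile.open(archive_path, "r:gz") as tar:
        return sorted(tar.getnames())
```

Listing the archive before uploading is a quick way to confirm that code/inference.py sits under code/ and that the weights are at the top level.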

Create the S3 bucket and upload the tar:

# generate a unique, lowercase postfix (S3 bucket names must be lowercase)
BUCKET_POSTFIX=$(uuidgen | tr '[:upper:]' '[:lower:]' | cut -d'-' -f1)
echo "export BUCKET_POSTFIX=${BUCKET_POSTFIX}" >> ~/.bashrc
echo "Your bucket name will be mybucket-${BUCKET_POSTFIX}"

# make your bucket
aws s3 mb s3://mybucket-${BUCKET_POSTFIX}

# upload to your bucket
aws s3 cp model.tar.gz s3://mybucket-${BUCKET_POSTFIX}/model.tar.gz
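The same bucket creation and upload can be done with boto3 instead of the AWS CLI. The following is a sketch under a few assumptions: make_bucket_name, is_valid_bucket_name, and create_bucket_and_upload are hypothetical helpers, the name check covers only a simplified subset of the S3 naming rules, and boto3 is imported lazily so the naming helpers work without it installed.

```python
import re
import uuid


def make_bucket_name(prefix="mybucket"):
    """Return '<prefix>-<8 hex chars>', like taking the first uuidgen group."""
    postfix = uuid.uuid4().hex[:8]  # first UUID group, already lowercase
    return f"{prefix}-{postfix}"


def is_valid_bucket_name(name):
    """Simplified check: 3-63 chars, lowercase letters, digits, dots, hyphens."""
    return re.fullmatch(r"[a-z0-9][a-z0-9.-]{1,61}[a-z0-9]", name) is not None


def create_bucket_and_upload(archive_path, bucket, key="model.tar.gz",
                             region="us-east-1"):
    """Create the bucket and upload the archive; returns the S3 URI."""
    import boto3  # lazy import: only needed when actually talking to AWS

    s3 = boto3.client("s3", region_name=region)
    if region == "us-east-1":
        # us-east-1 must not be passed as a LocationConstraint.
        s3.create_bucket(Bucket=bucket)
    else:
        s3.create_bucket(
            Bucket=bucket,
            CreateBucketConfiguration={"LocationConstraint": region},
        )
    s3.upload_file(archive_path, bucket, key)
    return f"s3://{bucket}/{key}"
```

The returned S3 URI is the value you supply as the model data location when you create the SageMaker model.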

With your model code and an extended SageMaker container, use SageMaker Studio to create a model, endpoint configuration, and endpoint.
Call the inference endpoint to confirm that your model is running correctly.
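The steps above use SageMaker Studio, but the same three resources (model, endpoint configuration, endpoint) can also be created through the boto3 SageMaker client. The sketch below is one possible way to do that, not the blog's method. The function names, the "-config" and "-endpoint" naming scheme, the variant name, and the ml.g4dn.xlarge instance type are assumptions chosen for illustration. The client is passed in so the logic can be exercised without AWS access.

```python
import time


def build_requests(name, image_uri, model_data_url, role_arn,
                   instance_type="ml.g4dn.xlarge"):
    """Build the three request payloads (instance type is an assumption)."""
    model = {
        "ModelName": name,
        "PrimaryContainer": {"Image": image_uri, "ModelDataUrl": model_data_url},
        "ExecutionRoleArn": role_arn,
    }
    config = {
        "EndpointConfigName": f"{name}-config",
        "ProductionVariants": [{
            "VariantName": "AllTraffic",
            "ModelName": name,
            "InstanceType": instance_type,
            "InitialInstanceCount": 1,
        }],
    }
    endpoint = {
        "EndpointName": f"{name}-endpoint",
        "EndpointConfigName": f"{name}-config",
    }
    return model, config, endpoint


def deploy(sm, name, image_uri, model_data_url, role_arn, poll_seconds=30):
    """Create model, endpoint config, and endpoint, then wait for a final status.

    `sm` is a SageMaker client, e.g. boto3.client("sagemaker").
    """
    model, config, endpoint = build_requests(
        name, image_uri, model_data_url, role_arn)
    sm.create_model(**model)
    sm.create_endpoint_config(**config)
    sm.create_endpoint(**endpoint)
    while True:
        status = sm.describe_endpoint(
            EndpointName=endpoint["EndpointName"])["EndpointStatus"]
        if status in ("InService", "Failed"):
            return status
        time.sleep(poll_seconds)
```

Here image_uri would be the URI of your extended container in ECR and model_data_url the S3 location of model.tar.gz. Only query the endpoint once deploy returns "InService".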

Querying the Endpoint

from sagemaker.predictor import Predictor
from sagemaker.serializers import NumpySerializer

# URL of a sample HLS GeoTIFF; it is sent to the endpoint as the request payload
payload = "https://huggingface.co/spaces/ibm-nasa-geospatial/Prithvi-100M-demo/resolve/main/HLS.L30.T13REN.2018013T172747.v2.0.B02.B03.B04.B05.B06.B07_cropped.tif"

# replace with the name of the endpoint you created
endpoint_name = "your-endpoint-name"

predictor = Predictor(endpoint_name=endpoint_name)
predictor.serializer = NumpySerializer()

predictions = predictor.predict(payload)

Cleaning Up Resources

To clean up the resources from this blog and avoid incurring costs, follow these steps:

Delete the SageMaker endpoint, endpoint configuration, and model.
Delete the ECR image and repository.
Delete the model.tar.gz in the S3 bucket that was created.
Delete the S3 bucket.
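The cleanup steps above can also be scripted. The following sketch issues the corresponding boto3 calls in the same order. The cleanup function and its parameter names are hypothetical, and all resource names are ones you supply. Clients are passed in rather than created inside the function so the call sequence can be checked without touching AWS.

```python
def cleanup(sm, ecr, s3, endpoint_name, config_name, model_name,
            repository_name, bucket, key="model.tar.gz"):
    """Delete the resources created in this walkthrough, in order.

    sm, ecr, and s3 are boto3 clients for "sagemaker", "ecr", and "s3".
    """
    # 1. SageMaker endpoint, endpoint configuration, and model.
    sm.delete_endpoint(EndpointName=endpoint_name)
    sm.delete_endpoint_config(EndpointConfigName=config_name)
    sm.delete_model(ModelName=model_name)
    # 2. ECR repository; force=True also removes the images it contains.
    ecr.delete_repository(repositoryName=repository_name, force=True)
    # 3. The uploaded archive, then the (now empty) bucket.
    s3.delete_object(Bucket=bucket, Key=key)
    s3.delete_bucket(Bucket=bucket)
```

A typical call would pass boto3.client("sagemaker"), boto3.client("ecr"), and boto3.client("s3"). Note that delete_bucket fails if the bucket still holds other objects, so remove anything else you uploaded first.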

Additional Notes:

The model information is in model, but it does not include the actual .pt checkpoint files; you will need to add those yourself.
local_test.py is a simple script for testing your endpoint in local mode, which can be helpful if you hit an error deploying the endpoint.
The remaining files (codebuild-project.json, prithvi_container-source.zip, etc.) can help you follow the blog's build process using CodeBuild; using CodeBuild is not a requirement.

Support

For any questions, reach out to riaidan@amazon.com

Authors and acknowledgment

Thanks to the whole team (Aidan Ricci, Charlotte Fondren, Nate Haynes)

License

MIT No Attribution

aws-samples/sagemaker-custom-model-inference-endpoint

Folders and files

Latest commit

History

Repository files navigation

sagemaker-endpoints-blog

Description

Prerequisites

Solution Overview

Querying the Endpoint

Cleaning Up Resources

Additional Notes:

Support

Authors and acknowledgment

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages