New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to update existing endpoint with newly trained model #101
Comments
Unfortunately, the feature for allowing you to update existing endpoints directly with .deploy is still on our backlog. We'll look again at its prioritization. In the meantime, you can try the workaround described in this issue: #58 |
I've made this work, the implementation is pretty straightforward. I'd contribute if I wasn't busy with building our ML infra around Sagemaker. Here's what I did:
|
@ChoiByungWook what is the availability impact for the in-place updating? And the example link is looking broken. Thanks! |
I'm still seeing this error w/1.55.0 when trying to deploy a PyTorch Model:
Example code:
|
can you try specifying |
Hi @laurenyu - seems like I get the same error including a new predictor = pytorch_model.deploy(
endpoint_name = env.setting('model_name')+"-1",
instance_type = env.setting('instance_type'),
update_endpoint = True,
initial_instance_count = 1) |
could you open a new issue in this repo? (sorry for the inconvenience, but it'll help with our internal tracking and making sure we respond) |
Still not working on my side. Same error as the original bug report, even when using update_endpoint = true. Did anyone open a new issue? |
@hubtub2 the behavior around this changed with v2.0+, so it's probably best if you open a new issue and include your specific code and Python SDK version |
Hello!
I am investigating the Sagemaker API for use in production (without notebooks). I am able to train a model, create an endpoint and delete the endpoint without any problems with the API.
However, in a very common situation where I have a newly trained model on new data, I would like to be able to update/change the model that is currently serving in the specified endpoint and not have to update other services. In production, I would like to update the model serving without any downtime.
Currently when I try to do this operation, simply train a new model and deploy to an endpoint using
deploy
with:I get the following error:
botocore.exceptions.ClientError: An error occurred (ValidationException) when calling the CreateEndpoint operation: Cannot create already existing endpoint "arn:aws:sagemaker:eu-west-1:166488713907:endpoint/iris".
Am I missing something here? Do I have to / can I do this operation manually with the boto3 api instead?
Thank you
The text was updated successfully, but these errors were encountered: