This repository has been archived by the owner on Sep 13, 2023. It is now read-only.

Deploying models with seldon-core #20

Open
Tracked by #422
aguschin opened this issue Sep 16, 2021 · 3 comments
Labels
deploy (Related to model deployment) · p2-medium (Medium priority) · plugins (Plugins and extensions for MLEM!)

Comments

@aguschin
Contributor

@DavidGOrtega suggested on Slack:

Hey guys. Reviewing OpenMlOps, I see they expose the models via Seldon and the Ambassador API Gateway. This is similar to the QuaroML stack that I built, and a very convenient and easy way to expose and, of course, scale the models.
Maybe we can introduce MLEM inside the TPI in conjunction with Consul

We need to get back to this discussion to shape the vision for the MLEM deployment story. It's best to do this as soon as we finish the closed alpha release.
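
For reference, a Seldon-based setup like the one OpenMlOps describes boils down to applying a SeldonDeployment custom resource to a cluster where Seldon Core (with Ambassador as the gateway) is already installed. A minimal sketch using the official `kubernetes` Python client; the namespace, resource names, and container image are placeholder assumptions, not an MLEM design:

```python
# Minimal sketch: create a SeldonDeployment via the Kubernetes API.
# Assumes Seldon Core + Ambassador are installed in the cluster;
# the "models" namespace and the container image are hypothetical.
from kubernetes import client, config

seldon_deployment = {
    "apiVersion": "machinelearning.seldon.io/v1",
    "kind": "SeldonDeployment",
    "metadata": {"name": "my-model", "namespace": "models"},
    "spec": {
        "predictors": [
            {
                "name": "default",
                "replicas": 1,
                "graph": {"name": "classifier", "type": "MODEL"},
                "componentSpecs": [
                    {
                        "spec": {
                            "containers": [
                                {"name": "classifier", "image": "myrepo/my-model:latest"}
                            ]
                        }
                    }
                ],
            }
        ]
    },
}

config.load_kube_config()  # or load_incluster_config() when running in-cluster
client.CustomObjectsApi().create_namespaced_custom_object(
    group="machinelearning.seldon.io",
    version="v1",
    namespace="models",
    plural="seldondeployments",
    body=seldon_deployment,
)
```

Once applied, Seldon Core's operator creates the serving pods and Ambassador routes external traffic to them, which is what makes scaling the exposed models convenient.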

@aguschin aguschin transferred this issue from another repository Sep 30, 2021
@aguschin aguschin added the deploy Related to model deployment label Sep 30, 2021
@aguschin aguschin added this to the Beta release milestone Oct 11, 2021
@aguschin aguschin added the p2-medium Medium priority label Nov 23, 2021
@aguschin aguschin changed the title Develop understanding of advanced deployment functionality Deploying models with seldon-core Aug 1, 2022
@aguschin aguschin added plugins Plugins and extensions for MLEM! and removed p2-medium Medium priority labels Aug 1, 2022
@aguschin
Contributor Author

aguschin commented Aug 1, 2022

Maybe we can introduce MLEM inside the TPI in conjunction with Consul

@DavidGOrtega, I'm getting back to your comment since @mike0sv is working on Deployment 2.0 in MLEM. Could you please elaborate on your idea? How do you see this working?

Besides that, could MLEM integrate with TPI to deploy to AWS EC2 or GCP?

@aguschin
Contributor Author

aguschin commented Aug 1, 2022

@casperdcl @0x2b3bfa0 @DavidGOrtega

Also, we're now implementing deployment to SageMaker. To do that end-to-end, we need to provision some AWS resources for the user (IAM roles, policies, etc.). Can we use TPI to do that? Or do you plan to implement an option to provision these?
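
For concreteness, the provisioning in question looks roughly like the `boto3` sketch below: create an execution role that SageMaker can assume, then attach permissions to it. The role name and the broad managed policy are illustrative assumptions, not necessarily what MLEM would ship:

```python
# Sketch of the AWS resources MLEM would need to provision for a SageMaker
# deployment: an execution role SageMaker can assume, plus attached policies.
# The role name and the full-access managed policy are assumptions; a real
# setup would likely attach narrower, purpose-built policies.
import json
import boto3

iam = boto3.client("iam")

trust_policy = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Principal": {"Service": "sagemaker.amazonaws.com"},
            "Action": "sts:AssumeRole",
        }
    ],
}

iam.create_role(
    RoleName="mlem-sagemaker-execution",
    AssumeRolePolicyDocument=json.dumps(trust_policy),
    Description="Execution role for MLEM SageMaker deployments",
)

iam.attach_role_policy(
    RoleName="mlem-sagemaker-execution",
    PolicyArn="arn:aws:iam::aws:policy/AmazonSageMakerFullAccess",
)
```
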

@casperdcl
Contributor

Looks like 3 different feature requests:

  1. "exposing the models via Seldon and Ambassador API Gateway [...] similar to the QuaroML stack [...] introduce MLEM inside the TPI in conjunction with Consul"
    • not sure I follow. Is this a new cloud feature request on the TPI repo @DavidGOrtega?
  2. "MLEM integrate with TPI to deploy to AWS EC2 or GCP"
    • sure, TPI can: provision an instance, upload a workdir, run a script on the instance, auto-recover from spot interruptions (restores the workdir & re-runs the script), auto-clean-up on script exit, and download the workdir (see the sketch after this list)
    • requirement: provide cloud (AWS/Azure/GCP) credentials via environment variables [1]
    • you can use docker [2]
    • you can avoid having to configure/expose service ports on AWS by using a (free) port-forwarding service [3]
    • you can use the Python wrapper (pip install tpi), which auto-detects the OS, downloads/caches terraform binaries, installs TPI, and even has a Python API so you don't need to run the CLI explicitly
  3. "deployment to Sagemaker. To do that e2e we need to provision some AWS resources for the user (Roles, Policies, etc)"

Footnotes

  1. https://registry.terraform.io/providers/iterative/iterative/latest/docs/guides/authentication

  2. https://dvc.org/blog/local-experiments-to-cloud-with-tpi-docker

  3. https://github.com/iterative/blog-tpi-jupyter

@aguschin aguschin added the p2-medium Medium priority label Nov 9, 2022
@aguschin aguschin removed this from the Q3 milestone Nov 9, 2022