Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FR] Adding SageMaker provider to MLflow Gateway AI #10351

Open
1 of 22 tasks
pdifranc opened this issue Nov 10, 2023 · 3 comments
Open
1 of 22 tasks

[FR] Adding SageMaker provider to MLflow Gateway AI #10351

pdifranc opened this issue Nov 10, 2023 · 3 comments
Labels
area/deployments MLflow Deployments client APIs, server, and third-party Deployments integrations enhancement New feature or request

Comments

@pdifranc
Copy link
Contributor

pdifranc commented Nov 10, 2023

Willingness to contribute

Yes. I can contribute this feature independently.

Proposal Summary

SageMaker JumpStart provides an easy way to deploy LLM endpoints on the SageMaker managed infrastructure. However, as of today, the MLflow Gateway AI does not implement a SageMaker provider.

We would like to introduce the SageMaker provider and adapter together with an example script to show how you can add a SageMaker hosted model and make it available to the MLflow Gateway AI

Motivation

What is the use case for this feature?

SageMaker jumpstarts supports many LLMs with a one click deployment. Would be nice to have a way to interface with these models in a centralized way as offered by mlflow Gateway AI

Why is this use case valuable to support for MLflow users in general?

Sagemaker is a popular tool, and some models are readily available directly there. Would open up to a great deal of possibilities

Why is this use case valuable to support for your project(s) or organization?

I want to have access to more providers

Why is it currently difficult to achieve this use case?

The provider/adapters should not be too hard, of course the evil is in the details.

Details

I would like to provide a basic SageMaker provider, which would handle the credentials and provide the basic AWS specific credentials mechanisms (similar as done already for Bedrock).
I would then provide two Adapter for two models (thinking on providing samples for the two LLama2 models, likely the 7B one).
This can be a baseline for more models deployable via sagemaker that can be extended. Users might be able to quickly integrate new adapters, requiring some sort of plugin feature, but this might be for the future.

What component(s) does this bug affect?

  • area/artifacts: Artifact stores and artifact logging
  • area/build: Build and test infrastructure for MLflow
  • area/docs: MLflow documentation pages
  • area/examples: Example code
  • area/gateway: AI Gateway service, Gateway client APIs, third-party Gateway integrations
  • area/model-registry: Model Registry service, APIs, and the fluent client calls for Model Registry
  • area/models: MLmodel format, model serialization/deserialization, flavors
  • area/recipes: Recipes, Recipe APIs, Recipe configs, Recipe Templates
  • area/projects: MLproject format, project running backends
  • area/scoring: MLflow Model server, model deployment tools, Spark UDFs
  • area/server-infra: MLflow Tracking server backend
  • area/tracking: Tracking Service, tracking client APIs, autologging

What interface(s) does this bug affect?

  • area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
  • area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
  • area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
  • area/windows: Windows support

What language(s) does this bug affect?

  • language/r: R APIs and clients
  • language/java: Java APIs and clients
  • language/new: Proposals for new client languages

What integration(s) does this bug affect?

  • integrations/azure: Azure and Azure ML integrations
  • integrations/sagemaker: SageMaker integrations
  • integrations/databricks: Databricks integrations
@pdifranc pdifranc added the enhancement New feature or request label Nov 10, 2023
@github-actions github-actions bot added the area/deployments MLflow Deployments client APIs, server, and third-party Deployments integrations label Nov 10, 2023
@kirit93
Copy link

kirit93 commented Nov 13, 2023

+1 for this feature! It would be really helpful to have this.

Copy link

@mlflow/mlflow-team Please assign a maintainer and start triaging this issue.

@pdifranc
Copy link
Contributor Author

pdifranc commented Dec 3, 2023

maybe rather than adding a new model
provider, give the possibility to add new providers via plugins (currently not possible via regular mlflow plugins).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/deployments MLflow Deployments client APIs, server, and third-party Deployments integrations enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants