MLflow model adapter #517

fedefernandez · 2023-11-03T16:07:19Z

Overview

This PR adds a new Ktor client's plugin for adapting the Xef request/response model (OpenAI compatible models) to MLflow.

sequenceDiagram
    actor Client
    participant A as XefServer
    participant B as MLflowModelAdapter
    participant C as MLflowGateway
    Client->>+A: OpenAI Request
    A->>+B: OpenAI Request
    B->>+C: MLflow Request
    C-->>-B: MLflow Response
    B-->>-A: OpenAI Response
    A-->>-Client: OpenAI Response

Change details

Simplify ModelUriAdapter.kt by intercepting the "request pipeline" (instead of "send pipeline") since we only need to change the request
Add the models that will be used for transforming Xef requests into MLflow requests and MLflow responses into Xef responses.
Add the MLflowModelAdapter.kt and the builder

How to test it?

Follow the Quickstart Guide for starting an MLflow Gateway instance
Add the interceptors to the Ktor HTTP client in Server

install(com.xebia.functional.xef.server.http.client.ModelUriAdapter) {
  addToPath(
    com.xebia.functional.xef.server.http.client.OpenAIPathType.EMBEDDINGS, 
    "embedding-model" to "http://127.0.0.1:5000/gateway/embeddings/invocations"
  )
  addToPath(
    com.xebia.functional.xef.server.http.client.OpenAIPathType.CHAT, 
    "chat-model" to "http://127.0.0.1:5000/gateway/chat/invocations"
  )
}
install(com.xebia.functional.xef.server.http.client.mlflow.MLflowModelAdapter) {
  addToPath(
    "http://127.0.0.1:5000/gateway/embeddings/invocations",
    com.xebia.functional.xef.server.http.client.OpenAIPathType.EMBEDDINGS
  )
  addToPath(
    "http://127.0.0.1:5000/gateway/chat/invocations",
    com.xebia.functional.xef.server.http.client.OpenAIPathType.CHAT
  )
}

Next Steps

Necessary code for automatically starting Xef server with an MLflow Gateway connection through configuration and presumably just with an environment variable.

raulraja

Looks great @fedefernandez !

fedefernandez added 3 commits November 3, 2023 14:18

Updates adapters

428316c

Completes mlflow model adapter

d881c5a

Restores some unrelated changes with this PR

d00b841

fedefernandez changed the title ~~Feature/model adapter~~ MLflow model adapter Nov 6, 2023

fedefernandez marked this pull request as ready for review November 6, 2023 09:38

fedefernandez marked this pull request as draft November 6, 2023 09:41

fedefernandez marked this pull request as ready for review November 6, 2023 11:41

raulraja approved these changes Nov 6, 2023

View reviewed changes

fedefernandez merged commit 3a8c11a into main Nov 6, 2023
5 checks passed

fedefernandez deleted the feature/model-adapter branch November 6, 2023 16:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MLflow model adapter #517

MLflow model adapter #517

fedefernandez commented Nov 3, 2023 •

edited

raulraja left a comment

MLflow model adapter #517

MLflow model adapter #517

Conversation

fedefernandez commented Nov 3, 2023 • edited

Overview

Change details

How to test it?

Next Steps

raulraja left a comment

Choose a reason for hiding this comment

fedefernandez commented Nov 3, 2023 •

edited