Migrate AI gateway #10420

harupy · 2023-11-16T00:38:33Z

🛠 DevTools 🛠

Install mlflow from this PR

pip install git+https://github.com/mlflow/mlflow.git@refs/pull/10420/merge

Checkout with GitHub CLI

gh pr checkout 10420

%pip install git+https://github.com/mlflow/mlflow.git@refs/pull/10420/merge
# doesn't work if conflicts

# or

%pip install git+https://github.com/mlflow/mlflow.git@refs/pull/10420/head

Related Issues/PRs

#xxx

What changes are proposed in this pull request?

Migrate AI gateway.

How is this PR tested?

Existing unit/integration tests
New unit/integration tests
Manual tests

Does this PR require documentation update?

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

Interface

area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
area/windows: Windows support

Language

language/r: R APIs and clients
language/java: Java APIs and clients
language/new: Proposals for new client languages

Integrations

integrations/azure: Azure and Azure ML integrations
integrations/sagemaker: SageMaker integrations
integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

github-actions · 2023-11-16T00:38:54Z

Documentation preview for db1e9e8 will be available here when this CircleCI job completes successfully.

More info

Ignore this comment if this PR does not change the documentation.
It takes a few minutes for the preview to be available.
The preview is updated when a new commit is pushed to this PR.
This comment was created by https://github.com/mlflow/mlflow/actions/runs/7095023034.

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Signed-off-by: Harutaka Kawamura <hkawamura0130@gmail.com>

Signed-off-by: dbczumar <corey.zumar@databricks.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy · 2023-11-18T01:49:31Z

mlflow/deployments/constants.py

+# TODO: Move this to mlflow.environment_variables before merging to master
+# Specifies the timeout for deployment client APIs to declare a request has timed out
+MLFLOW_DEPLOYMENT_PREDICT_TIMEOUT = _EnvironmentVariable(
+    "MLFLOW_DEPLOYMENT_PREDICT_TIMEOUT", int, 120
+)


Signed-off-by: dbczumar <corey.zumar@databricks.com> Signed-off-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Signed-off-by: Harutaka Kawamura <hkawamura0130@gmail.com> Signed-off-by: mlflow-automation <mlflow-automation@users.noreply.github.com> Co-authored-by: Harutaka Kawamura <hkawamura0130@gmail.com> Co-authored-by: mlflow-automation <mlflow-automation@users.noreply.github.com>

Signed-off-by: dbczumar <corey.zumar@databricks.com> Signed-off-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Signed-off-by: Harutaka Kawamura <hkawamura0130@gmail.com> Co-authored-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Signed-off-by: dbczumar <corey.zumar@databricks.com> Signed-off-by: mlflow-automation <mlflow-automation@users.noreply.github.com> Signed-off-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: mlflow-automation <mlflow-automation@users.noreply.github.com> Co-authored-by: Harutaka Kawamura <hkawamura0130@gmail.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Signed-off-by: dbczumar <corey.zumar@databricks.com> Signed-off-by: mlflow-automation <mlflow-automation@users.noreply.github.com> Signed-off-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: mlflow-automation <mlflow-automation@users.noreply.github.com> Co-authored-by: Harutaka Kawamura <hkawamura0130@gmail.com>

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Signed-off-by: Harutaka Kawamura <hkawamura0130@gmail.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Signed-off-by: Daniel Lok <daniel.lok@databricks.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

#10570) Signed-off-by: Sunish Sheth <sunishsheth2009@gmail.com>

Signed-off-by: dbczumar <corey.zumar@databricks.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Signed-off-by: Sunish Sheth <sunishsheth2009@gmail.com>

Signed-off-by: dbczumar <corey.zumar@databricks.com> Signed-off-by: mlflow-automation <mlflow-automation@users.noreply.github.com> Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Co-authored-by: mlflow-automation <mlflow-automation@users.noreply.github.com> Co-authored-by: harupy <17039389+harupy@users.noreply.github.com>

…s APIs. (langchain-ai#13699) ## Description Related to mlflow/mlflow#10420. MLflow AI gateway will be deprecated and replaced by the `mlflow.deployments` module. Happy to split this PR if it's too large. ``` pip install git+https://github.com/langchain-ai/langchain.git@refs/pull/13699/merge#subdirectory=libs/langchain ``` ## Dependencies Install mlflow from mlflow/mlflow#10420: ``` pip install git+https://github.com/mlflow/mlflow.git@refs/pull/10420/merge ``` ## Testing plan The following code works fine on local and databricks: <details><summary>Click</summary> ```python """ Setup ----- mlflow deployments start-server --config-path examples/gateway/openai/config.yaml databricks secrets create-scope <scope> databricks secrets put-secret <scope> openai-api-key --string-value $OPENAI_API_KEY Run --- python /path/to/this/file.py secrets/<scope>/openai-api-key """ from langchain.chat_models import ChatMlflow, ChatDatabricks from langchain.embeddings import MlflowEmbeddings, DatabricksEmbeddings from langchain.llms import Databricks, Mlflow from langchain.schema.messages import HumanMessage from langchain.chains.loading import load_chain from mlflow.deployments import get_deploy_client import uuid import sys import tempfile from langchain.chains import LLMChain from langchain.prompts import PromptTemplate ############################### # MLflow ############################### chat = ChatMlflow( target_uri="http://127.0.0.1:5000", endpoint="chat", params={"temperature": 0.1} ) print(chat([HumanMessage(content="hello")])) embeddings = MlflowEmbeddings(target_uri="http://127.0.0.1:5000", endpoint="embeddings") print(embeddings.embed_query("hello")[:3]) print(embeddings.embed_documents(["hello", "world"])[0][:3]) llm = Mlflow( target_uri="http://127.0.0.1:5000", endpoint="completions", params={"temperature": 0.1}, ) print(llm("I am")) llm_chain = LLMChain( llm=llm, prompt=PromptTemplate( input_variables=["adjective"], template="Tell me a {adjective} joke", ), ) print(llm_chain.run(adjective="funny")) # serialization/deserialization with tempfile.TemporaryDirectory() as tmpdir: print(tmpdir) path = f"{tmpdir}/llm.yaml" llm_chain.save(path) loaded_chain = load_chain(path) print(loaded_chain("funny")) ############################### # Databricks ############################### secret = sys.argv[1] client = get_deploy_client("databricks") # External - chat name = f"chat-{uuid.uuid4()}" client.create_endpoint( name=name, config={ "served_entities": [ { "name": "test", "external_model": { "name": "gpt-4", "provider": "openai", "task": "llm/v1/chat", "openai_config": { "openai_api_key": "{{" + secret + "}}", }, }, } ], }, ) try: chat = ChatDatabricks( target_uri="databricks", endpoint=name, params={"temperature": 0.1} ) print(chat([HumanMessage(content="hello")])) finally: client.delete_endpoint(endpoint=name) # External - embeddings name = f"embeddings-{uuid.uuid4()}" client.create_endpoint( name=name, config={ "served_entities": [ { "name": "test", "external_model": { "name": "text-embedding-ada-002", "provider": "openai", "task": "llm/v1/embeddings", "openai_config": { "openai_api_key": "{{" + secret + "}}", }, }, } ], }, ) try: embeddings = DatabricksEmbeddings(target_uri="databricks", endpoint=name) print(embeddings.embed_query("hello")[:3]) print(embeddings.embed_documents(["hello", "world"])[0][:3]) finally: client.delete_endpoint(endpoint=name) # External - completions name = f"completions-{uuid.uuid4()}" client.create_endpoint( name=name, config={ "served_entities": [ { "name": "test", "external_model": { "name": "gpt-3.5-turbo-instruct", "provider": "openai", "task": "llm/v1/completions", "openai_config": { "openai_api_key": "{{" + secret + "}}", }, }, } ], }, ) try: llm = Databricks( endpoint_name=name, model_kwargs={"temperature": 0.1}, ) print(llm("I am")) finally: client.delete_endpoint(endpoint=name) # Foundation model - chat chat = ChatDatabricks( endpoint="databricks-llama-2-70b-chat", params={"temperature": 0.1} ) print(chat([HumanMessage(content="hello")])) # Foundation model - embeddings embeddings = DatabricksEmbeddings(endpoint="databricks-bge-large-en") print(embeddings.embed_query("hello")[:3]) # Foundation model - completions llm = Databricks( endpoint_name="databricks-mpt-7b-instruct", model_kwargs={"temperature": 0.1} ) print(llm("hello")) llm_chain = LLMChain( llm=llm, prompt=PromptTemplate( input_variables=["adjective"], template="Tell me a {adjective} joke", ), ) print(llm_chain.run(adjective="funny")) # serialization/deserialization with tempfile.TemporaryDirectory() as tmpdir: print(tmpdir) path = f"{tmpdir}/llm.yaml" llm_chain.save(path) loaded_chain = load_chain(path) print(loaded_chain("funny")) ``` Output: ``` content='Hello! How can I assist you today?' [-0.025058426, -0.01938856, -0.027781019] [-0.025058426, -0.01938856, -0.027781019] sorry, but I cannot continue the sentence as it is incomplete. Can you please provide more information or context? Sure, here's a classic one for you: Why don't scientists trust atoms? Because they make up everything! /var/folders/dz/cd_nvlf14g9g__n3ph0d_0pm0000gp/T/tmpx_4no6ad {'adjective': 'funny', 'text': "Sure, here's a classic one for you:\n\nWhy don't scientists trust atoms?\n\nBecause they make up everything!"} content='Hello! How can I assist you today?' [-0.025058426, -0.01938856, -0.027781019] [-0.025058426, -0.01938856, -0.027781019] a 23 year old female and I am currently studying for my master's degree content="\nHello! It's nice to meet you. Is there something I can help you with or would you like to chat for a bit?" [0.051055908203125, 0.007221221923828125, 0.003879547119140625] [0.051055908203125, 0.007221221923828125, 0.003879547119140625] hello back Well, I don't really know many jokes, but I do know this funny story... /var/folders/dz/cd_nvlf14g9g__n3ph0d_0pm0000gp/T/tmp7_ds72ex {'adjective': 'funny', 'text': " Well, I don't really know many jokes, but I do know this funny story..."} ``` </details> The existing workflow doesn't break: <details><summary>click</summary> ```python import uuid import mlflow from mlflow.models import ModelSignature from mlflow.types.schema import ColSpec, Schema class MyModel(mlflow.pyfunc.PythonModel): def predict(self, context, model_input): return str(uuid.uuid4()) with mlflow.start_run(): mlflow.pyfunc.log_model( "model", python_model=MyModel(), pip_requirements=["mlflow==2.8.1", "cloudpickle<3"], signature=ModelSignature( inputs=Schema( [ ColSpec("string", "prompt"), ColSpec("string", "stop"), ] ), outputs=Schema( [ ColSpec(name=None, type="string"), ] ), ), registered_model_name=f"lang-{uuid.uuid4()}", ) # Manually create a serving endpoint with the registered model and run from langchain.llms import Databricks llm = Databricks(endpoint_name="<name>") llm("hello") # 9d0b2491-3d13-487c-bc02-1287f06ecae7 ``` </details> ## Follow-up tasks (This PR is too large. I'll file a separate one for follow-up tasks.) - Update `docs/docs/integrations/providers/mlflow_ai_gateway.mdx` and `docs/docs/integrations/providers/databricks.md`. --------- Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Signed-off-by: Prithvi Kannan <prithvi.kannan@databricks.com>

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

…s APIs. (langchain-ai#13699) ## Description Related to mlflow/mlflow#10420. MLflow AI gateway will be deprecated and replaced by the `mlflow.deployments` module. Happy to split this PR if it's too large. ``` pip install git+https://github.com/langchain-ai/langchain.git@refs/pull/13699/merge#subdirectory=libs/langchain ``` ## Dependencies Install mlflow from mlflow/mlflow#10420: ``` pip install git+https://github.com/mlflow/mlflow.git@refs/pull/10420/merge ``` ## Testing plan The following code works fine on local and databricks: <details><summary>Click</summary> ```python """ Setup ----- mlflow deployments start-server --config-path examples/gateway/openai/config.yaml databricks secrets create-scope <scope> databricks secrets put-secret <scope> openai-api-key --string-value $OPENAI_API_KEY Run --- python /path/to/this/file.py secrets/<scope>/openai-api-key """ from langchain.chat_models import ChatMlflow, ChatDatabricks from langchain.embeddings import MlflowEmbeddings, DatabricksEmbeddings from langchain.llms import Databricks, Mlflow from langchain.schema.messages import HumanMessage from langchain.chains.loading import load_chain from mlflow.deployments import get_deploy_client import uuid import sys import tempfile from langchain.chains import LLMChain from langchain.prompts import PromptTemplate ############################### # MLflow ############################### chat = ChatMlflow( target_uri="http://127.0.0.1:5000", endpoint="chat", params={"temperature": 0.1} ) print(chat([HumanMessage(content="hello")])) embeddings = MlflowEmbeddings(target_uri="http://127.0.0.1:5000", endpoint="embeddings") print(embeddings.embed_query("hello")[:3]) print(embeddings.embed_documents(["hello", "world"])[0][:3]) llm = Mlflow( target_uri="http://127.0.0.1:5000", endpoint="completions", params={"temperature": 0.1}, ) print(llm("I am")) llm_chain = LLMChain( llm=llm, prompt=PromptTemplate( input_variables=["adjective"], template="Tell me a {adjective} joke", ), ) print(llm_chain.run(adjective="funny")) # serialization/deserialization with tempfile.TemporaryDirectory() as tmpdir: print(tmpdir) path = f"{tmpdir}/llm.yaml" llm_chain.save(path) loaded_chain = load_chain(path) print(loaded_chain("funny")) ############################### # Databricks ############################### secret = sys.argv[1] client = get_deploy_client("databricks") # External - chat name = f"chat-{uuid.uuid4()}" client.create_endpoint( name=name, config={ "served_entities": [ { "name": "test", "external_model": { "name": "gpt-4", "provider": "openai", "task": "llm/v1/chat", "openai_config": { "openai_api_key": "{{" + secret + "}}", }, }, } ], }, ) try: chat = ChatDatabricks( target_uri="databricks", endpoint=name, params={"temperature": 0.1} ) print(chat([HumanMessage(content="hello")])) finally: client.delete_endpoint(endpoint=name) # External - embeddings name = f"embeddings-{uuid.uuid4()}" client.create_endpoint( name=name, config={ "served_entities": [ { "name": "test", "external_model": { "name": "text-embedding-ada-002", "provider": "openai", "task": "llm/v1/embeddings", "openai_config": { "openai_api_key": "{{" + secret + "}}", }, }, } ], }, ) try: embeddings = DatabricksEmbeddings(target_uri="databricks", endpoint=name) print(embeddings.embed_query("hello")[:3]) print(embeddings.embed_documents(["hello", "world"])[0][:3]) finally: client.delete_endpoint(endpoint=name) # External - completions name = f"completions-{uuid.uuid4()}" client.create_endpoint( name=name, config={ "served_entities": [ { "name": "test", "external_model": { "name": "gpt-3.5-turbo-instruct", "provider": "openai", "task": "llm/v1/completions", "openai_config": { "openai_api_key": "{{" + secret + "}}", }, }, } ], }, ) try: llm = Databricks( endpoint_name=name, model_kwargs={"temperature": 0.1}, ) print(llm("I am")) finally: client.delete_endpoint(endpoint=name) # Foundation model - chat chat = ChatDatabricks( endpoint="databricks-llama-2-70b-chat", params={"temperature": 0.1} ) print(chat([HumanMessage(content="hello")])) # Foundation model - embeddings embeddings = DatabricksEmbeddings(endpoint="databricks-bge-large-en") print(embeddings.embed_query("hello")[:3]) # Foundation model - completions llm = Databricks( endpoint_name="databricks-mpt-7b-instruct", model_kwargs={"temperature": 0.1} ) print(llm("hello")) llm_chain = LLMChain( llm=llm, prompt=PromptTemplate( input_variables=["adjective"], template="Tell me a {adjective} joke", ), ) print(llm_chain.run(adjective="funny")) # serialization/deserialization with tempfile.TemporaryDirectory() as tmpdir: print(tmpdir) path = f"{tmpdir}/llm.yaml" llm_chain.save(path) loaded_chain = load_chain(path) print(loaded_chain("funny")) ``` Output: ``` content='Hello! How can I assist you today?' [-0.025058426, -0.01938856, -0.027781019] [-0.025058426, -0.01938856, -0.027781019] sorry, but I cannot continue the sentence as it is incomplete. Can you please provide more information or context? Sure, here's a classic one for you: Why don't scientists trust atoms? Because they make up everything! /var/folders/dz/cd_nvlf14g9g__n3ph0d_0pm0000gp/T/tmpx_4no6ad {'adjective': 'funny', 'text': "Sure, here's a classic one for you:\n\nWhy don't scientists trust atoms?\n\nBecause they make up everything!"} content='Hello! How can I assist you today?' [-0.025058426, -0.01938856, -0.027781019] [-0.025058426, -0.01938856, -0.027781019] a 23 year old female and I am currently studying for my master's degree content="\nHello! It's nice to meet you. Is there something I can help you with or would you like to chat for a bit?" [0.051055908203125, 0.007221221923828125, 0.003879547119140625] [0.051055908203125, 0.007221221923828125, 0.003879547119140625] hello back Well, I don't really know many jokes, but I do know this funny story... /var/folders/dz/cd_nvlf14g9g__n3ph0d_0pm0000gp/T/tmp7_ds72ex {'adjective': 'funny', 'text': " Well, I don't really know many jokes, but I do know this funny story..."} ``` </details> The existing workflow doesn't break: <details><summary>click</summary> ```python import uuid import mlflow from mlflow.models import ModelSignature from mlflow.types.schema import ColSpec, Schema class MyModel(mlflow.pyfunc.PythonModel): def predict(self, context, model_input): return str(uuid.uuid4()) with mlflow.start_run(): mlflow.pyfunc.log_model( "model", python_model=MyModel(), pip_requirements=["mlflow==2.8.1", "cloudpickle<3"], signature=ModelSignature( inputs=Schema( [ ColSpec("string", "prompt"), ColSpec("string", "stop"), ] ), outputs=Schema( [ ColSpec(name=None, type="string"), ] ), ), registered_model_name=f"lang-{uuid.uuid4()}", ) # Manually create a serving endpoint with the registered model and run from langchain.llms import Databricks llm = Databricks(endpoint_name="<name>") llm("hello") # 9d0b2491-3d13-487c-bc02-1287f06ecae7 ``` </details> ## Follow-up tasks (This PR is too large. I'll file a separate one for follow-up tasks.) - Update `docs/docs/integrations/providers/mlflow_ai_gateway.mdx` and `docs/docs/integrations/providers/databricks.md`. --------- Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>

Remove unused loggger

f536157

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

github-actions bot added the rn/none List under Small Changes in Changelogs. label Nov 16, 2023

harupy added the gateway-migration label Nov 16, 2023

harupy and others added 9 commits November 16, 2023 11:34

Add databricks deployments client skeleton + example (#10421)

dce3cb6

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Set up CI (#10422)

856de75

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Signed-off-by: Harutaka Kawamura <hkawamura0130@gmail.com>

Change git labels from gateway -> deployments (#10428)

94ba4d2

Signed-off-by: dbczumar <corey.zumar@databricks.com>

Update gateway embeddings request / response format (#10430)

e450b10

Signed-off-by: dbczumar <corey.zumar@databricks.com>

Fix typo

3fd534f

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Fix typo

ee8daca

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Add MLflowDeploymentClient (still empty) (#10447)

c32b9a8

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Implement CRUD for serving-endpoints (#10425)

8566e3f

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Use Literal for string constants (#10457)

d220673

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy commented Nov 18, 2023

View reviewed changes

dbczumar and others added 14 commits November 19, 2023 19:35

Update gateway completions request / response format (#10465)

1390a2d

Signed-off-by: dbczumar <corey.zumar@databricks.com> Signed-off-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com>

Deprecation warning for mlflow.gateway (#10460)

ac98a64

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Signed-off-by: Harutaka Kawamura <hkawamura0130@gmail.com> Co-authored-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com>

Mark deployment clients as experimental (#10468)

12f2279

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Fix start_server (#10470)

5ccb7f3

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Suppress Pydantic warnings (#10483)

1877ef1

Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>

Implement MLflowDeploymentClient (#10458)

fdb2779

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Set timeout default to None (#10487)

8ca8429

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Rename MLflowDeploymentClient to MlflowDeploymentClient (#10500)

6f97551

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Fix MlflowDeploymentClient.list_endpoints to auto-paginate (#10501)

5472a35

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Use /rate-limits if config only contains rate_limits (#10502)

9fc93d3

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy mentioned this pull request Nov 27, 2023

Migrate mlflow and databricks classes to deployments APIs. langchain-ai/langchain#13699

Merged

Replace TODO in MlflowDeploymentClient (#10496)

bf0782c

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy and others added 13 commits December 1, 2023 13:23

Update docs/source/llms/index.rst (#10551)

4602e81

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Signed-off-by: Harutaka Kawamura <hkawamura0130@gmail.com>

Use MLFLOW_DEPLOYMENTS_TARGET in gateway_proxy_handler (#10554)

54751eb

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Update docs/source/llms/prompt-engineering/index.rst (#10552)

c8dfc06

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Quick fix for promptlab gateway migration (#10563)

d2945a9

Signed-off-by: Daniel Lok <daniel.lok@databricks.com>

Fix completions params (#10565)

631ce09

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Fix incorrect Embeddings param: text -> input (#10566)

ea341fd

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Replace candidates with choices (#10569)

2f0a1fc

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

More gateway replacements (#10568)

1861d5b

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

[Docs] Updating the docs for prompt engineering with MLflow deployment (

beaf89f

#10570) Signed-off-by: Sunish Sheth <sunishsheth2009@gmail.com>

Gateway anthropic small fix for "n" parameter (#10576)

22e791f

Signed-off-by: dbczumar <corey.zumar@databricks.com>

Fix example links (#10561)

0998384

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

[Bug-fix] Fixing the error state bug for Mlflow deployments (#10575)

96a6d2b

Signed-off-by: Sunish Sheth <sunishsheth2009@gmail.com>

harupy and others added 4 commits December 4, 2023 14:13

Fix deployments examples (#10582)

e502ddf

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Support completions endpoints (#10577)

7e99592

Signed-off-by: Prithvi Kannan <prithvi.kannan@databricks.com>

merge master

4ec83b0

Signed-off-by: Prithvi Kannan <prithvi.kannan@databricks.com>

Update docs/source/llms/gateway/migration.rst (#10498)

abb76c9

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy changed the title ~~[DO NOT MERGE YET] Migrate AI gateway~~ Migrate AI gateway Dec 5, 2023

Update quickstart quides (#10593)

db1e9e8

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy merged commit 2ef3a13 into master Dec 5, 2023
44 of 46 checks passed

sainivedh mentioned this pull request Dec 7, 2023

Add Clarifai as a provider in MLflow AI Gateway #10075

Closed

37 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Migrate AI gateway #10420

Migrate AI gateway #10420

harupy commented Nov 16, 2023 •

edited

github-actions bot commented Nov 16, 2023 •

edited

harupy Nov 18, 2023

Migrate AI gateway #10420

Migrate AI gateway #10420

Conversation

harupy commented Nov 16, 2023 • edited

Install mlflow from this PR

Checkout with GitHub CLI

Related Issues/PRs

What changes are proposed in this pull request?

How is this PR tested?

Does this PR require documentation update?

Release Notes

Is this a user-facing change?

What component(s), interfaces, languages, and integrations does this PR affect?

How should the PR be classified in the release notes? Choose one:

github-actions bot commented Nov 16, 2023 • edited

harupy Nov 18, 2023

Choose a reason for hiding this comment

harupy commented Nov 16, 2023 •

edited

github-actions bot commented Nov 16, 2023 •

edited