Add documentation on pointing llmengine client to self-hosted infrastructure #200
Conversation
docs/guides/self_hosting.md (Outdated):

> ### Pointing LLM Engine client to use self-hosted infrastructure
>
> The `llmengine` client makes requests to Scale AI's hosted infrastructure by default. You can have `llmengine` client make requests to your own self-hosted infrastructure by setting the `LLM_ENGINE_BASE_PATH` environment variable to the URL of the `llm-engine` pod. The exact URL of `llm-engine` pod depends on your Kubernetes cluster networking setup.
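For concreteness, a minimal sketch of what the quoted section describes, assuming a hypothetical self-hosted URL of `https://llm-engine.domain.com` and the client's public `Completion.create` API (any API key your deployment requires would also need to be configured):

```python
import os

# Hypothetical URL; the actual value depends on your cluster networking setup.
# Set the variable before importing llmengine, since the client may read
# LLM_ENGINE_BASE_PATH at import time.
os.environ["LLM_ENGINE_BASE_PATH"] = "https://llm-engine.domain.com"

from llmengine import Completion

# The call itself is unchanged; only the base path differs from the
# Scale-hosted default.
response = Completion.create(
    model="llama-2-7b",
    prompt="why is the sky blue?",
    max_new_tokens=10,
)
print(response.output.text)
```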
I'm not 100% sure this would just work out of the box, as the Spellbook API might be different from the API exposed by the llm-engine gateway.
Would having people use https://github.com/scaleapi/launch-python-client and point the gateway_endpoint to the self-hosted llm-engine be better?
I'd have thought we'd want the llmengine client to be able to point at the llmengine server directly, in the sense that if this doesn't work OOB, we should make it work OOB.
IMO, having people use launch-python-client seems kinda ugly.
Let's not mention launch-python-client here. FWIW, the EGP APIs do have an `/llm`-prefixed set of APIs that are at parity with the LLM Engine APIs, so having the LLM Engine client point to either the self-hosted or the EGP-hosted one should still work. There is a separate set of EGP-specific Completion APIs, which is not in play here. cc @felixs8696
Ok! Then I'll keep the documentation as it is and only use the llmengine client.
> the Spellbook API might be different from the API exposed by the llm-engine gateway

We intentionally keep them the same.
docs/guides/self_hosting.md (Outdated):

> ### Pointing LLM Engine client to use self-hosted infrastructure
>
> The `llmengine` client makes requests to Scale AI's hosted infrastructure by default. You can have `llmengine` client make requests to your own self-hosted infrastructure by setting the `LLM_ENGINE_BASE_PATH` environment variable to the URL of the `llm-engine` pod. The exact URL of `llm-engine` pod depends on your Kubernetes cluster networking setup.
> The exact URL of `llm-engine` pod depends on your Kubernetes cluster networking setup.
This might be true, but I think we have some default in our helm chart? @song-william @phil-scale
Also, let's add a code snippet example, which assumes this default.
@yixu34 yep, you're right! The default cluster domain is specified here:

> `dns_host_domain: domain.llm-engine.com`
But do requests sent to the k8s cluster domain get properly routed to the llm-engine gateway? I'm not 100% sure, but I feel like there needs to be some networking config set up to route requests to the gateway?
cc @yunfeng-scale
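One way to answer that question empirically, as a sketch: assuming the `values_sample.yaml` default domain, and assuming the gateway exposes a `/healthcheck` route (an assumption; the exact path may differ per deployment), a 200 response would confirm the domain routes to the gateway:

```python
import os

import requests

# Assumes the helm chart default from values_sample.yaml; whether this hostname
# actually reaches the llm-engine gateway depends on your ingress/DNS setup,
# which is exactly the open question above.
base_path = "https://domain.llm-engine.com"

# /healthcheck is an assumption about the gateway's route; adjust if your
# deployment exposes a different health endpoint.
resp = requests.get(f"{base_path}/healthcheck", timeout=5)
print(resp.status_code)  # 200 means requests to this domain reach the gateway

os.environ["LLM_ENGINE_BASE_PATH"] = base_path
```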
docs/guides/self_hosting.md (Outdated):

> ### Pointing LLM Engine client to use self-hosted infrastructure
>
> The `llmengine` client makes requests to Scale AI's hosted infrastructure by default. You can have `llmengine` client make requests to your own self-hosted infrastructure by setting the `LLM_ENGINE_BASE_PATH` environment variable to the URL of the `llm-engine` pod.
Would suggest replacing "pod" with "service". "Pod" is too specific and low-level; in fact, there may be many pods for a given service.
docs/guides/self_hosting.md (Outdated):

> ### Pointing LLM Engine client to use self-hosted infrastructure
>
> The `llmengine` client makes requests to Scale AI's hosted infrastructure by default. You can have `llmengine` client make requests to your own self-hosted infrastructure by setting the `LLM_ENGINE_BASE_PATH` environment variable to the URL of the `llm-engine` pod.
>
> The exact URL of `llm-engine` pod depends on your Kubernetes cluster networking setup. The domain is specified at `config.values.infra.dns_host_domain` in the helm chart values config file. Using `charts/llm-engine/values_sample.yaml` as an example, you would
Same here, pod -> service.
Also, finish this sentence, e.g. "you would do:"
docs/guides/self_hosting.md (Outdated):

> The exact URL of `llm-engine` service depends on your Kubernetes cluster networking setup. The domain is specified at `config.values.infra.dns_host_domain` in the helm chart values config file. Using `charts/llm-engine/values_sample.yaml` as an example, you would do:
>
> ```bash
> export LLM_ENGINE_BASE_PATH=https://domain.llm-engine.com
> ```
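As a follow-up sanity check, a sketch of verifying that the client actually talks to the self-hosted gateway once the variable is exported (assumes the same shell environment, plus whatever API key the deployment expects):

```python
from llmengine import Model

# Lists the model endpoints the self-hosted gateway knows about; getting any
# well-formed response back confirms the client reached your infrastructure
# rather than the Scale-hosted default.
response = Model.list()
print(response.json())
```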
nit: should this be https://llm-engine.domain.com (in addition to changing the value inside values_sample.yaml)? It feels like llm-engine should be some subdomain of domain.com, not the other way around, if users are self-hosting at domain.com.
Makes sense! Updated!
Add documentation on pointing `llmengine` client to self-hosted infrastructure.