From e32cba0ada29d83f130831fab600c2cd7856f9e4 Mon Sep 17 00:00:00 2001 From: "Arun C. Murthy" Date: Tue, 18 Jul 2023 21:10:15 -0700 Subject: [PATCH 1/2] Clarified current self-hosting features. --- docs/index.md | 9 +++++---- 1 file changed, 5 insertions(+), 4 deletions(-) diff --git a/docs/index.md b/docs/index.md index 798519ee..4358833e 100644 --- a/docs/index.md +++ b/docs/index.md @@ -48,10 +48,11 @@ auto-scaling deployment with simple APIs. ### Features Coming Soon -**Kubernetes Installation Documentation**: We are working hard to document the installation and -maintenance of inference and fine-tuning functionality on your infrastructure. -For now, our documentation covers using our client libraries to access Scale's -hosted infrastructure. +**Kubernetes Installation Enhancements**: We are working hard to enhance the +installation and maintenance of inference and fine-tuning functionality on +your infrastructure. For now, our documentation covers _experimental_ libraries +to [deploy language models on your infrastructure](../guides/self_hosting) +and libraries to access Scale's [hosted infrastructure](https://spellbook.scale.com). **Fast Cold-Start Times**: To prevent GPUs from idling, LLM Engine automatically scales your model to zero when it's not in use and scales up From 0177af38a930c4a630d092db930c544ba8a21f40 Mon Sep 17 00:00:00 2001 From: "Arun C. Murthy" Date: Tue, 18 Jul 2023 21:14:39 -0700 Subject: [PATCH 2/2] Fixed link --- docs/index.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/index.md b/docs/index.md index 4358833e..01d4f84e 100644 --- a/docs/index.md +++ b/docs/index.md @@ -51,7 +51,7 @@ auto-scaling deployment with simple APIs. **Kubernetes Installation Enhancements**: We are working hard to enhance the installation and maintenance of inference and fine-tuning functionality on your infrastructure. For now, our documentation covers _experimental_ libraries -to [deploy language models on your infrastructure](../guides/self_hosting) +to [deploy language models on your infrastructure](guides/self_hosting) and libraries to access Scale's [hosted infrastructure](https://spellbook.scale.com). **Fast Cold-Start Times**: To prevent GPUs from idling, LLM Engine