From 86635f5dd109b5ff078ebf624cf85479b6e189e9 Mon Sep 17 00:00:00 2001 From: fpagny Date: Thu, 20 Mar 2025 18:17:08 +0100 Subject: [PATCH 1/2] feat(genapi): update faq --- pages/managed-inference/faq.mdx | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/pages/managed-inference/faq.mdx b/pages/managed-inference/faq.mdx index c57618208b..219fc4d954 100644 --- a/pages/managed-inference/faq.mdx +++ b/pages/managed-inference/faq.mdx @@ -60,6 +60,10 @@ You can select the Instance type based on your model’s computational needs and Billing is based on the Instance type and usage duration. Unlike [Generative APIs](/generative-apis/quickstart/), which are billed per token, Managed Inference provides predictable costs based on the allocated infrastructure. Pricing details can be found on the [Scaleway pricing page](https://www.scaleway.com/en/pricing/model-as-a-service/#managed-inference). +## Can I pause Managed Inference billing when the instance is not in use ? +When a Managed Inference deployment is running, corresponding resources are provisioned and thus billed. Resources can therefore not be paused. +However, you can still optimize your Managed Inference deployment to fit within specific time range (such as during working hours). To do so, you can automate deployment creation and deletion using [Managed Inference API](https://www.scaleway.com/en/developers/api/inference/), [Terraform](https://registry.terraform.io/providers/scaleway/scaleway/latest/docs/resources/inference_deployment) or [Scaleway SDKs](https://www.scaleway.com/en/docs/scaleway-sdk/). These actions can be programmed using [Serverless Jobs]() to be performed periodically in an automated way. + ## Can I run inference on private models? Yes, Managed Inference allows you to deploy private models with access control settings. You can restrict access to specific users, teams, or networks. From d8942dd410ba43a489669132c6f847ff783df429 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?N=C3=A9da?= <87707325+nerda-codes@users.noreply.github.com> Date: Fri, 21 Mar 2025 13:03:41 +0100 Subject: [PATCH 2/2] Update pages/managed-inference/faq.mdx Co-authored-by: Rowena Jones <36301604+RoRoJ@users.noreply.github.com> --- pages/managed-inference/faq.mdx | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/pages/managed-inference/faq.mdx b/pages/managed-inference/faq.mdx index 219fc4d954..38995b0110 100644 --- a/pages/managed-inference/faq.mdx +++ b/pages/managed-inference/faq.mdx @@ -62,7 +62,7 @@ Pricing details can be found on the [Scaleway pricing page](https://www.scaleway ## Can I pause Managed Inference billing when the instance is not in use ? When a Managed Inference deployment is running, corresponding resources are provisioned and thus billed. Resources can therefore not be paused. -However, you can still optimize your Managed Inference deployment to fit within specific time range (such as during working hours). To do so, you can automate deployment creation and deletion using [Managed Inference API](https://www.scaleway.com/en/developers/api/inference/), [Terraform](https://registry.terraform.io/providers/scaleway/scaleway/latest/docs/resources/inference_deployment) or [Scaleway SDKs](https://www.scaleway.com/en/docs/scaleway-sdk/). These actions can be programmed using [Serverless Jobs]() to be performed periodically in an automated way. +However, you can still optimize your Managed Inference deployment to fit within specific time ranges (such as during working hours). To do so, you can automate deployment creation and deletion using the [Managed Inference API](https://www.scaleway.com/en/developers/api/inference/), [Terraform](https://registry.terraform.io/providers/scaleway/scaleway/latest/docs/resources/inference_deployment) or [Scaleway SDKs](https://www.scaleway.com/en/docs/scaleway-sdk/). These actions can be programmed using [Serverless Jobs](/serverless-jobs/) to be automatically carried out periodically. ## Can I run inference on private models? Yes, Managed Inference allows you to deploy private models with access control settings. You can restrict access to specific users, teams, or networks.