From c9191072796d814b4730b8dfd7746aea833a2af8 Mon Sep 17 00:00:00 2001 From: Adam Demjen Date: Wed, 3 Dec 2025 13:21:57 -0500 Subject: [PATCH 1/4] WIP --- serverless/pages/sign-up.asciidoc | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/serverless/pages/sign-up.asciidoc b/serverless/pages/sign-up.asciidoc index d4dfcad875..e81a2a6b63 100644 --- a/serverless/pages/sign-up.asciidoc +++ b/serverless/pages/sign-up.asciidoc @@ -55,6 +55,10 @@ To learn more about Elastic Cloud Hosted, check our https://www.elastic.co/guide * Search Boost Window is limited to 7 days. This setting only exists in {es-serverless} projects * Scaling is limited for serverless projects in trials. Failures might occur if the workload requires memory or compute beyond what the above search power and search boost window setting limits can provide. +**Inference tokens** + +* TBD + **Remove limitations** Subscribe to https://www.elastic.co/guide/en/cloud/current/ec-billing-details.html[Elastic Cloud] for the following benefits: From d06fe11d28f5ded778828d2cd68f61a20b48bd2f Mon Sep 17 00:00:00 2001 From: Adam Demjen Date: Wed, 3 Dec 2025 13:29:16 -0500 Subject: [PATCH 2/4] Document EIS fair usage limits --- .../deploy/elastic-cloud/create-an-organization.md | 7 +++++++ serverless/pages/sign-up.asciidoc | 5 ++++- 2 files changed, 11 insertions(+), 1 deletion(-) diff --git a/deploy-manage/deploy/elastic-cloud/create-an-organization.md b/deploy-manage/deploy/elastic-cloud/create-an-organization.md index daa6d91e90..d063ef483b 100644 --- a/deploy-manage/deploy/elastic-cloud/create-an-organization.md +++ b/deploy-manage/deploy/elastic-cloud/create-an-organization.md @@ -76,6 +76,13 @@ For more information, check the [{{ech}} documentation](cloud-hosted.md). * Scaling is limited for {{serverless-short}} projects in trials. Failures might occur if the workload requires memory or compute beyond what the above search power and search boost window setting limits can provide. * We monitor token usage per account for the Elastic Managed LLM. If an account uses over one million tokens in 24 hours, we will inform you and then disable access to the LLM. This is in accordance with our fair use policy for trials. +**Inference tokens** + +* You can use these models hosted by the Elastic {{infer-cap}} Service with the following limits: + * **Elastic Managed LLM:** 100 million input tokens in a 24-hour period or 5 million output tokens in a 24-hour period + * **ELSER**: 1 billion tokens in a 24-hour period +* Access to some models may be paused temporarily if either of these limits are exceeded + **Remove limitations** Subscribe to [{{ecloud}}](/deploy-manage/cloud-organization/billing/add-billing-details.md) for the following benefits: diff --git a/serverless/pages/sign-up.asciidoc b/serverless/pages/sign-up.asciidoc index e81a2a6b63..e0b43ed2d8 100644 --- a/serverless/pages/sign-up.asciidoc +++ b/serverless/pages/sign-up.asciidoc @@ -57,7 +57,10 @@ To learn more about Elastic Cloud Hosted, check our https://www.elastic.co/guide **Inference tokens** -* TBD +* You can use these models hosted by the Elastic {infer-cap} Service with the following limits: + * **Elastic Managed LLM:** 100 million input tokens in a 24-hour period or 5 million output tokens in a 24-hour period + * **ELSER**: 1 billion tokens in a 24-hour period +* Access to some models may be paused temporarily if either of these limits are exceeded **Remove limitations** From 16a5e42c5753c2fabc191bafcb649428c54e1b80 Mon Sep 17 00:00:00 2001 From: Adam Demjen Date: Wed, 3 Dec 2025 13:32:56 -0500 Subject: [PATCH 3/4] Revert asciidoc changes --- serverless/pages/sign-up.asciidoc | 5 +---- 1 file changed, 1 insertion(+), 4 deletions(-) diff --git a/serverless/pages/sign-up.asciidoc b/serverless/pages/sign-up.asciidoc index e0b43ed2d8..e81a2a6b63 100644 --- a/serverless/pages/sign-up.asciidoc +++ b/serverless/pages/sign-up.asciidoc @@ -57,10 +57,7 @@ To learn more about Elastic Cloud Hosted, check our https://www.elastic.co/guide **Inference tokens** -* You can use these models hosted by the Elastic {infer-cap} Service with the following limits: - * **Elastic Managed LLM:** 100 million input tokens in a 24-hour period or 5 million output tokens in a 24-hour period - * **ELSER**: 1 billion tokens in a 24-hour period -* Access to some models may be paused temporarily if either of these limits are exceeded +* TBD **Remove limitations** From 9a14257c21ac17d24312b1f2d299ad7ad306c12c Mon Sep 17 00:00:00 2001 From: Adam Demjen Date: Wed, 3 Dec 2025 13:33:19 -0500 Subject: [PATCH 4/4] Revert "WIP" This reverts commit c9191072796d814b4730b8dfd7746aea833a2af8. --- serverless/pages/sign-up.asciidoc | 4 ---- 1 file changed, 4 deletions(-) diff --git a/serverless/pages/sign-up.asciidoc b/serverless/pages/sign-up.asciidoc index e81a2a6b63..d4dfcad875 100644 --- a/serverless/pages/sign-up.asciidoc +++ b/serverless/pages/sign-up.asciidoc @@ -55,10 +55,6 @@ To learn more about Elastic Cloud Hosted, check our https://www.elastic.co/guide * Search Boost Window is limited to 7 days. This setting only exists in {es-serverless} projects * Scaling is limited for serverless projects in trials. Failures might occur if the workload requires memory or compute beyond what the above search power and search boost window setting limits can provide. -**Inference tokens** - -* TBD - **Remove limitations** Subscribe to https://www.elastic.co/guide/en/cloud/current/ec-billing-details.html[Elastic Cloud] for the following benefits: