[Serverless]: Provide information on how users can manage ML VCU costs #229

@ppf2

Description

Serverless Docs

Welcome to Elastic Serverless

Description

It would be helpful to add another bullet point under this section, https://www.elastic.co/guide/en/serverless/current/elasticsearch-billing.html#elasticsearch-billing-managing-elasticsearch-costs, describing the two ways to control ML VCU costs:

  • Set adaptive resources to Low so that ML can scale down to 0 allocations when there are no active inference requests
  • When using the inference API with the Elasticsearch or ELSER service, enable adaptive_allocations so that ML can scale the models down to 0 allocations when there are no active inference requests
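
For the second option, the docs could include a short example. Below is a minimal sketch of a create-inference-endpoint request with `adaptive_allocations` enabled. The endpoint name `my-elser-endpoint`, the model ID `.elser_model_2`, the allocation cap of 4, the base URL, and the API key are all illustrative assumptions, not values taken from this issue. The request is only constructed here, not sent.

```python
import json
import urllib.request


def build_adaptive_endpoint_body(model_id: str = ".elser_model_2",
                                 max_allocations: int = 4) -> dict:
    """Request body for PUT _inference/<task_type>/<inference_id>.

    model_id and max_allocations are assumed example values.
    """
    return {
        "service": "elasticsearch",
        "service_settings": {
            "model_id": model_id,
            "num_threads": 1,
            # With adaptive allocations enabled, the number of allocations
            # follows the inference load instead of staying fixed.
            "adaptive_allocations": {
                "enabled": True,
                "max_number_of_allocations": max_allocations,
            },
        },
    }


def build_put_request(base_url: str, api_key: str, inference_id: str,
                      body: dict) -> urllib.request.Request:
    """Build (but do not send) the PUT request for a sparse_embedding endpoint."""
    url = f"{base_url}/_inference/sparse_embedding/{inference_id}"
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"ApiKey {api_key}",
        },
        method="PUT",
    )


# Print the JSON body that would be sent.
print(json.dumps(build_adaptive_endpoint_body(), indent=2))
```

To send the request, pass the result of `build_put_request(...)` to `urllib.request.urlopen` with a real project URL and API key.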

Resources and additional context

https://www.elastic.co/guide/en/serverless/current/elasticsearch-billing.html#elasticsearch-billing-managing-elasticsearch-costs
