
[INFERENCE] Batch size for chunked text should be dynamically calculated from the chunk size #135015

@davidkyle

Description

Elasticsearch Version

9.1

Installed Plugins

No response

Java Version

bundled

OS Version

Any

Problem Description

Chunked inputs from semantic_text fields are automatically batched into a single request. If a bulk ingest request contains multiple semantic_text fields, they are batched together up to a certain batch size. The OpenAI embeddings API has a maximum batch size of 2048 inputs, and 2048 is the value used to control the batch size in the OpenAI integration.

The OpenAI embeddings API is also limited to 300,000 tokens per request. If a request contains 2048 inputs, that works out to roughly 146 tokens per input (300,000 / 2048), which is a very small document.

The maximum number of items in a single embedding request needs to respect the 300,000-token limit. In practice this means that a batch size of 2048 will rarely be appropriate and the chunk size should be taken into account when choosing the batch size.
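
As an illustration, the batch size could be derived from the token limit and the chunk size along these lines (a minimal sketch; the function name, default limits, and example chunk size are placeholders, not the actual implementation):

```python
def batch_size_for_chunks(chunk_size_tokens: int,
                          max_inputs_per_request: int = 2048,
                          max_tokens_per_request: int = 300_000) -> int:
    # Cap the number of inputs so that batch_size * chunk_size stays
    # within the provider's per-request token limit.
    by_tokens = max_tokens_per_request // max(chunk_size_tokens, 1)
    return max(1, min(max_inputs_per_request, by_tokens))

# With 250-token chunks the batch shrinks from 2048 to 1200 inputs.
print(batch_size_for_chunks(250))  # -> 1200
```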

Steps to Reproduce

Create an index with a semantic_text field and bulk upload 2000 long documents.
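
A minimal reproduction sketch using the Python client (the index name, field name, and inference endpoint id are placeholders; the endpoint is assumed to point at OpenAI embeddings):

```python
from elasticsearch import Elasticsearch, helpers

es = Elasticsearch("http://localhost:9200")

# Index with a semantic_text field backed by an OpenAI inference endpoint.
es.indices.create(
    index="repro-batching",
    mappings={
        "properties": {
            "content": {
                "type": "semantic_text",
                "inference_id": "my-openai-embeddings",
            }
        }
    },
)

# Bulk-index 2000 long documents; each document chunks into many inputs,
# so at a batch size of 2048 the combined request easily exceeds 300,000 tokens.
long_text = "lorem ipsum dolor sit amet " * 1000
helpers.bulk(
    es,
    (
        {"_index": "repro-batching", "_source": {"content": long_text}}
        for _ in range(2000)
    ),
)
```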

Logs (if relevant)

No response

Labels

:ml Machine learning, >bug, Team:ML (Meta label for the ML team)
