Skip to content

Conversation

dan-rubinstein
Copy link
Member

This change introduces:

Logic to allow users to pass in chunking_settings as part of their requests to create an inference endpoint
Implements the above logic within the OpenAI service allowing users to choose between the two existing chunking strategies (word based and sentence based)

These features are currently hidden behind a feature flag for 8.16.

Note: Chunking currently only runs when large documents are ingested to an index with an inference field.

Backport of #112074

…I embedding endpoints (elastic#112074)

* Adding ChunkingSettings logic and enabling ChunkingSettings for OpenAI embedding endpoints

* Cleaning up naming in ChunkingSettings logic

* Incrementing InferenceIndex version

* Removing DefaultChunkingSettings, cleaning up chunking settings class and related tests, add chunking strategy to inference index

* Adding check for up to date index mappings when creating an inference endpoint

* Fixing transport version conflict

* Adding validation for invalid chunking settings inputs and improving error messaging

* Reverting SystemIndexMappingUpdateService changes and adding error messaging on mixed cluster exception
@elasticsearchmachine elasticsearchmachine added v8.16.0 needs:triage Requires assignment of a team area label external-contributor Pull request authored by a developer outside the Elasticsearch team labels Sep 26, 2024
@dan-rubinstein dan-rubinstein added >enhancement :ml Machine learning backport Team:ML Meta label for the ML team auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) v8.16.0 and removed needs:triage Requires assignment of a team area label external-contributor Pull request authored by a developer outside the Elasticsearch team v8.16.0 labels Sep 26, 2024
@elasticsearchmachine elasticsearchmachine merged commit e337ce6 into elastic:8.x Sep 26, 2024
15 checks passed
@dan-rubinstein dan-rubinstein deleted the backport/112074 branch September 26, 2024 14:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) backport >enhancement :ml Machine learning Team:ML Meta label for the ML team v8.16.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants