
[ML] make inference model definitions writeable #96804

Conversation

benwtrent (Member)

Model inference definitions are currently not serializable between nodes.

However, future LTR work requires that inference-optimized models be serializable from the coordinator to the data nodes.

This adds wire serialization code for our ensemble and tree inference models. From a user's perspective, this change does nothing. A minimal sketch of the pattern involved follows below.
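For context, here is a minimal sketch of the Writeable wire-serialization pattern Elasticsearch uses for transport between nodes; the class and field names (TreeNode, splitFeature, threshold) are illustrative only and are not the actual PR code.

```java
import org.elasticsearch.common.io.stream.StreamInput;
import org.elasticsearch.common.io.stream.StreamOutput;
import org.elasticsearch.common.io.stream.Writeable;

import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Illustrative tree node: fields are read back in the same order they were written.
public class TreeNode implements Writeable {
    private final int splitFeature;
    private final double threshold;
    private final List<TreeNode> children;

    public TreeNode(int splitFeature, double threshold, List<TreeNode> children) {
        this.splitFeature = splitFeature;
        this.threshold = threshold;
        this.children = children;
    }

    // Deserialize from the wire.
    public TreeNode(StreamInput in) throws IOException {
        this.splitFeature = in.readVInt();
        this.threshold = in.readDouble();
        int size = in.readVInt();
        List<TreeNode> nodes = new ArrayList<>(size);
        for (int i = 0; i < size; i++) {
            nodes.add(new TreeNode(in));
        }
        this.children = nodes;
    }

    // Serialize to the wire for transport between nodes.
    @Override
    public void writeTo(StreamOutput out) throws IOException {
        out.writeVInt(splitFeature);
        out.writeDouble(threshold);
        out.writeVInt(children.size());
        for (TreeNode child : children) {
            child.writeTo(out);
        }
    }
}
```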

elasticsearchmachine added the Team:ML (Meta label for the ML team) label Jun 13, 2023
elasticsearchmachine (Collaborator)

Pinging @elastic/ml-core (Team:ML)

benwtrent (Member, Author)

Pinging @elastic/es-search (Team:Search)

droberts195 (Contributor) left a comment:

LGTM

There's nothing wrong with this change in itself. However, I think there could be complications with running boosted tree models extensively during searches to support LTR. Autoscaling doesn't take into account the memory required for inference models inside the JVM. Currently it's possible to work around this by disabling autoscaling and selecting bigger nodes manually. In serverless that won't be possible. This isn't a new problem, but introducing LTR could change it from an obscure edge case that hardly affects anyone into a major problem.

benwtrent (Member, Author)

Autoscaling doesn't take into account the memory required for inference models inside the JVM. Currently it's possible to work around this by disabling autoscaling and selecting bigger nodes manually. In serverless that won't be possible.

Good point. I will think a bit more about how we expose this. It would be bad for thousands of search requests to each instantiate the same model needlessly.

benwtrent (Member, Author)

I am going to close this for now and think a bit more.

Regardless of the autoscaling issue, I am not sure we want to serialize models for inference at all without caching them at their destination (as the ModelLoader does).
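As a rough illustration of the caching concern above, here is a hypothetical per-node cache (the names ModelCache, InferenceModel, and loader are made up for this sketch and are not the ModelLoader API): many concurrent search requests should share one deserialized model instance rather than each rebuilding it.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

public class ModelCache {
    private final Map<String, InferenceModel> cache = new ConcurrentHashMap<>();
    private final Function<String, InferenceModel> loader;

    public ModelCache(Function<String, InferenceModel> loader) {
        this.loader = loader;
    }

    // computeIfAbsent builds the model at most once per model ID, even when
    // thousands of searches request it concurrently; later calls reuse it.
    public InferenceModel get(String modelId) {
        return cache.computeIfAbsent(modelId, loader);
    }

    // Stand-in for whatever the deserialized inference model exposes.
    public interface InferenceModel {
        double infer(double[] features);
    }
}
```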

benwtrent closed this Jun 13, 2023
benwtrent deleted the feature/ml-make-inference-models-serializable branch August 29, 2023 17:10
Labels: :ml (Machine learning), >non-issue, Team:ML (Meta label for the ML team), v8.9.0