
[ML] Automatically download the ELSER model when PUT in _inference #104334

Merged: 7 commits merged into main from _infDownloadElserOnPut on Jan 15, 2024

Conversation

maxhniebergall (Member)

Closes https://github.com/elastic/ml-team/issues/1098

Previously, using the ELSER model in _inference required two steps: 1. put (download) ELSER using the trained models API; 2. put the ELSER model using the _inference API.

With this change, the two steps are combined: to install ELSER in _inference, all one has to do is PUT the ELSER model using the _inference API.

For example,

curl -X PUT "localhost:9200/_inference/sparse_embedding/<model_id>" \
  -H 'Content-Type: application/json' -u <user>:<password> \
  -d '{
  "service": "elser",
  "service_settings": {
    "num_allocations": 1,
    "num_threads": 1
  },
  "task_settings": {}
}'

@elasticsearchmachine added the Team:ML (Meta label for the ML team) label on Jan 12, 2024
@elasticsearchmachine (Collaborator)

Pinging @elastic/ml-core (Team:ML)

@elasticsearchmachine (Collaborator)

Hi @maxhniebergall, I've created a changelog YAML for you.

/**
 * @param modelVariant The configuration of the model variant to be downloaded
 * @param listener The listener
 */
default void putModel(Model modelVariant, ActionListener<Boolean> listener) {
Contributor:
Should we leverage the start method that's in this interface for downloading the model instead of adding a new one?

maxhniebergall (Member, Author):
I think we could use start, though I'm unsure of the benefit. It would reduce the complexity of the interface, and it is a bit odd that startModel in TransportPutInferenceModelAction calls two methods on the InferenceService. However, that also reflects the underlying business logic: start both starts an inference model and puts a trained model definition. Adding a second method to the interface has the benefit of splitting these two objectives into separate methods in ElserMlNodeService (and the other InferenceServices), mirroring the two API calls we make to support the business logic.

So I can't really see why it would be better to avoid adding another method to this interface, but please let me know if I missed something!
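The put-then-start split being discussed can be sketched as a toy interface. This is a hypothetical illustration only: the names ToyInferenceService and PutThenStart are made up, and the real InferenceService uses ActionListener callbacks rather than a Runnable.

```java
import java.util.concurrent.atomic.AtomicBoolean;

// Toy sketch of the two-method split: putModel downloads the trained model
// definition, start deploys it. Illustrative names and signatures only; this
// is not the actual Elasticsearch InferenceService interface.
interface ToyInferenceService {
    // Deploy an already-downloaded model.
    void start(String modelId);

    // Download (put) the trained model definition, then notify the caller.
    // Default no-op for services whose models need no download step.
    default void putModel(String modelId, Runnable onSuccess) {
        onSuccess.run();
    }
}

public class PutThenStart {
    public static void main(String[] args) {
        AtomicBoolean started = new AtomicBoolean(false);
        ToyInferenceService service = modelId -> started.set(true);
        // TransportPutInferenceModelAction-style flow: put first, start on success.
        service.putModel(".elser_model_2", () -> service.start(".elser_model_2"));
        System.out.println(started.get()); // prints "true"
    }
}
```

Each method maps to exactly one underlying API call, which is the separation of concerns argued for above.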

var input = new TrainedModelInput(fieldNames);
var config = TrainedModelConfig.builder().setInput(input).setModelId(modelVariant).build();
PutTrainedModelAction.Request putRequest = new PutTrainedModelAction.Request(config, false, true);
executeAsyncWithOrigin(client, ML_ORIGIN, PutTrainedModelAction.INSTANCE, putRequest, listener.delegateFailure((l, r) -> {
@jonathan-buttner (Contributor):
Hmm, I wonder if we should be using a new origin for inference instead of ML_ORIGIN 🤔 @davidkyle? Or do we need ML_ORIGIN so the permissions are correct?

maxhniebergall (Member, Author):

I didn't really consider using a different origin. I'll have to look into what the impact of the origin is. Thanks for pointing this out!

maxhniebergall (Member, Author):

Looks like there is already an inference origin, so I switched to using that. Thanks @jonathan-buttner!
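For context on why the origin matters: executeAsyncWithOrigin tags the request's thread context with an origin string, and security then authorizes the internal action against that subsystem's privileges rather than the calling user's. A toy model of that idea (hypothetical names and a made-up privilege table, not the real ClientHelper or security internals):

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Toy model of origin-based execution: each origin is granted a set of
// privileges, and an internal action runs with the privileges of the origin
// it was executed under. Illustrative only; the real mechanism lives in
// ClientHelper and the security ThreadContext.
public class OriginDemo {
    // Hypothetical privilege table: which origins may put a trained model.
    static final Map<String, Boolean> CAN_PUT_TRAINED_MODEL = new ConcurrentHashMap<>(
        Map.of("ml", true, "inference", true, "untrusted", false)
    );

    // Run the action as the given origin; reject if that origin lacks the privilege.
    static boolean executeWithOrigin(String origin, Runnable action) {
        if (!CAN_PUT_TRAINED_MODEL.getOrDefault(origin, false)) {
            return false; // rejected: origin not authorized for this action
        }
        action.run();
        return true;
    }

    public static void main(String[] args) {
        // Switching from the "ml" origin to the "inference" origin only works
        // because both origins are granted the required privilege.
        boolean ok = executeWithOrigin("inference", () -> {});
        System.out.println(ok); // prints "true"
    }
}
```

This is why the question above is worth checking: the switch is safe only if the inference origin is granted the same trained-model permissions as ML_ORIGIN.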

@davidkyle (Member) left a comment:

LGTM

@maxhniebergall maxhniebergall merged commit 6a4a22f into main Jan 15, 2024
16 checks passed
@maxhniebergall maxhniebergall deleted the _infDownloadElserOnPut branch January 15, 2024 15:06
jedrazb pushed a commit to jedrazb/elasticsearch that referenced this pull request on Jan 17, 2024:
[ML] Automatically download the ELSER model when PUT in _inference (elastic#104334)

* Automatically download ELSER when PUT in _inference

* Revert "Disable elser download test case in inf IT (elastic#104271)"

* add IT

* disable IT
Labels
>enhancement, :ml (Machine learning), Team:ML (Meta label for the ML team), v8.13.0

4 participants