Add cross-encoder model documentation #6357

kolchfa-aws · 2024-02-06T00:22:43Z

Closes #6352

Checklist

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and subject to the Developer Certificate of Origin.
For more information on following the Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

HenryL27

Couple tweaks, but otherwise lgtm!

HenryL27 · 2024-02-15T22:30:09Z

_ml-commons-plugin/custom-local-models.md

+
+Cross-encoder models support query reranking. 
+
+To register a cross-encoder model, send a request in the following format. The `model_config` object is optinoal. Cross-encoder models' `function_name` is `TEXT_SIMILARITY`. For example, the following request registers a `ms-marco-TinyBERT-L-2-v2` model:


optinoal -> optional

HenryL27 · 2024-02-15T22:44:05Z

_search-plugins/search-relevance/reranking-search-results.md

@@ -13,7 +13,7 @@ Introduced 2.12
 You can rerank search results using a cross-encoder reranker in order to improve search relevance. To implement reranking, you need to configure a [search pipeline]({{site.url}}{{site.baseurl}}/search-plugins/search-pipelines/index/) that runs at search time. The search pipeline intercepts search results and applies the [`rerank` processor]({{site.url}}{{site.baseurl}}/search-plugins/search-pipelines/rerank-processor/) to them. The `rerank` processor evaluates the search results and sorts them based on the new scores provided by the cross-encoder model. 

 **PREREQUISITE**<br>
-Before using hybrid search, you must set up a cross-encoder model. For more information, see [Choosing a model]({{site.url}}{{site.baseurl}}/ml-commons-plugin/integrating-ml-models/#choosing-a-model).
+Before using hybrid search, you must set up a cross-encoder model. For more information, see [Cross-encoder models]({{site.url}}{{site.baseurl}}/ml-commons-plugin/custom-local-models/#cross-encoder-models).


Prerequisite for reranking, not hybrid search.

Thank you! Good catch.
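
For readers following this thread, the reranking step described in the quoted paragraph can be sketched roughly as follows. This is an illustrative sketch only, not the `rerank` processor's implementation: the `rerank` helper, the `overlap_score` stand-in scorer, and the sample documents are all hypothetical, and a real cross-encoder model would supply the scores.

```python
from typing import Callable


def rerank(hits: list[dict], query: str,
           score_fn: Callable[[str, str], float]) -> list[dict]:
    """Rescore each hit against the query and sort by the new score (hypothetical helper)."""
    # Replace each hit's original score with the score from score_fn.
    rescored = [{**hit, "score": score_fn(query, hit["text"])} for hit in hits]
    # Higher score means higher similarity, so sort in descending order.
    return sorted(rescored, key=lambda h: h["score"], reverse=True)


def overlap_score(query: str, text: str) -> float:
    """Toy stand-in for a cross-encoder: fraction of query words found in the text."""
    query_words = set(query.lower().split())
    text_words = set(text.lower().split())
    return len(query_words & text_words) / len(query_words) if query_words else 0.0


# Hypothetical search results with their original (pre-reranking) scores.
hits = [
    {"id": "1", "text": "how are you", "score": 0.9},
    {"id": "2", "text": "today is sunny", "score": 0.5},
    {"id": "3", "text": "today is july fifth", "score": 0.7},
]

reranked = rerank(hits, "today is sunny", overlap_score)
for hit in reranked:
    print(hit["id"], round(hit["score"], 2))
```

The original scores are discarded and the order is determined solely by the new scores, which mirrors the "sorts them based on the new scores" wording in the quoted paragraph.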

_search-plugins/search-relevance/reranking-search-results.md

Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

_ml-commons-plugin/custom-local-models.md

Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

vagimeli

LGTM, with minimal edits.

vagimeli · 2024-02-15T23:39:07Z

_ml-commons-plugin/custom-local-models.md

+
+Cross-encoder models support query reranking. 
+
+To register a cross-encoder model, send a request in the following format. The `model_config` object is optinoal. Cross-encoder models' `function_name` is `TEXT_SIMILARITY`. For example, the following request registers a `ms-marco-TinyBERT-L-2-v2` model:


Suggested change

-To register a cross-encoder model, send a request in the following format. The `model_config` object is optinoal. Cross-encoder models' `function_name` is `TEXT_SIMILARITY`. For example, the following request registers a `ms-marco-TinyBERT-L-2-v2` model:

+To register a cross-encoder model, send a request in the following format. The `model_config` object is optional. The cross-encoder model's `function_name` is `TEXT_SIMILARITY`. For example, the following request registers a `ms-marco-TinyBERT-L-2-v2` model:

dhrubo-os · 2024-02-15T22:16:03Z

_ml-commons-plugin/custom-local-models.md

+        "embedding_dimension": 1,
+        "framework_type": "huggingface_transformers",
+        "total_chunks":2,
+        "all_config": "{\"total_chunks\":2,\"is_hidden\":false}"


\"is_hidden\":false is unnecessary here.

otherwise looks good.
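
As a rough illustration of the request body under discussion, the following sketch assembles only the fields quoted in this thread, with `is_hidden` dropped from `all_config` as suggested. The `build_register_body` helper and the `name` key are assumptions made for illustration, and a real registration request would likely need further fields that are not shown in this conversation.

```python
import json


def build_register_body(name: str) -> dict:
    """Assemble a partial registration body for a cross-encoder model (hypothetical helper)."""
    # all_config is a JSON string embedded in the body, so serialize it
    # instead of hand-escaping the quotes. is_hidden is intentionally omitted.
    all_config = json.dumps({"total_chunks": 2}, separators=(",", ":"))
    return {
        "name": name,  # assumed key; the thread only quotes the model name in prose
        "function_name": "TEXT_SIMILARITY",
        "model_config": {
            "embedding_dimension": 1,
            "framework_type": "huggingface_transformers",
            "total_chunks": 2,
            "all_config": all_config,
        },
    }


body = build_register_body("ms-marco-TinyBERT-L-2-v2")
print(json.dumps(body, indent=4))
```

Serializing `all_config` with `json.dumps` yields the same escaped string as the diff (`"{\"total_chunks\":2}"`) when the outer body is printed, which avoids manual escaping mistakes.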

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

kolchfa-aws · 2024-02-16T13:30:40Z

Thank you for the quick review, @HenryL27, @dhrubo-os, and @vagimeli!

natebower

@kolchfa-aws Just a few changes. Thanks!

_ml-commons-plugin/custom-local-models.md

natebower · 2024-02-16T13:35:16Z

_ml-commons-plugin/custom-local-models.md

+}
+```
+
+Higher document score means higher similarity. In the preceding response, documents are scored as follows against the query text `today is sunny`:


Either "A higher document score" or "Higher document scores"

natebower · 2024-02-16T13:37:50Z

_ml-commons-plugin/pretrained-models.md

Please define IDF.

_ml-commons-plugin/custom-local-models.md

Co-authored-by: Nathan Bower <nbower@amazon.com>
Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

_ml-commons-plugin/pretrained-models.md

Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

* Add cross-ranking model documentation
  Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
* Model id format
  Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
* Move to custom models
  Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
* Update _search-plugins/search-relevance/reranking-search-results.md
  Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
* Update _ml-commons-plugin/custom-local-models.md
  Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
* Tech review and doc review comments
  Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
* Apply suggestions from code review
  Co-authored-by: Nathan Bower <nbower@amazon.com>
  Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
* Update _ml-commons-plugin/pretrained-models.md
  Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

---------

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>
Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Co-authored-by: Nathan Bower <nbower@amazon.com>

Add cross-ranking model documentation

f91b7d9

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

kolchfa-aws self-assigned this Feb 6, 2024

kolchfa-aws requested review from hdhalter, Naarcha-AWS, vagimeli, AMoo-Miki, natebower and dlvenable as code owners February 6, 2024 00:22

Model id format

7e9d05e

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

hdhalter added the `2 - In progress`, `release-notes`, and `v2.12.0` labels Feb 7, 2024

kolchfa-aws mentioned this pull request Feb 13, 2024

Add documentation for new reranking feature in 2.12 #6368

Merged

1 task

hdhalter added the `3 - Tech review` label and removed the `2 - In progress` label Feb 13, 2024

kolchfa-aws added 2 commits February 15, 2024 16:40

Merge branch 'main' into new-model

3356eca

Move to custom models

8349e34

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

HenryL27 approved these changes Feb 15, 2024

kolchfa-aws commented Feb 16, 2024

_search-plugins/search-relevance/reranking-search-results.md (outdated)

Update _search-plugins/search-relevance/reranking-search-results.md

f1ab8ba

Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

kolchfa-aws commented Feb 16, 2024

_ml-commons-plugin/custom-local-models.md (outdated)

Update _ml-commons-plugin/custom-local-models.md

d63f7f0

Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

vagimeli approved these changes Feb 16, 2024

dhrubo-os reviewed Feb 16, 2024

dhrubo-os approved these changes Feb 16, 2024

Tech review and doc review comments

a812080

Signed-off-by: Fanit Kolchina <kolchfa@amazon.com>

natebower reviewed Feb 16, 2024

kolchfa-aws commented Feb 16, 2024

_ml-commons-plugin/custom-local-models.md (outdated)

Apply suggestions from code review

4d94879

Co-authored-by: Nathan Bower <nbower@amazon.com>
Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

kolchfa-aws commented Feb 16, 2024

_ml-commons-plugin/pretrained-models.md (outdated)

Update _ml-commons-plugin/pretrained-models.md

4d32e7a

Signed-off-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>

kolchfa-aws merged commit e76ec7c into main Feb 16, 2024
4 checks passed

kolchfa-aws deleted the new-model branch March 28, 2024 21:50
