Improving how search relevance score is calculated for multi_match queries on optional fields #81466

kaykanloo · 2021-12-07T20:08:54Z

Hi,

I might be missing something trivial here, so please bear with me; but the search relevance score for multi_match queries on optional fields (properties in a document that do not necessarily always have a value assigned to them) in specific situations seem to be very different from what end-user might expect, mainly due to the way that IDF is calculated.

Please consider the example below:

Test data:

curl -XDELETE localhost:9200/test-index

curl -XPUT localhost:9200/test-index

curl -XPOST localhost:9200/test-index/_bulk --header 'Content-Type: application/json' -d '
{"index":{"_id":"1"}}
{"required_field":"RareWord"}
{"index":{"_id":"2"}}
{"required_field":"RareWord"}
{"index":{"_id":"3"}}
{"required_field":"CommonWord"}
{"index":{"_id":"4"}}
{"required_field":"CommonWord"}
{"index":{"_id":"5"}}
{"required_field":"CommonWord"}
{"index":{"_id":"6"}}
{"required_field":"CommonWord"}
{"index":{"_id":"7"}}
{"required_field":"CommonWord"}
{"index":{"_id":"8"}}
{"required_field":"CommonWord"}
{"index":{"_id":"9"}}
{"required_field":"CommonWord","optional_field":"RareWord AnotherRareWord"}
{"index":{"_id":"10"}}
{"required_field":"CommonWord","optional_field":"RareWord AnotherRareWord"}
'

Search Query:

If I run a search query similar to one below:

curl -XGET localhost:9200/test-index/_search --header 'Content-Type: application/json' -d '
{
  "query": {
    "multi_match": {
      "query": "RareWord AnotherRareWord",
      "fields": [
        "required_field",
        "optional_field"
      ]
    }
  }
}
'

Expectation

The end-user would expect Document 9 and 10 to score higher than others, because they contain the exact two words of the search query in their optional_field

Reality

Document 1 would score better than 10, even though it only contains one of the the two words of the search query; which is the opposite of what end-users most likely expect.

{
    "took": 4,
    "timed_out": false,
    "_shards": {
        "total": 1,
        "successful": 1,
        "skipped": 0,
        "failed": 0
    },
    "hits": {
        "total": {
            "value": 4,
            "relation": "eq"
        },
        "max_score": 1.4816045,
        "hits": [
            {
                "_index": "test-index",
                "_type": "_doc",
                "_id": "1",
                "_score": 1.4816045,
                "_source": {
                    "required_field": "RareWord"
                }
            },
            {
                "_index": "test-index",
                "_type": "_doc",
                "_id": "2",
                "_score": 1.4816045,
                "_source": {
                    "required_field": "RareWord"
                }
            },
            {
                "_index": "test-index",
                "_type": "_doc",
                "_id": "9",
                "_score": 0.36464313,
                "_source": {
                    "required_field": "CommonWord",
                    "optional_field": "RareWord AnotherRareWord"
                }
            },
            {
                "_index": "test-index",
                "_type": "_doc",
                "_id": "10",
                "_score": 0.36464313,
                "_source": {
                    "required_field": "CommonWord",
                    "optional_field": "RareWord AnotherRareWord"
                }
            }
        ]
    }
}

A closer look at _explain

Here is the _explain results of running the same search query for Document 1:

{
  "_index" : "my-index",
  "_type" : "_doc",
  "_id" : "1",
  "matched" : true,
  "explanation" : {
    "value" : 1.4816045,
    "description" : "max of:",
    "details" : [
      {
        "value" : 1.4816045,
        "description" : "sum of:",
        "details" : [
          {
            "value" : 1.4816045,
            "description" : "weight(required_field:rareword in 0) [PerFieldSimilarity], result of:",
            "details" : [
              {
                "value" : 1.4816045,
                "description" : "score(freq=1.0), computed as boost * idf * tf from:",
                "details" : [
                  {
                    "value" : 2.2,
                    "description" : "boost",
                    "details" : [ ]
                  },
                  {
                    "value" : 1.4816046,
                    "description" : "idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from:",
                    "details" : [
                      {
                        "value" : 2,
                        "description" : "n, number of documents containing term",
                        "details" : [ ]
                      },
                      {
                        "value" : 10,
                        "description" : "N, total number of documents with field",
                        "details" : [ ]
                      }
                    ]
                  },
                  {
                    "value" : 0.45454544,
                    "description" : "tf, computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from:",
                    "details" : [
                      {
                        "value" : 1.0,
                        "description" : "freq, occurrences of term within document",
                        "details" : [ ]
                      },
                      {
                        "value" : 1.2,
                        "description" : "k1, term saturation parameter",
                        "details" : [ ]
                      },
                      {
                        "value" : 0.75,
                        "description" : "b, length normalization parameter",
                        "details" : [ ]
                      },
                      {
                        "value" : 1.0,
                        "description" : "dl, length of field",
                        "details" : [ ]
                      },
                      {
                        "value" : 1.0,
                        "description" : "avgdl, average length of field",
                        "details" : [ ]
                      }
                    ]
                  }
                ]
              }
            ]
          }
        ]
      }
    ]
  }
}

And here is the _explain results of running the same search query for Document 10:

{
  "_index" : "my-index",
  "_type" : "_doc",
  "_id" : "10",
  "matched" : true,
  "explanation" : {
    "value" : 0.36464313,
    "description" : "max of:",
    "details" : [
      {
        "value" : 0.36464313,
        "description" : "sum of:",
        "details" : [
          {
            "value" : 0.18232156,
            "description" : "weight(optional_field:rareword in 9) [PerFieldSimilarity], result of:",
            "details" : [
              {
                "value" : 0.18232156,
                "description" : "score(freq=1.0), computed as boost * idf * tf from:",
                "details" : [
                  {
                    "value" : 2.2,
                    "description" : "boost",
                    "details" : [ ]
                  },
                  {
                    "value" : 0.18232156,
                    "description" : "idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from:",
                    "details" : [
                      {
                        "value" : 2,
                        "description" : "n, number of documents containing term",
                        "details" : [ ]
                      },
                      {
                        "value" : 2,
                        "description" : "N, total number of documents with field",
                        "details" : [ ]
                      }
                    ]
                  },
                  {
                    "value" : 0.45454544,
                    "description" : "tf, computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from:",
                    "details" : [
                      {
                        "value" : 1.0,
                        "description" : "freq, occurrences of term within document",
                        "details" : [ ]
                      },
                      {
                        "value" : 1.2,
                        "description" : "k1, term saturation parameter",
                        "details" : [ ]
                      },
                      {
                        "value" : 0.75,
                        "description" : "b, length normalization parameter",
                        "details" : [ ]
                      },
                      {
                        "value" : 2.0,
                        "description" : "dl, length of field",
                        "details" : [ ]
                      },
                      {
                        "value" : 2.0,
                        "description" : "avgdl, average length of field",
                        "details" : [ ]
                      }
                    ]
                  }
                ]
              }
            ]
          },
          {
            "value" : 0.18232156,
            "description" : "weight(optional_field:anotherrareword in 9) [PerFieldSimilarity], result of:",
            "details" : [
              {
                "value" : 0.18232156,
                "description" : "score(freq=1.0), computed as boost * idf * tf from:",
                "details" : [
                  {
                    "value" : 2.2,
                    "description" : "boost",
                    "details" : [ ]
                  },
                  {
                    "value" : 0.18232156,
                    "description" : "idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from:",
                    "details" : [
                      {
                        "value" : 2,
                        "description" : "n, number of documents containing term",
                        "details" : [ ]
                      },
                      {
                        "value" : 2,
                        "description" : "N, total number of documents with field",
                        "details" : [ ]
                      }
                    ]
                  },
                  {
                    "value" : 0.45454544,
                    "description" : "tf, computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from:",
                    "details" : [
                      {
                        "value" : 1.0,
                        "description" : "freq, occurrences of term within document",
                        "details" : [ ]
                      },
                      {
                        "value" : 1.2,
                        "description" : "k1, term saturation parameter",
                        "details" : [ ]
                      },
                      {
                        "value" : 0.75,
                        "description" : "b, length normalization parameter",
                        "details" : [ ]
                      },
                      {
                        "value" : 2.0,
                        "description" : "dl, length of field",
                        "details" : [ ]
                      },
                      {
                        "value" : 2.0,
                        "description" : "avgdl, average length of field",
                        "details" : [ ]
                      }
                    ]
                  }
                ]
              }
            ]
          }
        ]
      }
    ]
  }
}

As you can see, Document 10 scores worse, mainly due to the lower IDF value (0.18232156). Looking closely, it's because IDF uses N, total number of documents with field: 2 instead of simply considering the total number of the documents in the index: 10.

Feature Request

I was wondering if there is any way that I could force multi_match query to consider all the documents (instead of only those that contain the field) when computing the IDF value for an optional field, hence resulting in a relevance score which is closer to the expectations of the end-users?

Disclaimer:

I originally posted this as a question on the official elasticsearch forum, and stackoverflow but so far I haven't received any suggestions that would indicate there's an easy way of doing what I've described using multi_match. However I admit there might be a better way to write a search query to achieve the intended result, one that I am not aware of and ideally does not rely on using something like combined_fields due to various limitations that would impose.

Any feedback would be greatly appreciated. Thanks.

elasticmachine · 2021-12-07T21:45:11Z

Pinging @elastic/es-search (Team:Search)

kaykanloo added >enhancement needs:triage Requires assignment of a team area label labels Dec 7, 2021

not-napoleon added the :Search/Ranking Scoring, rescoring, rank evaluation. label Dec 7, 2021

elasticmachine added the Team:Search Meta label for search team label Dec 7, 2021

not-napoleon removed the needs:triage Requires assignment of a team area label label Dec 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improving how search relevance score is calculated for multi_match queries on optional fields #81466

Improving how search relevance score is calculated for multi_match queries on optional fields #81466

kaykanloo commented Dec 7, 2021

elasticmachine commented Dec 7, 2021

Improving how search relevance score is calculated for multi_match queries on optional fields #81466

Improving how search relevance score is calculated for multi_match queries on optional fields #81466

Comments

kaykanloo commented Dec 7, 2021

elasticmachine commented Dec 7, 2021