Utilize `docIdRunEnd` on `ReqExclBulkScorer` #14806

HUSTERGS · 2025-06-18T07:19:37Z

Description

This PR propose to utilize docIdRunEnd on ReqExclBulkScorer, so we can jump faster on MUST_NOT clause

github-actions · 2025-06-18T07:20:32Z

This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog label to it and you will stop receiving this reminder on future updates to the PR.

github-actions · 2025-06-18T09:34:06Z

This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog label to it and you will stop receiving this reminder on future updates to the PR.

github-actions · 2025-06-18T09:35:47Z

This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog label to it and you will stop receiving this reminder on future updates to the PR.

gf2121 · 2025-06-18T12:11:09Z

lucene/core/src/java/org/apache/lucene/search/ReqExclBulkScorer.java

+        if (exclTwoPhase == null) {
+          // from upTo to docIdRunEnd() are excluded, so we scored up to docIdRunEnd()
+          upTo = exclApproximation.docIDRunEnd();
+        } else if (exclTwoPhase.matches()) {


Is upTo = Math.max(upTo + 1, exclTwoPhase.docIdRunEnd()) correct here?

I think it's correct here, and I run lucene test locally, everything looks fine. What I'm concerned here is that only DocValuesRangeIterator implement it's own docIdRunEnd method for now, I'm not sure whether adding this will hurt the performance or not

gf2121 · 2025-06-18T12:18:42Z

lucene/core/src/java/org/apache/lucene/search/ReqExclBulkScorer.java

-        if (exclTwoPhase == null || exclTwoPhase.matches()) {
+        if (exclTwoPhase == null) {
+          // from upTo to docIdRunEnd() are excluded, so we scored up to docIdRunEnd()
+          upTo = exclApproximation.docIDRunEnd();


Do we need to check exclApproximation.docID() != DocIdSetIterator#NO_MORE_DOCS?

If I understand correctly, exclApproximation.docID() should equals exclDoc, which is equal to upTo (under the if clause ), and upTo is less than max, so exclApproximation.docID() should never be DocIdSetIterator#NO_MORE_DOCS?

Oh yes, You are right. Thanks for explanation!

init

025fcc5

github-project-automation bot added this to OpenSearch Lucene & Core Performance Tracking Jun 18, 2025

github-project-automation bot moved this to Open in OpenSearch Lucene & Core Performance Tracking Jun 18, 2025

github-actions bot added the module:core/search label Jun 18, 2025

refactor

8e55fa5

remove comment

aaf4f99

change

dc71858

github-actions bot added this to the 10.3.0 milestone Jun 18, 2025

gf2121 reviewed Jun 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Utilize `docIdRunEnd` on `ReqExclBulkScorer` #14806

Utilize `docIdRunEnd` on `ReqExclBulkScorer` #14806

HUSTERGS commented Jun 18, 2025

Uh oh!

github-actions bot commented Jun 18, 2025

Uh oh!

github-actions bot commented Jun 18, 2025

Uh oh!

github-actions bot commented Jun 18, 2025

Uh oh!

gf2121 Jun 18, 2025

Uh oh!

HUSTERGS Jun 18, 2025

Uh oh!

gf2121 Jun 18, 2025

Uh oh!

HUSTERGS Jun 18, 2025

Uh oh!

gf2121 Jun 19, 2025

Uh oh!

Uh oh!

Utilize docIdRunEnd on ReqExclBulkScorer #14806

Are you sure you want to change the base?

Utilize docIdRunEnd on ReqExclBulkScorer #14806

Conversation

HUSTERGS commented Jun 18, 2025

Description

Uh oh!

github-actions bot commented Jun 18, 2025

Uh oh!

github-actions bot commented Jun 18, 2025

Uh oh!

github-actions bot commented Jun 18, 2025

Uh oh!

gf2121 Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

HUSTERGS Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

gf2121 Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

HUSTERGS Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

gf2121 Jun 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Utilize `docIdRunEnd` on `ReqExclBulkScorer` #14806

Utilize `docIdRunEnd` on `ReqExclBulkScorer` #14806