Optimize BM25 scoring in DAAT MaxScore#1629
Conversation
|
@lyang24 🔍 Important: PR Classification Needed! For efficient project management and a seamless review process, it's essential to classify your PR correctly. Here's how:
For any PR outside the kind/improvement category, ensure you link to the associated issue using the format: “issue: #”. Thanks for your efforts and contribution to the community!. |
|
@lyang24 e2e jenkins job failed, comment |
4712e80 to
897294a
Compare
|
@lyang24 e2e jenkins job failed, comment |
|
/run-e2e |
|
this is great, thanks for the PR! |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: lyang24, zhengbuqian The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
@lyang24 e2e jenkins job failed, comment |
|
waiting for #1619 |
|
@lyang24 |
Signed-off-by: lyang24 <lanqingy93@gmail.com>
897294a to
585548d
Compare
rebased thanks |
|
/lgtm |
|
/tide |
|
@Mergifyio refresh |
✅ Pull request refreshed |
|
/kind improvement |
issue: #1636
BM25 DAAT_MAXSCORE Benchmark: optimize-daat-maxscore-bm25-hoist vs main
Optimization: Hoist BM25 doc-normalization (p2 + p3 * row_sums[doc_id]) out of the per-term inner loop in DaatMaxScoreSearcher, and precompute qval * p1 per cursor
in BM25DimScorer.
Setup
Results