Remove XMoreLikeThis #10626
Comments
Pinging @elastic/es-search-aggs |
I did a first pass comparing the code of Lucene's MoreLikeThis with our XMoreLikeThis class to find their differences. Apart from a few methods that the current Lucene class (as of the 8.1.0 snapshot) has and that we don't use, the major difference is that we keep a set of "skipTerms" and have a few additional "like(...)" methods that we call in our version of MoreLikeThisQuery. I think a first step towards removal would be to try to move this code out of XMoreLikeThis to make the two classes more similar. |
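To make the "skipTerms" idea concrete, here is a minimal, dependency-free sketch of skipping a set of terms while picking the most interesting terms from a document. It does not use the Lucene or Elasticsearch API: the class name `SkipTermsSketch`, the method `selectTerms`, and ranking by raw term frequency are all assumptions made for illustration (the real classes score terms with TF-IDF and work on fields and index statistics).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; not the XMoreLikeThis implementation.
public class SkipTermsSketch {

    /**
     * Returns up to maxTerms terms ordered by descending frequency,
     * ignoring every term contained in skipTerms.
     */
    static List<String> selectTerms(Map<String, Integer> termFreqs,
                                    Set<String> skipTerms,
                                    int maxTerms) {
        List<Map.Entry<String, Integer>> candidates = new ArrayList<>();
        for (Map.Entry<String, Integer> entry : termFreqs.entrySet()) {
            // This check is the "skipTerms" addition: such terms never become candidates.
            if (skipTerms.contains(entry.getKey())) {
                continue;
            }
            candidates.add(entry);
        }
        // Highest frequency first; ties broken alphabetically for a stable result.
        candidates.sort((a, b) -> {
            int byFreq = Integer.compare(b.getValue(), a.getValue());
            return byFreq != 0 ? byFreq : a.getKey().compareTo(b.getKey());
        });
        List<String> selected = new ArrayList<>();
        for (int i = 0; i < candidates.size() && i < maxTerms; i++) {
            selected.add(candidates.get(i).getKey());
        }
        return selected;
    }

    public static void main(String[] args) {
        Map<String, Integer> termFreqs = Map.of("lucene", 5, "the", 9, "query", 3, "fork", 1);
        Set<String> skipTerms = Set.of("the");
        System.out.println(selectTerms(termFreqs, skipTerms, 2)); // prints [lucene, query]
    }
}
```

If the filtering can be expressed as a step like this outside the term-selection code, the fork would presumably need fewer changes relative to the upstream class.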
I had another, closer look here after a long time. There are two major additions we have on the ES side in XMoreLikeThis:
The diff isn't big. I'm going to open a WIP PR where I use a modified copy of the current Lucene implementation on our side, just to see whether it passes tests and to show the diff. I see two options here:
|
I opened #71117 to better show the diff. |
Pinging @elastic/es-search (Team:Search) |
Hello @javanna @cbuescher @jpountz, is this issue still open? I would like to contribute. |
Given this issue was opened 8 years ago, I am assuming we have reached a point of no return here. Thanks @Kiriakos1998 for your offer. This would involve contributing changes back to Lucene, or breaking the current more like this functionality in Elasticsearch. That is why it's marked high hanging fruit. I would suggest finding another issue, if you don't mind. |
Pinging @elastic/es-search-relevance (Team:Search Relevance) |
This class is a fork of Lucene's MoreLikeThis but has already diverged quite a lot. If we want to be able to maintain this functionality in the long term, we need to merge improvements back to Lucene; otherwise, at some point changes coming from Lucene upgrades will be impossible to reconcile with the changes in this fork.