New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement "interesting terms" in More Like This handler #1412
Comments
+1 nice feature |
+1 I'm excited to see this feature. Still no milestone yet? |
+1 This would be really useful for my project too. |
+100 Must have. Especially if projects migrate from Lucene to Elastic Search |
+1 to get list of "Interesting Terms" |
+1 this would be really useful |
There are two things that we may wish to achieve with "interesting terms" and More Like This.
The former is, I think, what Solr does with "mlt.interestingTerms", while the later could be achieved with "explain". However, "explain" is more for debugging purposes and so its output is very verbose. Perhaps, it would be more desirable to simply return a list of matched interesting terms. Any thoughts? I haven't tested the "mlt.interestingTerms" Solr feature, would anyone know what the exact behavior and output are? |
+1 This is almost must have if one want to move more like this (with boosting) from SOLR to elasticsearch. Any updates on this? |
For Fuzzy Queries: ``` GET /imdb/movies/_validate/query?explain=true { "query": { "fuzzy": { "actors": "kyle" } } } Response: { ... "explanations": [ { "index": "imdb", "valid": true, "explanation": "filtered(actors:eyle^0.75 actors:kale^0.75 actors:kayle^0.75 ... )->cache(_type:movies)" } ] } ``` For More Like This: ``` GET /imdb/movies/_validate/query?explain=true { "query": { "more_like_this": { "like": { "_id": "88247" } } } } Response: { ... "explanations": [ { "index": "imdb", "valid": true, "explanation": "filtered((((title:terminator^3.71334 plot:kyle^1.0604408 plot:cyborg^1.0863208 ... )~2)) -ConstantScore(_uid:movies#88247))->cache(_type:movies)" } ] } ``` Relates to elastic#1412
…d queries This commit adds a `rewrite` parameter to the validate API in order to shown how the given query is re-written into primitive queries. For example, an MLT query is re-written into a disjunction of the selected terms. Other use cases include `fuzzy`, `common_terms`, or `match` query especially with a `cutoff_frequency` parameter. Note that the explanation is only given for a single randomly chosen shard only, so the output may vary from one shard to another. Relates #1412 Closes #10147
…d queries This commit adds a `rewrite` parameter to the validate API in order to shown how the given query is re-written into primitive queries. For example, an MLT query is re-written into a disjunction of the selected terms. Other use cases include `fuzzy`, `common_terms`, or `match` query especially with a `cutoff_frequency` parameter. Note that the explanation is only given for a single randomly chosen shard only, so the output may vary from one shard to another. Relates #1412 Closes #10147
closed by #10147 |
Sorry for bringing this old issue up. |
Solr implements the "interesting terms" for MLT queries: http://wiki.apache.org/solr/MoreLikeThisHandler
"One of: "list", "details", "none" -- this will show what "interesting" terms are used for the MoreLikeThis query. These are the top tf/idf terms. NOTE: if you select 'details', this shows you the term and boost used for each term. Unless mlt.boost=true all terms will have boost=1.0"
The text was updated successfully, but these errors were encountered: