-
Notifications
You must be signed in to change notification settings - Fork 975
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimize Facets#getTopDims across Facets implementations [LUCENE-10488] #11524
Comments
Greg Miller (@gsmiller) (migrated from JIRA) Note that I have an open PR that proposes some significant changes to association facets, so might be worth trying to avoid large merge collisions with that if someone jumps on this. |
Yuting Gan (@Yuti-G) (migrated from JIRA) Hi @gsmiller , I have brought the optimization changes of getTopDims from The optimization logic is very similar to what I did in Thanks, Yuting
|
Greg Miller (@gsmiller) (migrated from JIRA) Very exciting. Thanks @Yuti-G! Also, please note that the refactoring change I mentioned above for association facets is now merged (#11480), so it should be easy now to move forward with optimizations there as well if you're interested (or if anyone else is interested). Thanks again! |
ASF subversion and git services (migrated from JIRA) Commit ef43242 in lucene's branch refs/heads/main from Yuting Gan LUCENE-10488: Optimized getTopDims in ConcurrentSSDVFacetCounts (#777) |
ASF subversion and git services (migrated from JIRA) Commit 57f8cb2 in lucene's branch refs/heads/main from Yuting Gan LUCENE-10488: Optimize Facets#getTopDims in IntTaxonomyFacets (#779) |
ASF subversion and git services (migrated from JIRA) Commit f0ec226 in lucene's branch refs/heads/main from Yuting Gan LUCENE-10488: Optimize Facets#getTopDims in FloatTaxonomyFacets (#806) |
ASF subversion and git services (migrated from JIRA) Commit e01b65d in lucene's branch refs/heads/main from Greg Miller CHANGES entry for LUCENE-10488 |
ASF subversion and git services (migrated from JIRA) Commit 87655fd in lucene's branch refs/heads/branch_9x from Greg Miller LUCENE-10488: Optimized Facets#getTopDims for taxonomy faceting and ConcurrentSSDVFacetCounts (#889) Co-authored-by: Yuting Gan <44444710+Yuti-G@users.noreply.github.com> |
Greg Miller (@gsmiller) (migrated from JIRA) Merged to |
Yuting Gan (@Yuti-G) (migrated from JIRA) Thank you so much for reviewing and merging my PRs! I will work on adding getTopDim to benchmarks soon. |
Alan Woodward (@romseygeek) (migrated from JIRA) Bulk close for 9.2.0 release |
#11361 added a new
getTopDims
API, allowing users to specify the number of "top" dimensions they want. The default implementation just delegates togetAllDims
and returns the number of top dims requested, but some Facets sub-classes can do this more optimally. #11361 demonstrated this inSortedSetDocValueFacetCounts
, but we can take it further. There's at least some opportunity to do better in:ConcurrentSortedSetDocValuesFacetCounts
FastTaxonomyFacetCounts
TaxonomyFacetSumFloatAssociations
TaxonomyFacetSumIntAssociations
Migrated from LUCENE-10488 by Greg Miller (@gsmiller), resolved May 13 2022
Pull requests: #777, #779, #806
The text was updated successfully, but these errors were encountered: