Aggregations: Return an upper bound of the maximum error for terms #6696

jpountz · 2014-07-02T22:46:05Z

The fact that terms aggregations don't give accurate counts is a bit deceptive. Without changing the way they are implemented, maybe we should make terms aggregations return an upper bound of the maximum error on the document count as part of the response? I think this would help make clear that there are potential accuracy issues, as well as make this inaccuracy easier to manage since there is a known upper bound on the error?

…r the terms aggregation. This is only applicable when the order is set to _count. The upper bound of the error in the doc count is calculated by summing the doc count of the last term on each shard which did not return the term. The implementation calculates the error by summing the doc count for the last term on each shard for which the term IS returned and then subtracts this value from the sum of the doc counts for the last term from ALL shards. Closes #6696

jpountz self-assigned this Jul 2, 2014

jpountz mentioned this issue Jul 2, 2014

Aggregations: Memory-bound terms #6697

Closed

martijnvg added the low hanging fruit label Jul 4, 2014

martijnvg assigned colings86 and unassigned jpountz Jul 4, 2014

martijnvg added v1.4.0 labels Jul 4, 2014

colings86 mentioned this issue Jul 8, 2014

Added an option to show the upper bound of the error for the terms aggregation #6778

Closed

colings86 closed this as completed in 655157c Jul 25, 2014

clintongormley added the >enhancement label Sep 11, 2014

jasontedor mentioned this issue May 12, 2017

Remove Netty logging hack #24653

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Aggregations: Return an upper bound of the maximum error for terms #6696

Aggregations: Return an upper bound of the maximum error for terms #6696

jpountz commented Jul 2, 2014

Navigation Menu

Aggregations: Return an upper bound of the maximum error for terms #6696

Aggregations: Return an upper bound of the maximum error for terms #6696

Comments

jpountz commented Jul 2, 2014