Cardinality aggregation #5426

jpountz · 2014-03-13T18:19:46Z

The cardinality aggregation is a metric aggregation that allows to compute approximate unique counts based on the HyperLogLog++ algorithm which has the nice properties of both being close to accurate on low cardinalities and having fixed memory usage so that estimating high cardinalities doesn't blow up memory.

Example:

{
    "aggs" : {
        "author_count" : { 
            "cardinality" : { 
                "field" : "author"
            }
        }
    }
}

This aggregation computes unique term counts using the hyperloglog++ algorithm which uses linear counting to estimate low cardinalities and hyperloglog on higher cardinalities. Since this algorithm works on hashes, it is useful for high-cardinality fields to store the hash of values directly in the index, which is the purpose of the new `murmur3` field type. This is less necessary on low-cardinality string fields because the aggregator is smart enough to only compute the hash once per unique value per segment thanks to ordinals, or on numeric fields since hashing them is very fast. Close #5426

jpountz closed this as completed in 5821fa0 Mar 13, 2014

This was referenced Mar 13, 2014

Unique count for a field per shard #1211

Closed

Count distinct by field #5322

Closed

Term Count API #640

Closed

Terms facet: count and from features #1044

Closed

jpountz added v1.2.0 and removed v1.2.0 labels Mar 13, 2014

karmi mentioned this issue Mar 14, 2014

terms facet gives wrong count with n_shards > 1 #1305

Closed

clintongormley added the feature label Mar 21, 2014

clintongormley added the :Analytics/Aggregations Aggregations label Jun 6, 2015

$@polyfractal$ polyfractal mentioned this issue Aug 19, 2020

Datasketches HllSketch aggregation for ElasticSearch #61006

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cardinality aggregation #5426

Cardinality aggregation #5426

jpountz commented Mar 13, 2014

Cardinality aggregation #5426

Cardinality aggregation #5426

Comments

jpountz commented Mar 13, 2014