Closed
Description
We have a Druid data ingestion use case in which pre-computed HLL indexes can be stored in approximately 6*10^4 rows, whereas preparing the same data in raw format and ingesting it into Druid requires preparing 10^12 rows.
Because of this, we've switched to the thetaSketch aggregator, which allows loading pre-computed theta sketches.
However, our use case only needs the count-distinct feature, so the theta sketch adds too much overhead (it consumes much more space than hyperUnique).
It would be great if:
- Druid allowed ingesting pre-computed hyperUnique values (as thetaSketch does)
- Druid exposed a public API for computing and merging hyperUnique values, so that they can be pre-computed in other systems (for example, in Hive using a UDF)
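
To make the second request concrete, here is a minimal sketch of the compute, merge, and serialize operations such an API would need to cover. Everything in it is an assumption for illustration: the class `SimpleHll`, its method names, the register count, and the MD5-based hash are hypothetical. It is a generic HyperLogLog, not Druid's implementation, and its byte layout is not compatible with the hyperUnique format.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical, illustrative HyperLogLog. NOT Druid's hyperUnique format.
final class SimpleHll {
    private static final int P = 11;        // index bits (assumed value)
    private static final int M = 1 << P;    // 2048 one-byte registers
    private final byte[] registers = new byte[M];

    // Compute step: fold one raw value into the sketch.
    public void add(String value) {
        long h = hash64(value);
        int idx = (int) (h >>> (64 - P));   // top P bits select a register
        long rest = h << P;                 // remaining bits, left-aligned
        // Rank = position of the first 1 bit, capped at the number of remaining bits + 1.
        int rank = Math.min(Long.numberOfLeadingZeros(rest) + 1, 64 - P + 1);
        if (rank > registers[idx]) {
            registers[idx] = (byte) rank;
        }
    }

    // Merge step: the register-wise maximum yields the sketch of the union.
    public void merge(SimpleHll other) {
        for (int i = 0; i < M; i++) {
            registers[i] = (byte) Math.max(registers[i], other.registers[i]);
        }
    }

    // Count-distinct estimate with the standard small-range correction.
    public double estimate() {
        double sum = 0.0;
        int zeros = 0;
        for (byte r : registers) {
            sum += Math.pow(2.0, -r);
            if (r == 0) {
                zeros++;
            }
        }
        double alpha = 0.7213 / (1.0 + 1.079 / M);
        double raw = alpha * M * M / sum;
        if (raw <= 2.5 * M && zeros > 0) {
            return M * Math.log((double) M / zeros);  // linear counting
        }
        return raw;
    }

    // Serialized form that an external system (e.g. a Hive UDF) could store per row.
    public byte[] toBytes() {
        return registers.clone();
    }

    public static SimpleHll fromBytes(byte[] bytes) {
        if (bytes.length != M) {
            throw new IllegalArgumentException("expected " + M + " bytes");
        }
        SimpleHll sketch = new SimpleHll();
        System.arraycopy(bytes, 0, sketch.registers, 0, M);
        return sketch;
    }

    // First 8 bytes of an MD5 digest used as a 64-bit hash (an arbitrary choice).
    private static long hash64(String s) {
        try {
            byte[] digest = MessageDigest.getInstance("MD5")
                    .digest(s.getBytes(StandardCharsets.UTF_8));
            return ByteBuffer.wrap(digest).getLong();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        SimpleHll a = new SimpleHll();
        SimpleHll b = new SimpleHll();
        for (int i = 0; i < 10000; i++) a.add("user-" + i);
        for (int i = 5000; i < 15000; i++) b.add("user-" + i);
        a.merge(b);  // a now approximates the 15000 distinct users
        System.out.println("estimated distinct count: " + a.estimate());
    }
}
```

With something along these lines exposed publicly, an external job could build one sketch per pre-aggregated row, serialize it with `toBytes()`, and hand the bytes to ingestion, which is the workflow the thetaSketch aggregator already supports for theta sketches.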