
Support distinctCountRawThetaSketch aggregation that returns serialized sketch. #5465

Merged 1 commit on May 31, 2020

mayankshriv
Contributor

  1. Support a variant of the theta-sketch-based distinct count aggregation function that returns
    the serialized bytes of the final aggregated sketch instead of the distinct count itself.

  2. The return value is a hex-encoded String of the serialized sketch bytes. The client can
    decode it back to a byte[] with org.apache.commons.codec.binary.Hex, as:
    Hex.decodeHex(stringValue.toCharArray()). This is the same encoding used for any other
    byte[] value returned by Pinot.

  3. Added a unit test for the new aggregation function.
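
For readers who want to see what the client-side decoding in item 2 amounts to, here is a minimal, dependency-free sketch. It is not code from this PR: the class name `SketchHexDecoder` and the sample value are hypothetical, and the hand-written `decodeHex` only stands in for `Hex.decodeHex(stringValue.toCharArray())` from commons-codec so that the example runs on a plain JDK.

```java
import java.util.Arrays;

// Hypothetical helper, for illustration only; not part of Pinot or this PR.
final class SketchHexDecoder {
    private SketchHexDecoder() {}

    /** Decodes a hex string (upper or lower case) into the raw bytes it encodes. */
    static byte[] decodeHex(String hex) {
        if (hex.length() % 2 != 0) {
            throw new IllegalArgumentException("Hex string must have an even length");
        }
        byte[] bytes = new byte[hex.length() / 2];
        for (int i = 0; i < bytes.length; i++) {
            // Each output byte is built from two hex digits: high nibble, then low nibble.
            int hi = Character.digit(hex.charAt(2 * i), 16);
            int lo = Character.digit(hex.charAt(2 * i + 1), 16);
            if (hi < 0 || lo < 0) {
                throw new IllegalArgumentException("Invalid hex digit near index " + (2 * i));
            }
            bytes[i] = (byte) ((hi << 4) | lo);
        }
        return bytes;
    }

    public static void main(String[] args) {
        // Made-up value standing in for a result cell; a real serialized sketch is much longer.
        String stringValue = "0a1bff";
        byte[] sketchBytes = decodeHex(stringValue);
        System.out.println(Arrays.toString(sketchBytes)); // prints [10, 27, -1]
    }
}
```

The resulting byte[] would then be handed to whichever theta sketch library the client uses to rebuild the sketch; that step is left out here because it depends on the library and version in use.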

@mayankshriv mayankshriv merged commit 01a316e into apache:master May 31, 2020
@mayankshriv mayankshriv deleted the theta-sketch branch May 31, 2020 04:56