-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
auto sharding strategy for theta sketch #9437
Comments
Here is the doc for the function: https://docs.pinot.apache.org/configuration-reference/functions/distinctcountthetasketch There is one parameter that can be passed in the function: |
May be my question wasn't clear. I understand what theta sketches are , but trying to understand how you build auto sharding for some high cardinality segments when constructing theta sketch , what is considered high cardinality , what thresholds ? |
The implementation for this support is in the |
I was going through the pr : #5316
Can you please point me to how or where is this implemented. How do we define high cardinality threshold
I am running into issues where different sets can be different cardinality and error is high and wanted insights on how to tune theta params during my indexing phase . what is a reasonable theta threshold to decide high cardinality
The text was updated successfully, but these errors were encountered: