-
Notifications
You must be signed in to change notification settings - Fork 3.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
More accurate dictionary size estimation in RowBasedKeySerde #4768
Comments
What do you mean by "overhead" here? I thought the only concern with it might be that it only supports HotSpot JVM (though I'm not sure). But it could be fine, if Druid doesn't really aim to support other JVMs. Apart from that, it should be perfectly fine to use JOL. Note e. g. that Presto, a high-profile project similar to Druid, uses it. |
Ok, and another issue could be the license, because JOL is GPL. |
I meant the overhead to get the size of each string in dictionary because it will be a very frequent operation.
This is a really sad news.. |
@jihoonson see #4771 |
Or, if still using this approach, |
#5583 might be related as well. |
This issue has been marked as stale due to 280 days of inactivity. It will be closed in 2 weeks if no further activity occurs. If this issue is still relevant, please simply write any comment. Even if closed, you can still revive the issue at any time or discuss it on the dev@druid.apache.org list. Thank you for your contributions. |
Still relevant. |
This issue is no longer marked as stale. |
This issue has been marked as stale due to 280 days of inactivity. It will be closed in 4 weeks if no further activity occurs. If this issue is still relevant, please simply write any comment. Even if closed, you can still revive the issue at any time or discuss it on the dev@druid.apache.org list. Thank you for your contributions. |
This issue has been closed due to lack of activity. If you think that is incorrect, or the issue requires additional review, you can revive the issue at any time. |
The original discussion is #4704 (comment).
RowBasedKeySerde
currently builds a dictionary at query time. To avoid using too many memory, the dictionary size is limited by user configuration. However, the current dictionary size estimation is based on a rough calculation.JOL might be an alternative for more accurate dictionary size estimation if its overhead is not so large.
The text was updated successfully, but these errors were encountered: