Add proper sampling for estimating counts #3

Open
ramou opened this issue Nov 13, 2019 · 0 comments

ramou (Owner) commented Nov 13, 2019

We know how to do this. We've diced up the problem, done a bunch of the proofs, and even written out how we'll do it. Let's do it and squeeze out that extra boost in performance when someone's sorting data that isn't uniformly distributed. It ain't that hard, and we're already paying the cost of the start/end counts because we knew we'd do this eventually.
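For anyone reading along, here is a rough sketch of what "sampling to estimate counts" could look like. It assumes the counts are per-bucket counts for a radix-style pass, which this issue doesn't spell out. The function name, parameters, and the choice of a simple random sample are all hypothetical, and this is not the scheme from the proofs mentioned above.

```python
import random

def estimate_bucket_ranges(keys, sample_size, num_buckets=256, shift=0, seed=0):
    """Estimate per-bucket [start, end) ranges from a random sample of keys.

    Hypothetical helper: assumes non-negative integer keys and a
    power-of-two num_buckets so the bucket index can be taken with a mask.
    """
    n = len(keys)
    if n == 0:
        return [0] * num_buckets, [0] * num_buckets

    rng = random.Random(seed)
    k = min(sample_size, n)          # never sample more than we have
    sample = rng.sample(keys, k)

    # Count how many sampled keys fall in each bucket.
    counts = [0] * num_buckets
    mask = num_buckets - 1
    for key in sample:
        counts[(key >> shift) & mask] += 1

    # Scale the cumulative sample counts up to the full input size.
    # Scaling the running total (not each count) keeps ranges contiguous
    # and guarantees the last range ends exactly at n.
    starts, ends = [], []
    running = 0
    for c in counts:
        starts.append(running * n // k)
        running += c
        ends.append(running * n // k)
    return starts, ends
```

When `sample_size` is at least `len(keys)` the ranges are exact. Otherwise they are only estimates, so a real sort built on them would still need some way to handle a bucket that overflows its estimated range.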

That said, when I do this, is it worth having an explicit uniform-distribution version that skips the start/end counts, to squeeze out that tiny improvement from replacing a memory lookup with some basic arithmetic? I'll decide when I actually do this ticket.
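To make the lookup-versus-arithmetic trade-off concrete, here is a minimal sketch. It assumes the start/end counts are per-bucket offsets into the output, and that under a uniform assumption every bucket gets an equal share. The function names are hypothetical and nothing here is taken from the project's code.

```python
def uniform_bucket_start(bucket, n, num_buckets):
    """Start offset of `bucket` if n keys are spread evenly over the buckets."""
    return bucket * n // num_buckets

def uniform_bucket_end(bucket, n, num_buckets):
    """End offset (exclusive) of `bucket` under the same even-spread assumption."""
    return (bucket + 1) * n // num_buckets

def lookup_bucket_range(bucket, starts, ends):
    """The general version: read the precomputed offsets from memory."""
    return starts[bucket], ends[bucket]
```

The arithmetic version needs no `starts`/`ends` arrays at all, which is the saving being weighed here. Whether it is measurable depends on the real implementation, and it would not show up in Python like this.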
