Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Statistics-based query optimisation #269

Open
kal opened this issue Jan 20, 2016 · 0 comments
Open

Statistics-based query optimisation #269

kal opened this issue Jan 20, 2016 · 0 comments

Comments

@kal
Copy link
Contributor

kal commented Jan 20, 2016

Related to #111

This is a note to myself for future reference...

Tried using the existing store stats for optimization purposes, but currently we aren't really collecting enough statistics. We currently only have a total statements count for each predicate. Ideally we should also collection the number of distinct subjects and objects for each predicate as that would allow us to do a rough estimate of the weight of a pattern with any two of s, p, and o bound. With the current statistics we have to use default weightings for s and o which can throw off the optimizer quite badly.

It would be worth looking at updating collection of store statistics and then revisiting this.

kal pushed a commit that referenced this issue Jan 20, 2016
This work is still incomplete though as testing shows that the optimizer
isn't really doing a good job. This is primarily (I think) because we
don't have enough stats. See issue #269
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant