Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Seeding index with a precalculated KNN graph #81

Open
parashardhapola opened this issue Oct 14, 2019 · 1 comment
Open

Seeding index with a precalculated KNN graph #81

parashardhapola opened this issue Oct 14, 2019 · 1 comment

Comments

@parashardhapola
Copy link

Hi,

Sorry if this sounds completely outlandish. This may be somewhat related to #79.

The idea is to provide a pre-calculated graph to the index. And at this 'index seeding' step, a different distance metric than the one used to calculate the seed graph may be used. Thereafter, this graph can be updated with further data batches or/and queried.
Do you think the current implementation of pynndescent makes this feasible? If yes, what would be assumptions about the seed graph that one might need to take care of?

@parashardhapola parashardhapola changed the title Seeding index with a calculated KNN graph Seeding index with a precalculated KNN graph Oct 14, 2019
@lmcinnes
Copy link
Owner

That's not a bad idea if people have a KNN graph already. So right now we "seed" a knn-graph based on results from an rp-forest. It would not be hard, I think to add an alternative to allow a user supplied knn-graph. Do you have an interest in attempting a pull request?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants