Retrieving the original index of the samples in subset #18

Closed
hoangthienan95 opened this issue Sep 28, 2020 · 2 comments

Comments

@hoangthienan95

Hi Jacob,

Thank you for a very cool library. I have a quick question. Maybe I missed something, but right now there seems to be no way to retrieve the original indices of the samples in the subset, i.e. to identify which rows of the input the algorithm chose. This need comes up in two scenarios:

  1. When I want to check whether the subset sufficiently covers my data distribution in a 2D UMAP embedding, I need the indices of the selected samples so I can merge them back with the UMAP representation.

  2. When I have features [A, B, C] but only want to run submodular optimization on features [A, B], and then want to look up the values of feature C for all the samples in the subset.

Right now I don't see any way to extract the indices (NumPy array row numbers) of the chosen samples.

@jmschrei
Owner

Howdy

The ranking attribute should return this for you.
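
As a rough illustration of how such an index array could cover the two scenarios from the question, here is a minimal sketch. It assumes `ranking` holds the row indices of the chosen samples in the original array. The array below is a hard-coded stand-in, not actual library output, and the data, column layout, and variable names are all hypothetical.

```python
import numpy as np

# Hypothetical data: 100 samples with three features (columns A, B, C).
rng = np.random.default_rng(0)
X_full = rng.normal(size=(100, 3))
X_ab = X_full[:, :2]  # only A and B would be passed to the selector

# Stand-in for the index array read from the fitted selector
# (assumption: it contains row numbers into the array that was fit).
ranking = np.array([17, 4, 63, 88, 30])

# Scenario 2: look up the withheld feature C for the selected rows.
feature_c = X_full[ranking, 2]

# Scenario 1: build a boolean mask aligned with the original rows, which
# can be joined to any per-sample table such as a 2D embedding.
selected_mask = np.zeros(len(X_full), dtype=bool)
selected_mask[ranking] = True

# Sanity check: indexing the fitted array reproduces the chosen samples.
subset_ab = X_ab[ranking]
```

Because the indices refer to row positions, this only works if the auxiliary array (the embedding or the full feature matrix) is in the same row order as the array that was fit.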

@hoangthienan95
Author

Thanks! I felt like I missed something from the docs.
