Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Labels #19

Closed
Afbcary opened this issue May 19, 2019 · 2 comments
Closed

Feature Labels #19

Afbcary opened this issue May 19, 2019 · 2 comments

Comments

@Afbcary
Copy link

Afbcary commented May 19, 2019

When I construct the PCA object from a dataset and run getExplainedVariance() I get a number[] of variance values, sorted in descending order. I'd like to know which feature relates to each variance value. I could do this if the report specified the original index of the feature in the sample or allowed me to pass in labels.

I've read the API documentation and pulled down the source code, but I really haven't found much explanation there. Am I missing how to do this? I'd be happy to help improve documentation or add this code if you agree it's missing and necessary.

Use case here (see console)
code

@Afbcary Afbcary closed this as completed May 19, 2019
@N8python
Copy link

How did you solve this? I'm experiencing a similar problem.

@Afbcary
Copy link
Author

Afbcary commented Apr 13, 2020

@N8python

I think when I wrote this I was fundamentally misunderstanding how PCA works. The variance values relate to new composite features that are combinations of the original features.

I found this article illuminating: https://blog.exploratory.io/an-introduction-to-principal-component-analysis-pca-with-2018-world-soccer-players-data-810d84a14eab

In that article, he ascribes meaning to the newly created components by graphing each data point in the original dimensions as well as the new composite dimensions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants