Principal Component Analysis using Python

Principal Component Analysis, or PCA, is a dimensionality-reduction method that is often used to reduce the dimensionality of large data sets, by transforming a large set of variables into a smaller one that still contains most of the information in the large set.

The stages in this exercise are as follows:

Split the data set.
Train the model without PCA.
Train the model with PCA.
Evaluate the results of the two models.

Model performance without PCA

We will use the Decision Tree model and calculate how accurate it is without using PCA.

Accuracy without PCA is 0.9 or 90%.

Model Performance with PCA

We will use PCA and calculate the variance of each attribute. The results of the variance of each attribute are as follows.

The result is 1 attribute has a variance of 0.931, which means that the attribute stores high information and is much more significant than other attributes. Looking at the previous variances, we can take the best 2 principal components because the total variance when added up is 0.977 which is quite high.

The results of the accuracy test after using PCA are as follows.

In the experiment above, we can see that with only 2 main components or 2 attributes, the model still has a fairly high accuracy, which is 80%. With principal components, you can reduce less significant attributes in predictions and speed up machine learning model training time.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
images		images
README.md		README.md
pca.ipynb		pca.ipynb
pca.py		pca.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Principal Component Analysis using Python

The stages in this exercise are as follows:

Model performance without PCA

Model Performance with PCA

About

Uh oh!

Releases

Packages

Languages

insancs/pca-python

Folders and files

Latest commit

History

Repository files navigation

Principal Component Analysis using Python

The stages in this exercise are as follows:

Model performance without PCA

Model Performance with PCA

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages