Course project for IE529: Stats of Big data & Clustering, 2017 Fall, UIUC
Name | GitHub Homepage |
---|---|
Jvn Karthik | N/A |
Naman Shukla | namanUIUC |
Shubham Bansal | bansalshubh91 |
Zhenye Na | Zhenye-Na |
Ziyu Zhou | Ziyu0 |
We implement the experiments presented in the paper Nonlinear Component Analysis as a Kernel Eigenvalue Problem by Bernhard Schölkopf, Alexander Smola, and Klaus-Robert Müller, and we also provide our own worked example of Kernel PCA. For details, see our report and our presentation slides.
In order to run the experiments, make sure you have all dependencies installed
- matplotlib (>= 2.0.0)
- scipy (>=0.19.0)
- numpy (>=1.12.1)
- sklearn (>=0.0)
You can install them with pip, for example
pip3 install matplotlib scipy numpy scikit-learn
The programming languages we use are Python and MATLAB. If you do not have access to MATLAB on your laptop, we advise installing Octave
instead. You can refer to this webpage for installation instructions.
In the paper, there are two major experiments:
- Toy example: 4-degree Polynomial Kernel PCA
- Character Recognition (USPS Dataset)
We implemented this part with MATLAB. The code can be found here.
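The toy experiment can be sketched in Python as well. The snippet below is an illustrative stand-in, not the project's MATLAB code: it applies kernel PCA with a degree-4 polynomial kernel to synthetic 2-D data using scikit-learn's `KernelPCA` (the `make_moons` dataset is an assumption chosen for illustration).

```python
# Minimal sketch of the toy experiment: degree-4 polynomial kernel PCA
# on synthetic 2-D data. The project's actual implementation is MATLAB;
# scikit-learn is used here only for illustration.
import numpy as np
from sklearn.datasets import make_moons
from sklearn.decomposition import KernelPCA

# synthetic nonlinearly-separable toy data
X, _ = make_moons(n_samples=100, noise=0.05, random_state=0)

# degree-4 polynomial kernel, keep the first 2 nonlinear components
kpca = KernelPCA(n_components=2, kernel="poly", degree=4)
X_kpca = kpca.fit_transform(X)

print(X_kpca.shape)  # (100, 2)
```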
Principal Component Analysis (PCA) is a dimensionality reduction technique used to transform a high-dimensional dataset into a lower-dimensional subspace, giving a condensed view of the data before running a machine learning algorithm on it. The Iris dataset has 4 dimensions (features) describing three different iris flower species.
Related codes can be found here.
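As a hedged sketch of the step described above, the snippet below uses scikit-learn's built-in Iris dataset and linear PCA to project the 4 features onto a 2-D subspace (this is an illustration, not the project's actual code).

```python
# Illustrative only: linear PCA reducing the 4-feature Iris data to 2-D.
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

X = load_iris().data          # shape (150, 4): 4 features per flower
pca = PCA(n_components=2)     # keep the 2 leading principal components
X_2d = pca.fit_transform(X)

print(X_2d.shape)                            # (150, 2)
print(pca.explained_variance_ratio_.sum())   # fraction of variance retained
```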
The dataset contains numeric data obtained by scanning handwritten digits from envelopes processed by the U.S. Postal Service. The original scanned digits are binary and of different sizes and orientations; the images here have been deslanted and size-normalized, resulting in 16 × 16 grayscale images. We first extract features via Kernel PCA and then feed them to an SVM classifier, training and testing on the split USPS dataset.
Related codes can be found here.
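The Kernel PCA + SVM workflow described above can be sketched as follows. This is an illustrative pipeline, not the project's code: scikit-learn's built-in 8×8 digits dataset stands in for USPS, and the kernel degree, component count, and split ratio are assumptions chosen for demonstration.

```python
# Illustrative pipeline: extract nonlinear features with kernel PCA,
# then train and test an SVM classifier on them. The 8x8 scikit-learn
# digits dataset is used as a stand-in for USPS.
from sklearn.datasets import load_digits
from sklearn.decomposition import KernelPCA
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = load_digits(return_X_y=True)
X = X / 16.0  # scale pixel values to [0, 1] for numerical stability
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

# degree-4 polynomial kernel PCA (fit on train, transform both splits)
kpca = KernelPCA(n_components=40, kernel="poly", degree=4)
X_train_f = kpca.fit_transform(X_train)
X_test_f = kpca.transform(X_test)

# SVM trained on the extracted features
clf = SVC().fit(X_train_f, y_train)
acc = clf.score(X_test_f, y_test)
print(f"test accuracy: {acc:.3f}")
```

Note that the kernel PCA is fit only on the training split; the test split is transformed with the same fitted mapping, avoiding information leakage.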