PCA-KL: a relative entropy dimensionality reduction method for unsupervised metric learning

Python code for the paper:

Levada, A.L.M. PCA-KL: a parametric dimensionality reduction approach for unsupervised metric learning. Advances in Data Analysis and Classification, 15, 829–868 (2021). https://doi.org/10.1007/s11634-020-00434-3

PCA-KL is a parametric algorithm for unsupervised dimensionality reduction based on the computation of the entropic covariance matrix, a surrogate for the covariance matrix of the data built from the relative entropy (KL divergence) between Gaussian distributions instead of the usual Euclidean distance between data points. The PCA-KL algorithm can be summarized as follows (a code sketch is given after the list):

  1. From the input data, build an undirected proximity graph using the KNN rule;
  2. For each patch (a central point and its neighbors), compute the mean and variance of each feature. For simplicity, a Gaussian model is assumed, but other distributions could be adopted at this stage. This step generates a parametric vector for each patch;
  3. Compute the mean parametric vector over all patches, which represents the average distribution of the whole dataset;
  4. Compute the matrix C, a surrogate for the covariance matrix, based on the relative entropy (KL divergence) between each parametric vector and the average distribution;
  5. Select the d < m eigenvectors associated with the d largest eigenvalues of the surrogate matrix C to compose the projection matrix;
  6. Project the data onto the d-dimensional linear subspace.
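
The sketch below traces these six steps in Python. It is an illustration rather than the paper's reference implementation: the neighborhood size `k`, the feature-wise symmetrized KL divergence, the outer-product construction of the surrogate matrix `C`, and the centering before projection are all assumptions made here for concreteness. Consult the paper (or the code in this repository) for the exact definition of the entropic covariance matrix.

```python
# Minimal PCA-KL sketch, under the assumptions stated above.
import numpy as np
from sklearn.neighbors import NearestNeighbors


def pca_kl(X, d, k=15, eps=1e-9):
    """Reduce the n x m data matrix X to d dimensions (PCA-KL sketch)."""
    n, m = X.shape

    # Step 1: undirected proximity graph via the KNN rule.
    # kneighbors() returns each point itself plus its k nearest neighbors.
    nbrs = NearestNeighbors(n_neighbors=k + 1).fit(X)
    _, idx = nbrs.kneighbors(X)

    # Step 2: fit a Gaussian to each patch, feature by feature, giving a
    # parametric vector (mu_1, ..., mu_m, var_1, ..., var_m) per patch.
    params = np.empty((n, 2 * m))
    for i in range(n):
        patch = X[idx[i]]                        # central point and its neighbors
        params[i, :m] = patch.mean(axis=0)
        params[i, m:] = patch.var(axis=0) + eps  # eps guards against zero variance

    # Step 3: mean parametric vector -> the "average distribution" of the dataset.
    mu_bar = params[:, :m].mean(axis=0)
    var_bar = params[:, m:].mean(axis=0)

    # Closed-form KL divergence between univariate Gaussians, applied feature-wise:
    # KL(p||q) = log(sigma_q / sigma_p) + (var_p + (mu_p - mu_q)^2) / (2 var_q) - 1/2
    def kl_gauss(mu_p, var_p, mu_q, var_q):
        return (0.5 * np.log(var_q / var_p)
                + (var_p + (mu_p - mu_q) ** 2) / (2.0 * var_q) - 0.5)

    # Step 4: surrogate matrix C. Here (an assumption) each patch contributes an
    # m-vector of symmetrized feature-wise divergences from the average
    # distribution, and C averages their outer products, mirroring how the
    # ordinary covariance matrix averages outer products of deviations.
    D = np.empty((n, m))
    for i in range(n):
        mu_i, var_i = params[i, :m], params[i, m:]
        D[i] = 0.5 * (kl_gauss(mu_i, var_i, mu_bar, var_bar)
                      + kl_gauss(mu_bar, var_bar, mu_i, var_i))
    C = D.T @ D / n

    # Step 5: eigenvectors of C associated with the d largest eigenvalues.
    eigvals, eigvecs = np.linalg.eigh(C)         # ascending order (C is symmetric)
    W = eigvecs[:, np.argsort(eigvals)[::-1][:d]]

    # Step 6: project the (centered) data onto the d-dimensional subspace.
    return (X - X.mean(axis=0)) @ W
```

For instance, `pca_kl(load_iris().data, d=2, k=10)` would produce a 2-D embedding of the Iris dataset, used the same way as ordinary PCA with two components.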
