GitHub - luiscarlosgph/latentplot: Python package to plot the latent space of a set of images with different methods.

Description

Python package to plot the latent space of a set of images with different methods.

Install with pip

$ python3 -m pip install latentplot --user

Install from source

$ git clone https://github.com/luiscarlosgph/latentplot.git
$ cd latentplot
$ python3 setup.py install --user

Exemplary code snippet

import latentplot

# List of BGR images of shape (H, W, 3)
images = [ ... ]

# List of vectors of shape (D,), where D is the vector dimension
feature_vectors = [ ... ]

# List of integer class labels
labels = [ ... ]

# Produce a BGR image containing a 2D plot of the latent space with t-SNE
plotter = latentplot.Plotter(method='tsne')  # You can use either 'pca', 'tsne' or 'umap'
im_tsne = plotter.plot(images, feature_vectors, labels)  # Providing labels is optional
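To try the snippet above end to end, one option is to generate placeholder inputs with the expected shapes and write the resulting plot to disk. The sketch below is illustrative only: `make_dummy_inputs` and `plot_and_save` are hypothetical helpers (not part of latentplot), the random vectors stand in for real embeddings, and saving with `cv2.imwrite` assumes the returned plot is a NumPy BGR array, as the comments above suggest.

```python
import numpy as np


def make_dummy_inputs(n=100, height=32, width=32, dim=64, n_classes=10, seed=0):
    """Build random placeholder inputs with the shapes the snippet expects."""
    rng = np.random.default_rng(seed)
    # Random uint8 images of shape (H, W, 3)
    images = [rng.integers(0, 256, size=(height, width, 3), dtype=np.uint8)
              for _ in range(n)]
    # Random feature vectors of shape (D,)
    feature_vectors = [rng.standard_normal(dim) for _ in range(n)]
    # Integer class labels in [0, n_classes)
    labels = [int(c) for c in rng.integers(0, n_classes, size=n)]
    return images, feature_vectors, labels


def plot_and_save(path='latent_pca.png', method='pca'):
    """Plot the dummy latent space and save it (needs latentplot and OpenCV)."""
    import cv2
    import latentplot

    images, feature_vectors, labels = make_dummy_inputs()
    plotter = latentplot.Plotter(method=method)
    im = plotter.plot(images, feature_vectors, labels)
    cv2.imwrite(path, im)  # ASSUMPTION: `im` is a BGR uint8 NumPy array
    return im
```

Calling `plot_and_save()` should write `latent_pca.png` to the current directory. With random features the plot carries no structure; it only checks that the pipeline runs.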

The latentplot.Plotter constructor parameters are:

Parameter name	Description
method	Method used to reduce the feature vectors to a 2D space. Available options: pca, tsne, umap.
width	Desired output image width. Default is 15360 pixels (16K).
height	Desired output image height. Default is 8640 pixels (16K).
dpi	DPI for the output image. Default is 300.
cell_factor	Proportion of the reduced latent space that each cell will occupy. Default is 0.01.
dark_mode	Set it to False to have a white background with black font. Default is True.
hide_axes	Hide axes, ticks and marks. Default is True.
**kwargs	The rest of the arguments you pass will be forwarded to the dimensionality reduction method.
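As a rough illustration of how these parameters combine, the sketch below builds a smaller, light-themed plotter and works out the physical size implied by the defaults. `figure_size_inches` and `make_small_plotter` are hypothetical helpers, and passing `perplexity` assumes that the t-SNE backend accepts that argument (as scikit-learn's `TSNE` does); the README does not state which backend is used.

```python
def figure_size_inches(width_px, height_px, dpi):
    """Physical size implied by a pixel size and a DPI (plain arithmetic)."""
    return width_px / dpi, height_px / dpi


def make_small_plotter():
    """Build a plotter with a smaller, light-themed output (needs latentplot)."""
    import latentplot

    return latentplot.Plotter(
        method='tsne',
        width=3840,        # much smaller than the 15360-pixel default
        height=2160,       # much smaller than the 8640-pixel default
        dpi=300,
        dark_mode=False,   # white background with black font
        perplexity=30,     # ASSUMPTION: forwarded via **kwargs to the t-SNE backend
    )


# Default 15360 x 8640 output at 300 DPI, expressed in inches
default_size = figure_size_inches(15360, 8640, 300)
```

At the default settings this works out to 51.2 x 28.8 inches, which is why reducing `width` and `height` is worth considering for quick previews.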

Exemplary results

CIFAR-10: the size of the images in this dataset is 32x32 pixels. The feature vectors to produce these plots were extracted with InceptionV3 trained on ImageNet. The colour of the rectangle around each image indicates the class label of the image. The colour for each class is randomly chosen in every run.
[Figures: PCA, t-SNE and UMAP plots of CIFAR-10]

Notes on dimensionality reduction methods

PCA (principal component analysis):

PCA assumes linear relationships (correlation) between features. It is sensitive to the scale of the features (features with a wider range tend to dominate the principal components), and it is not robust to outliers.
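The scale sensitivity can be seen in a small NumPy experiment. This is a sketch on synthetic data, not something latentplot does for you: `standardise` and `first_principal_axis` are hypothetical helpers, and whether to standardise real feature vectors before plotting is a judgment call for your data.

```python
import numpy as np


def standardise(X):
    """Rescale every feature (column) to zero mean and unit variance."""
    X = np.asarray(X, dtype=float)
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0  # leave constant features unchanged
    return (X - mean) / std


def first_principal_axis(X):
    """Unit vector of the first principal component, computed via SVD."""
    X = np.asarray(X, dtype=float)
    centred = X - X.mean(axis=0)
    _, _, vt = np.linalg.svd(centred, full_matrices=False)
    return vt[0]


rng = np.random.default_rng(0)
# Two independent features: one with a very wide range, one with a narrow range
X = np.column_stack([rng.normal(0.0, 1000.0, 500), rng.normal(0.0, 1.0, 500)])

raw_axis = first_principal_axis(X)          # almost entirely the wide feature
X_scaled = standardise(X)                   # both features now on the same scale
scaled_axis = first_principal_axis(X_scaled)
```

On the raw data the first axis points almost exactly along the wide-range feature, purely because of its units. After standardising, neither feature dominates by scale alone.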
t-SNE (t-Distributed Stochastic Neighbor Embedding):

t-SNE does not assume linear relationships between features. Observations that are close in the high-dimensional space are expected to be close in the dimensionality-reduced space. t-SNE copes well with odd outliers.
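One way to check how well neighbourhoods survive a reduction is to compare each point's nearest neighbours before and after. The sketch below is a simple hand-rolled measure using NumPy only: `knn_indices` and `neighbour_overlap` are hypothetical helpers, not part of latentplot, and `X_low` would be whatever 2D embedding you obtain separately.

```python
import numpy as np


def knn_indices(X, k):
    """Indices of the k nearest neighbours (Euclidean) of every row of X."""
    X = np.asarray(X, dtype=float)
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)  # a point is not its own neighbour
    return np.argsort(dist, axis=1)[:, :k]


def neighbour_overlap(X_high, X_low, k=5):
    """Mean fraction of each point's k nearest neighbours kept after reduction."""
    nn_high = knn_indices(X_high, k)
    nn_low = knn_indices(X_low, k)
    shared = [len(set(a) & set(b)) for a, b in zip(nn_high, nn_low)]
    return float(np.mean(shared)) / k
```

A value of 1.0 means every point keeps the same k neighbours; values near 0 mean local structure was lost. The pairwise distance matrix uses O(n^2) memory, so this is only practical for a few thousand points.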
UMAP (Uniform Manifold Approximation and Projection):

UMAP is fast and scales well with regard to both dataset size and dimensionality.

Author

Luis Carlos Garcia Peraza Herrera (luiscarlos.gph@gmail.com), 2023.

License

This code repository is shared under an MIT license.
