NoLiES: The non-linear embeddings surveyor

This repository contains a Jupyter Notebook implementation of the method presented in "Attribute-based Explanation of Non-Linear Embeddings of High-Dimensional Data" by Jan-Tobias Sohns, Michaela Schmitt, Fabian Jirasek, Hans Hasse, and Heike Leitte, submitted to and presented at IEEE VIS 2021.

Requirements

pandas
bokeh
pyviz::panel
scikit-learn
conda-forge::umap-learn
shapely

Interactive Demos

Name	Description	Size (data points x attributes)	Embedding
Iris	Iris dataset	150 x 5	MDS
Wine	UCI wine origin dataset	178 x 14	MDS
OECD Better Life	OECD better life dataset	41 x 11	MDS
Penguins	UMAP projection of the penguin dataset.	344 x 7	UMAP
Covertype	Covertype dataset with 3.5k data points. Updates take a few seconds.	3500 x 11	UMAP
Chemistry	Chemistry dataset of learned features describing activity coefficient compared to chemical class	240 x 4	MDS

Working with your own data

Download the repository and create an environment with the dependencies:

git clone https://github.com/Jan-To/nolies
conda env create -f nolies.yml
conda activate nolies

Make a copy of the template jupyter notebook:

cp template.ipynb my_data.ipynb

Update the notebook to load your data. Open the notebook with jupyter lab or jupyter notebook and edit the section Load data. Important parameters are grouped in the section Parameters and Preprocessing:

jupyter lab

Start the interactive web app:

panel serve --show my_data.ipynb

Citation

If you find this useful, please cite our paper:

@article{
title = {Attribute-based Explanation of Non-Linear Embeddings of High-Dimensional Data},
authors = {J.-T. Sohns and M. Schmitt and F. Jirasek and H. Hasse and H. Leitte},
journal = {IEEE Transactions on Visualization &amp; Computer Graphics},
year = {2022},
volume = {28},
number = {01},
pages = {540-550},
doi = {10.1109/TVCG.2021.3114870},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
month = {jan}
}

Acknowledgements

This work was inspired and supported by:

IRTG 2057
NFDI DataPlant
Dagstuhl workshop

Name	Name	Last commit message	Last commit date
Latest commit Jan-To fixes for deprecated functions May 26, 2023 83a615e · May 26, 2023 History 7 Commits
data	data	add files	Jan 4, 2023
embedding	embedding	fixes for deprecated functions	May 26, 2023
LICENSE	LICENSE	Initial commit	Jan 4, 2023
README.md	README.md	Update Readme to new binder links	Feb 10, 2023
demo_betterlife.ipynb	demo_betterlife.ipynb	fixes for deprecated functions	May 26, 2023
demo_covtype.ipynb	demo_covtype.ipynb	fixes for deprecated functions	May 26, 2023
demo_iris.ipynb	demo_iris.ipynb	fixes for deprecated functions	May 26, 2023
demo_penguins.ipynb	demo_penguins.ipynb	fixes for deprecated functions	May 26, 2023
demo_thermoWithGroups.ipynb	demo_thermoWithGroups.ipynb	fixes for deprecated functions	May 26, 2023
demo_wine.ipynb	demo_wine.ipynb	fixes for deprecated functions	May 26, 2023
environment.yml	environment.yml	update environment.yml	Feb 10, 2023
nolies.yml	nolies.yml	add files	Jan 4, 2023
panelserverextension.py	panelserverextension.py	add files	Jan 4, 2023
postBuild	postBuild	add files	Jan 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NoLiES: The non-linear embeddings surveyor

Requirements

Interactive Demos

Working with your own data

Citation

Acknowledgements

About

Releases

Packages

Languages

License

Jan-To/nolies

Folders and files

Latest commit

History

Repository files navigation

NoLiES: The non-linear embeddings surveyor

Requirements

Interactive Demos

Working with your own data

Citation

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages