prot-rna-umap-clustering

Joint UMAP embedding and clustering of proteomic and transcriptomic data.

The notebook was created in JupyterLab on Windows with Python 3.8. The workflow depends on third-party libraries that can be installed via pip:
pip install scipy numpy pandas matplotlib seaborn scikit-learn umap-learn
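
To confirm that the environment matches the list above before running the notebook, the installed versions can be printed. This is a minimal sketch using only the standard library (`importlib.metadata`, available from Python 3.8); the helper name `installed_versions` is hypothetical and not part of the notebook.

```python
from importlib import metadata

# Distribution names as used in the pip command above
PACKAGES = [
    "scipy", "numpy", "pandas", "matplotlib",
    "seaborn", "scikit-learn", "umap-learn",
]


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


for name, version in installed_versions(PACKAGES).items():
    print(f"{name}: {version or 'not installed'}")
```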

The data is available in conjunction with the article by Hultqvist et al. The mass spectrometry-based proteomic files are deposited in the PRIDE archive, and the relative protein abundance table from the proteomic analysis can be found in this GitHub repository. The RPKM table from the transcriptomic experiment can be found on the Gene Expression Omnibus (GEO) project page.

The project comprised 10 samples of Escherichia coli cultures representing 5 different conditions. Mass spectrometry-based proteomic data and transcriptomic data were acquired for each sample. The aim of this data processing workflow is to cluster the genes in an unsupervised fashion based on their profiles of change across both data sets.

