MONTI: A multi-omics non-negative tensor decomposition framework for gene-level integrative analysis

https://www.frontiersin.org/articles/10.3389/fgene.2021.682841/abstract

Multi-omics data is frequently measured to characterize biological mechanisms underlying phenotypes. Complex relationships in multi-omics data, if mined, can lead to more accurate classification of patient samples according to the phenotypes.

MONTI (Multi-Omics Non-negative Tensor decomposition for Integrative analysis) is a tool that can be used to integrate and analyze large sets of multi-omics data. MONTI identifies gene regulatory multi-omics features specific to a group of samples that share a common biological trait.

Below is an illustration of the analysis workflow of MONTI.

The output of MONTI is a simple gene list with information of their associated subtypes, which can be used for further downstream analysis. For example, the Venn diagram below shows the genes that are found to be associated to colorectal cancer subtypes CMS1, CMS2, CMS3 and CMS4. These genes showed to be informative in separating the four subtypes as shown in the t-SNE plot.

Install MONTI

MONTI is developed in python3 and can be installed as below

python3 -m pip install monti

Documentation

The functions and objects used by MONTI are documented here.

Tutorial using colon cancer data (TCGA-COAD)

A brief tutorial for using MONTI can be found under the 'tutorial' directory. The associated multi-omics data are included.

If the above link does not work, the tutorial data is also available here.

Before starting the tutorial, the dataset should be downloaded. After download decompress data by

cd <download_path>
tar -xzvf tutorial_data_coad.tar.gz

The <download_path> should also be used as the tutorial directory, or you can simply move the data to another directory to be used for the tutorial.

The data includes three omics data, 1) gene expression (mRNA), 2) methylation level and 3) miRNA expression. They are raw data directly collected from the TCGA portal.

In the jupyter notebook shows an example of how to integrate multi-omics data in a gene-level manner and extract features that can classify the molecular subtypes of COAD.

The tutorial includes the below analysis procedures:

gene-level transformation
normalization
feature selection
classification accuracy measurement and
plotting of the results

Name		Name	Last commit message	Last commit date
Latest commit History 157 Commits
documentation		documentation
images		images
src		src
tutorial		tutorial
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
monti_tutorial_coad.ipynb		monti_tutorial_coad.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MONTI: A multi-omics non-negative tensor decomposition framework for gene-level integrative analysis

Install MONTI

Documentation

Tutorial using colon cancer data (TCGA-COAD)

About

Releases

Packages

Languages

License

inukj/MONTI

Folders and files

Latest commit

History

Repository files navigation

MONTI: A multi-omics non-negative tensor decomposition framework for gene-level integrative analysis

Install MONTI

Documentation

Tutorial using colon cancer data (TCGA-COAD)

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages