Skip to content
/ MONTI Public

MONTI is a tool for analyzing large multi-omics cancer cohort data in association with clinical featuers

License

Notifications You must be signed in to change notification settings

inukj/MONTI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MONTI: A multi-omics non-negative tensor decomposition framework for gene-level integrative analysis

https://www.frontiersin.org/articles/10.3389/fgene.2021.682841/abstract

Multi-omics data is frequently measured to characterize biological mechanisms underlying phenotypes. Complex relationships in multi-omics data, if mined, can lead to more accurate classification of patient samples according to the phenotypes.

MONTI (Multi-Omics Non-negative Tensor decomposition for Integrative analysis) is a tool that can be used to integrate and analyze large sets of multi-omics data. MONTI identifies gene regulatory multi-omics features specific to a group of samples that share a common biological trait.

Below is an illustration of the analysis workflow of MONTI. workflow

The output of MONTI is a simple gene list with information of their associated subtypes, which can be used for further downstream analysis. For example, the Venn diagram below shows the genes that are found to be associated to colorectal cancer subtypes CMS1, CMS2, CMS3 and CMS4. These genes showed to be informative in separating the four subtypes as shown in the t-SNE plot.

example output


Install MONTI

MONTI is developed in python3 and can be installed as below

python3 -m pip install monti

Documentation

The functions and objects used by MONTI are documented here.

Tutorial using colon cancer data (TCGA-COAD)

A brief tutorial for using MONTI can be found under the 'tutorial' directory. The associated multi-omics data are included.

If the above link does not work, the tutorial data is also available here.

Before starting the tutorial, the dataset should be downloaded. After download decompress data by

cd <download_path>
tar -xzvf tutorial_data_coad.tar.gz

The <download_path> should also be used as the tutorial directory, or you can simply move the data to another directory to be used for the tutorial.

The data includes three omics data, 1) gene expression (mRNA), 2) methylation level and 3) miRNA expression. They are raw data directly collected from the TCGA portal.

In the jupyter notebook shows an example of how to integrate multi-omics data in a gene-level manner and extract features that can classify the molecular subtypes of COAD.

The tutorial includes the below analysis procedures:

  • gene-level transformation
  • normalization
  • feature selection
  • classification accuracy measurement and
  • plotting of the results

About

MONTI is a tool for analyzing large multi-omics cancer cohort data in association with clinical featuers

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published