SSMD: Semi-Supervised Mouse data Deconvolution

Description

Deconvolution of mouse transcriptomic data is challenged by the fact that mouse models carry various genetic and physiological perturbations, making it questionable to assume fixed cell types and cell type marker genes for different dataset scenarios. We developed a Semi-Supervised Mouse data Deconvolution (SSMD) method to study the mouse tissue microenvironment (TME). SSMD is featured by (i) a novel non-parametric method to discover data set specific cell type signature genes; (ii) a community detection approach for fixing cell types and their marker genes; (iii) a constrained matrix decomposition method to solve cell type relative proportions that is robust to diverse experimental platforms. In summary, SSMD addressed several key challenges in the deconvolution of mouse tissue data, including: (1) varied cell types and marker genes caused by highly divergent genotypic and phenotypic conditions of mouse experiment, (2) diverse experimental platforms of mouse transcriptomics data, (3) small sample size and limited training data source, and (4) capable to estimate the proportion of 35 cell types in blood, inflammatory, central nervous or hematopoietic systems. In silico and experimental validation of SSMD demonstrated its high sensitivity and accuracy in identifying (sub) cell types and predicting cell proportions comparing to state-of-the-arts methods. A user-friendly R package and a web server of SSMD are released via https://ssmd.ccbb.iupui.edu.

Installation

install.packages("devtools")
devtools::install_github("zy26/SSMD")

Usage

estimate.proportion <- function(data, lambda = lambda)

Arguments

data input gene expression matrix. MGI gene symbol should be as their row names
parameter threshold of mean correlation to define rank-1 co-expression module

Value

An object of class is also invisibly returned. This is a list containing the following components:

Stat_all statistics for all rank-1 co-expression module. CT: cell type; mean: mean correlation inside the module; Core_overlap_number: Overlap number with core marker list; Core_overlap_rate: overlap rate with core marker list; BCV_rank: bcv rank of the first base
module_keep modules with the high overlap number with core marker list for each cell type
proportion estimated proportion for each cell type

Examples

#load your own gene expression data
load('example_bulk.RData')
estimate.proportion(data, lambda = 0.8)

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
R		R
data		data
man		man
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
NAMESPACE		NAMESPACE
README.md		README.md
README.png		README.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SSMD: Semi-Supervised Mouse data Deconvolution

Description

Installation

Usage

Arguments

Value

Examples

About

Releases

Packages

Languages

zcslab/SSMD-1

Folders and files

Latest commit

History

Repository files navigation

SSMD: Semi-Supervised Mouse data Deconvolution

Description

Installation

Usage

Arguments

Value

Examples

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages