Skip to content

bioc/granulator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

granulator: Rapid benchmarking of methods for in silico deconvolution of bulk RNA-seq data

Lifecycle:stable license

Introduction

Heterogeneity in the cellular composition of bulk RNA-seq data may prevent or bias the results from differential expression analysis. To circumvent this limitation, in silico deconvolution infers cell type abundances by modelling gene expression levels as weighted sums of the cell-type specific expression profiles. Several computational methods have been developed to estimate cell type proportions from bulk transcriptomics data, and to account for cell type heterogeneity in the statistical analysis. The R package granulator provides a unified testing interface to rapidly run and benchmark multiple state-of-the-art deconvolution methods. We demonstrate its usage on published bulk RNA-seq data from peripheral blood mononuclear cells.

Methods

The methods currently implemented in granulator are reported in Table 1.

Name Function Method License Reference
ols stats::lsfit Ordinary least squares free (GPL-2)
nnls nnls::nnls Non-negative least squares free (GPL-2, GPL-3) reimplemented based on (Abbas et al. 2009)
qprogwc limSolve::lsei Quadratic programming with non-negativity and sum-to-one constraint free (GPL-2, GPL-3) reimplemented based on (Gong and Szustakowski 2013)
qprog limSolve::Solve Quadratic programming without constraints free (GPL-2, GPL-3)
rls MASS::rlm Re-weighted least squares free (GPL-2, GPL-3) reimplemented based on (Monaco et al. 2019)
svr e1071::svr Support vector regression free (GPL-2, GPL-3) reimplemented based on (Newman et al. 2015)
dtangle dtangle::dtangle Linear mixing model free (GPL-3) (Hunt et al. 2018)

Table 1 - Deconvolution methods. List of deconvolution algorithms available in granulator.

Installation

granulator can be installed from Bioconductor using:

if (!requireNamespace("BiocManager", quietly = TRUE))
    install.packages("BiocManager")
BiocManager::install("granulator")

The package can be loaded using:

library(granulator)

Data

The datasets included in the package comprises bulk RNA-seq gene expression data of peripheral blood mononuclear cells (PBMCs) from 12 healthy donors and bulk RNA-seq data of 29 isolated immune cell types from 4 healthy donors (Monaco et al. 2019), publicly available at NCBI database under GEO accession number GSE107011.

Vignettes

We show how to use granulator for the deconvolution of bulk RNA-seq data from peripheral blood mononuclear cells (PBMCs) into the individual cellular components and how to assess the quality of the obtained predictions in the following vignette: Deconvolution of bulk RNA-seq data with granulator.

References

Abbas, Alexander R., Kristen Wolslegel, Dhaya Seshasayee, Zora Modrusan, and Hilary F. Clark. 2009. “Deconvolution of blood microarray data identifies cellular activation patterns in systemic lupus erythematosus.” PLoS ONE 4 (7). https://doi.org/10.1371/journal.pone.0006098.

Gong, Ting, and Joseph D Szustakowski. 2013. “DeconRNASeq: a statistical framework for deconvolution of heterogeneous tissue samples based on mRNA-Seq data.” Bioinformatics 29 (8): 1083–85. https://doi.org/10.1093/bioinformatics/btt090.

Hunt, Gregory J, Saskia Freytag, Melanie Bahlo, and Johann A Gagnon-Bartsch. 2018. “Dtangle: Accurate and Robust Cell Type Deconvolution.” Bioinformatics 35 (12): 2093–99. https://doi.org/10.1093/bioinformatics/bty926.

Monaco, Gianni, Bernett Lee, Weili Xu, Seri Mustafah, You Yi Hwang, Christophe Carré, Nicolas Burdin, et al. 2019. “RNA-Seq Signatures Normalized by mRNA Abundance Allow Absolute Deconvolution of Human Immune Cell Types.” Cell Reports 26 (6): 1627–1640.e7. https://doi.org/10.1016/j.celrep.2019.01.041.

Newman, Aaron M., Chih Long Liu, Michael R. Green, Andrew J. Gentles, Weiguo Feng, Yue Xu, Chuong D. Hoang, Maximilian Diehn, and Ash A. Alizadeh. 2015. “Robust enumeration of cell subsets from tissue expression profiles.” Nature Methods 12 (5): 453–57. https://doi.org/10.1038/nmeth.3337.

About

This is a read-only mirror of the git repos at https://bioconductor.org

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published